Re: Metadata 0.2

On 9/13/07, Konrad M. [email protected] wrote:

Any chance this could be expanded to add FLAC and OGG support?

Thanks!

Konrad M. [email protected] http://konrad.sobertillnoon.com/

Done! Along with a preliminary list of supported formats and some
cleanup.
Thanks for the request!

tarball: http://dark.fhtr.org/repos/metadata/metadata-0.2.tar.gz
git: http://dark.fhtr.org/repos/metadata/

Changelog:

README:
* added flacinfo, wmainfo, mp4info and ogginfo to the list of
dependencies
* prelim list of supported formats
lib/metadata/extract.rb:
* .ps.gz support
* list archive contents
* remove null fields from output
* support for flac and ogg
* untested support for wma and m4a
* Audio.Bitrate now in kbps to match shared-filemetadata-spec

Description

This package Metadata' comes with a library called metadata’ and
a small program called `mdh’.

The library probes files for their metadata (e.g. jpeg dimensions
and camera make, mp3 artist, pdf word count) and returns the metadata
as a Hash.

Mdh can print out file metadata as YAML and package the metadata
with the file.

This package has many dependencies since there is no single universal
metadata header format that all files use. Blame resource forks,
filename
extensions, bags of bytes and mimetypes.

Usage

print out metadata header

mdh -p myfile.jpg

create myfile.jpg.mdh, which consists of metadata header +

myfile.jpg
mdh myfile.jpg

print out metadata header from mdh file

mdh -e -p myfile.jpg.mdh

strip out metadata header from mdh file and save it to myfile.jpg

mdh -e myfile.jpg.mdh

irb> Metadata.extract(‘myfile.jpg’)
irb> Metadata.extract_text(‘myfile.jpg’)
irb> Pathname.new(“myfile.jpg”).metadata

List of supported formats

Audio:
Successfully tested with:
mp3, flac, ogg, wav
Should also work:
wma, m4a

Video:
What you manage to make mplayer play, which can be just about
anything.
Then again, missing title and author data, etc. (do videos even have
those?)
Successfully tested with:
wmv, mov, divx, xvid, flv, ogm, mpg

Images:
Should handle pretty much anything (apart from XCF and ORF.)
Successfully tested with:
jpeg, png, gif, nef, dng, crw, pef, psd

Documents:
Successfully tested with:
pdf, ppt, odp, sxi, ps, ps.gz, html, txt
Should work:
- OpenOffice docs work to some degree (personally, I’m using unoconv
to
convert OO docs to temp PDFs for the text & dimensions extraction,
so
those bits of data are missing.)
- MS Office docs to some degree (ppt at least, doc and xls should
work too,
dimensions missing due to the above temp PDF -thing.)

Others:
Whatever extract spits out on the five or six bits of metadata I’m
using
from it. Archive contents at least.

Requirements

  • Ruby 1.8

  • Tons of metadata extraction programs and libs,
    list of gems:
    flacinfo-rb
    wmainfo-rb
    MP4info
    list of debian packages:
    dcraw
    libimlib2-ruby
    extract
    libimage-exiftool-perl
    poppler-utils
    mplayer
    html2text
    imagemagick
    unhtml
    pstotext
    antiword
    catdoc
    shared-mime-info
    vorbis-tools

  • You do want to install the latest versions of dcraw and
    shared-mime-info to be able to handle camera raw images.
    http://cybercom.net/~dcoffin/dcraw/
    shared-mime-info

  • Python + chardet library
    http://chardet.feedparser.org/

Install

De-compress archive and enter its top directory.
Then type:

($ su)
# ruby setup.rb

These simple step installs this program under the default
location of Ruby libraries. You can also install files into
your favorite directory by supplying setup.rb some options.
Try “ruby setup.rb --help”.

License

Ruby’s

Ilmari H. <ilmari.heikkinen gmail com>

Hi Ilmari!

quoth the Ilmari H.:

* added flacinfo, wmainfo, mp4info and ogginfo to the list of
  dependencies

Cool, you were able to use a couple of my libraries (be afraid
people…).

Video:
What you manage to make mplayer play, which can be just about anything.
Then again, missing title and author data, etc. (do videos even have
those?) Successfully tested with:
wmv, mov, divx, xvid, flv, ogm, mpg

Just wanted to mention that despite the name, wmainfo will parse
anything
wrapped in an ASF audio/video container format[0], so, you could use it
to
parse wmv movies as well if your user didn’t have mplayer installed.

[0] Advanced Systems Format - Wikipedia

Ilmari H. <ilmari.heikkinen gmail com>
http://fhtr.blogspot.com

Thanks for the code, and have a good one,
-d

Ilmari H. wrote:

On 9/13/07, Konrad M. [email protected] wrote:

Any chance this could be expanded to add FLAC and OGG support?

Thanks!

Konrad M. [email protected] http://konrad.sobertillnoon.com/

Done! Along with a preliminary list of supported formats and some
cleanup.
Thanks for the request!

tarball: http://dark.fhtr.org/repos/metadata/metadata-0.2.tar.gz
git: http://dark.fhtr.org/repos/metadata/

Hi, these links are dead. Does this package still exist anywhere? Is it
available as a gem?
Thanks,
Dan

Description

This package Metadata' comes with a library called metadata’ and
a small program called `mdh’.

The library probes files for their metadata (e.g. jpeg dimensions
and camera make, mp3 artist, pdf word count) and returns the metadata
as a Hash.

Mdh can print out file metadata as YAML and package the metadata
with the file.

This package has many dependencies since there is no single universal
metadata header format that all files use. Blame resource forks,
filename
extensions, bags of bytes and mimetypes.

John W Higgins wrote:

On Thu, Sep 30, 2010 at 4:15 PM, Dan T.
[email protected]wrote:

Ilmari H. wrote:

On 9/13/07, Konrad M. [email protected] wrote:

Hi, these links are dead. Does this package still exist anywhere? Is it
available as a gem?
Thanks,
Dan

You really are going to take the time to open up a 3+ year old thread on
a
mailing list (for the second time by the way) as opposed to typing “ruby
metadata” into google?

John

My post did not show up (on ruby-forum.com) the first time.
All that google shows are these same dead links.

On Thu, Sep 30, 2010 at 4:15 PM, Dan T.
[email protected]wrote:

Ilmari H. wrote:

On 9/13/07, Konrad M. [email protected] wrote:

Hi, these links are dead. Does this package still exist anywhere? Is it
available as a gem?
Thanks,
Dan

You really are going to take the time to open up a 3+ year old thread on
a
mailing list (for the second time by the way) as opposed to typing “ruby
metadata” into google?

John