ANN: English 0.2.0

= English

http://english.rubyforge.org

English is a spin-off of Facets, collecting together all the English
language libraries together under a single unified project. The
English project intends to provided a general purpose set of English-
oriented text manipulation and natural language processing Ruby
libraries in one convenient package.

The library thus far consists of censor.rb, double_metaphone.rb,
dresner.rb, inflect.rb, jumble.rb, levenshtein.rb, metaphone.rb,
numerals.rb, patterns.rb, porter_stemming.rb, roman.rb, similarity.rb,
soundex.rb and textfilter.rb.

== Release Notes

This version incorporates some new libraries gathered from various
sources and cleans up the library as a whole. It should be a pretty
usable little library now.

== Usage

Please see the API documentation.

== Install

% gem install english

or download the .zip file and install manually:

% unzip english-x.y.z.zip
% cd english-x.y.z
% sudo task/setup

For Windows users the last line needs to be ‘ruby task/setup’.

== Authors

  • Trans
  • Paul B.
  • Tim F.
  • Derek Lewis
  • George M.
  • Michael N.

== Changes

=== 0.2.0 // 2008-02-06

  • Removed Language module. All libraries (except extensions) are now
    in toplevel English module.
  • Incorporated a couple libraries from the Text project and other
    sources.

== Copying

English, Copyright (c) 2007 Tiger Ops

Some libraries within English are copyrighted to respective authors,
or are derivatives of such acknowledged work.

English is distributed under the terms of the Ruby License. Which is a
dual GPL license.

On Feb 6, 2008, at 3:10 PM, Trans wrote:

English is a spin-off of Facets, collecting together all the English
language libraries together under a single unified project. The
English project intends to provided a general purpose set of English-
oriented text manipulation and natural language processing Ruby
libraries in one convenient package.

Just out of curiosity, did you know about the Linguistics library
before you released this? “English” has some stuff the ‘en’ module of
Linguistics doesn’t have (and vice-versa), but I’d have happily added
new functions to it if you had suggested it.

If you did know about it beforehand, was there something about it you
found objectionable? Just professional curiosity. I’d like to make
Linguistics usable for people other than myself, and if you felt
compelled to write something else to avoid having to use it, I’d take
that as a sign that it needs changing.

Thanks,

On Apr 13, 3:00 pm, Michael G. [email protected] wrote:

Linguistics doesn’t have (and vice-versa), but I’d have happily added
new functions to it if you had suggested it.

If you did know about it beforehand, was there something about it you
found objectionable? Just professional curiosity. I’d like to make
Linguistics usable for people other than myself, and if you felt
compelled to write something else to avoid having to use it, I’d take
that as a sign that it needs changing.

I had done a little searching and knew about Lingustics. My main usage
was #plural and #singular, and I already had code that used these. So
I didn’t want to have to go back and add “.en” in all those cases. Of
course, I now know one can use the :proxy option to get around this,
but at the the time I did not know that. But beyond that, I suppose it
just a matter of design “levels”, if you will. Lingustics is a much
bigger idea, not just for English but also ultimately having in mind
support for other languages. I think that’s great, but my feeling was
I just wanted something simpler, focused on English and some commonly
useful language processing.

But I wouldn’t mind talking to you more off list and see what we might
be able to do mutaully.

T.