English is a spin-off of Facets, collecting together all the English
language libraries together under a single unified project. The
English project intends to provided a general purpose set of English-
oriented text manipulation and natural language processing Ruby
libraries in one convenient package.
The library thus far consists of censor.rb, double_metaphone.rb,
dresner.rb, inflect.rb, jumble.rb, levenshtein.rb, metaphone.rb,
numerals.rb, patterns.rb, porter_stemming.rb, roman.rb, similarity.rb,
soundex.rb and textfilter.rb.
== Release Notes
This version incorporates some new libraries gathered from various
sources and cleans up the library as a whole. It should be a pretty
usable little library now.
== Usage
Please see the API documentation.
== Install
% gem install english
or download the .zip file and install manually:
% unzip english-x.y.z.zip
% cd english-x.y.z
% sudo task/setup
For Windows users the last line needs to be ‘ruby task/setup’.
== Authors
Trans
Paul B.
Tim F.
Derek Lewis
George M.
Michael N.
== Changes
=== 0.2.0 // 2008-02-06
Removed Language module. All libraries (except extensions) are now
in toplevel English module.
Incorporated a couple libraries from the Text project and other
sources.
== Copying
English, Copyright (c) 2007 Tiger Ops
Some libraries within English are copyrighted to respective authors,
or are derivatives of such acknowledged work.
English is distributed under the terms of the Ruby License. Which is a
dual GPL license.
English is a spin-off of Facets, collecting together all the English
language libraries together under a single unified project. The
English project intends to provided a general purpose set of English-
oriented text manipulation and natural language processing Ruby
libraries in one convenient package.
Just out of curiosity, did you know about the Linguistics library
before you released this? “English” has some stuff the ‘en’ module of
Linguistics doesn’t have (and vice-versa), but I’d have happily added
new functions to it if you had suggested it.
If you did know about it beforehand, was there something about it you
found objectionable? Just professional curiosity. I’d like to make
Linguistics usable for people other than myself, and if you felt
compelled to write something else to avoid having to use it, I’d take
that as a sign that it needs changing.
Linguistics doesn’t have (and vice-versa), but I’d have happily added
new functions to it if you had suggested it.
If you did know about it beforehand, was there something about it you
found objectionable? Just professional curiosity. I’d like to make
Linguistics usable for people other than myself, and if you felt
compelled to write something else to avoid having to use it, I’d take
that as a sign that it needs changing.
I had done a little searching and knew about Lingustics. My main usage
was #plural and #singular, and I already had code that used these. So
I didn’t want to have to go back and add “.en” in all those cases. Of
course, I now know one can use the :proxy option to get around this,
but at the the time I did not know that. But beyond that, I suppose it
just a matter of design “levels”, if you will. Lingustics is a much
bigger idea, not just for English but also ultimately having in mind
support for other languages. I think that’s great, but my feeling was
I just wanted something simpler, focused on English and some commonly
useful language processing.
But I wouldn’t mind talking to you more off list and see what we might
be able to do mutaully.
T.
This forum is not affiliated to the Ruby language, Ruby on Rails framework, nor any Ruby applications discussed here.