I am proud to present a small project I have been working on for a while:
Picolena, a document search engine written in Rails
(http://picolena.devjavu.com/).
It obviously uses Ferret for indexing and searching, and adds some plain
text extractors in order to index OpenOffice.org, PDF and MS Office
documents (and some others as well).
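
To give an idea of what "plain text extractor" means here, below is a minimal
sketch in plain Ruby. It is not Picolena's actual code: EXTRACTORS,
register_extractor and extract_text are hypothetical names made up for this
example, and the HTML handling is deliberately naive. A real extractor for PDF
or office formats would presumably call an external converter instead.

```ruby
# Hypothetical sketch: none of these names come from Picolena itself.

# Maps a lowercase file extension to a block that returns plain text.
EXTRACTORS = {}

# Registers one block for several extensions at once.
def register_extractor(*extensions, &block)
  extensions.each { |ext| EXTRACTORS[ext.downcase] = block }
end

# Plain-text files need no conversion.
register_extractor("txt", "text") { |path| File.read(path) }

# Naive HTML handling (assumption: good enough for a demo only):
# replace tags with spaces, then squeeze the whitespace.
register_extractor("html", "htm") do |path|
  File.read(path).gsub(/<[^>]*>/, " ").gsub(/\s+/, " ").strip
end

# Returns the extracted text, or nil when no extractor is registered
# for the file's extension.
def extract_text(path)
  ext = File.extname(path).delete_prefix(".").downcase
  extractor = EXTRACTORS[ext]
  extractor && extractor.call(path)
end
```

The string returned by extract_text(path) is what would then be handed to the
indexer. Files with an unknown extension yield nil and can simply be skipped.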
Everything is packed in a gem (gem install picolena), with a few rake
tasks, a multi-threaded indexer, a language guesser, a Rails front end
and some specs to make sure everything works.
I would love to hear some feedback from acts_as_ferret developers or users.
My project is in no way meant to compete with AAF: we have
different goals (ActiveRecord indexing plugin vs. stand-alone Rails app
for document indexing), but still a lot in common.
I dare say Picolena would be useful in a lot of companies (as a
Google Mini alternative), and it has already been running in production for
a few months without a hitch. This has been made possible thanks to
Ferret’s incredible speed. Kudos to the devs!