I’ve just released Ferret 0.10.7. It is still in beta but we are
getting closer and closer to a 1.0 release. The main changes in this
release are better handling of fields by the QueryParser. You can now
give the QueryParser a list of fields that are tokenized so that only
those fields will be analyzed in the QueryParser. This means that you
can search untokenized fields for terms with spaces in them. For
results = index.search('title:Shawshank\ Redemption')
If :title is an untokenized field, this query will be parsed as a
single TermQuery. Previously the search would have failed.
I’ve also changed the StandardTokenizer behaviour so that it will
handle email addresses that start with numbers. This also means that
200km will be parsed as a single term instead of seperate terms.
Whether this is a good thing or not is a matter of taste. You can
easily use a filter to split these terms up if you need to.
There are a lot of other bug fixes as well so Ferret should be a
little more stable.