Hello, who can tell me how i make a search engine that satisfy following
requiments:
- support for UTF-8
- search by part of word, example: dem* <- returns demo, demon,
demor…
- UTF-8 search ignoring case sensitive symbols, for example when word
start with ‘Ы’ and other starts with ‘Ñ‹’, and
i searching by criteria ‘Ñ‹’ - I get only last word.
- easy configurable and reliable
Thank you all
This is not a trivial question, or you could just search Google for
it. Some parts may already be built, but I suspect you’ll have to do
a fair amount of invention on your own.
Given that you’re using Rails, I assume you’re searching elements
that are in a database. You might look at any of the number of full
text search components that are available for rails (acts_as_solr,
acts_as_ferret). I don’t know the extent to which any of these
handle UTF-8, word stemming, etc., so you might have to implement
something yourself.
-faisal
Given that you’re using Rails, I assume you’re searching elements
that are in a database. You might look at any of the number of full
text search components that are available for rails (acts_as_solr,
acts_as_ferret). I don’t know the extent to which any of these
handle UTF-8, word stemming, etc., so you might have to implement
something yourself.
Unfortunately ferret can’t provide search for UTF-8 symbols(for english
symbols it does).
One plugin - acts_as_fulltext_indexed - provide search by UTF-8 but i
get a problems with upper/downcase symbols.
On Sep 1, 3:08 am, “Igor K.” [email protected] wrote:
One plugin - acts_as_fulltext_indexed - provide search by UTF-8 but i
get a problems with upper/downcase symbols.
look at iconv:
t = Iconv.new(‘ASCII//TRANSLIT’, ‘utf-8’).iconv(inputstring)
or Ferret’s LowerCaseFilter
http://rubyforge.org/pipermail/ferret-talk/2007-August/003887.html