Search engine question, again

Hello, can anyone tell me how to build a search engine that satisfies the
following requirements:

  1. support for UTF-8
  2. search by part of a word, for example: dem* <- returns demo, demon,
    demor…
  3. case-insensitive UTF-8 search. At the moment, when one word starts
    with ‘Ы’ and another starts with ‘ы’, a search for ‘ы’ returns only
    the second word.
  4. easily configurable and reliable

Thank you all

This is not a trivial question, or you could just search Google for
it. Some parts may already be built, but I suspect you’ll have to do
a fair amount of invention on your own.

Given that you’re using Rails, I assume you’re searching elements
that are in a database. You might look at any of a number of
full-text search components that are available for Rails (acts_as_solr,
acts_as_ferret). I don’t know the extent to which any of these
handle UTF-8, word stemming, etc., so you might have to implement
something yourself.

-faisal

Unfortunately, Ferret can’t search for UTF-8 (non-English) symbols,
although it works for English ones.

One plugin, acts_as_fulltext_indexed, does provide UTF-8 search, but I
run into problems with uppercase and lowercase symbols.

On Sep 1, 3:08 am, “Igor K.” wrote:

One plugin, acts_as_fulltext_indexed, does provide UTF-8 search, but I
run into problems with uppercase and lowercase symbols.

Look at Iconv:

require 'iconv'
t = Iconv.new('ASCII//TRANSLIT', 'utf-8').iconv(inputstring)

Or look at Ferret’s LowerCaseFilter:

http://rubyforge.org/pipermail/ferret-talk/2007-August/003887.html
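
For the upper/lowercase part of the question, the usual idea is to fold both the indexed words and the query to one case before comparing them. Below is a minimal sketch of that idea combined with the dem* style of prefix search. It is not what Ferret or acts_as_fulltext_indexed do internally. It assumes Ruby 2.4 or newer, where String#downcase applies Unicode case mapping (older Rubies only fold ASCII), and the helper names fold, build_index and prefix_search, as well as the sample words, are made up for illustration.

```ruby
# Assumption: Ruby 2.4+, where String#downcase handles non-ASCII letters.
# fold, build_index and prefix_search are hypothetical helper names.

# Normalize a string so comparisons ignore case and composed/decomposed forms.
def fold(text)
  text.unicode_normalize(:nfc).downcase
end

# Map each folded word to the original spellings that produced it.
def build_index(words)
  words.group_by { |word| fold(word) }
end

# Return the original words matching a query: "dem*" is a prefix search,
# anything without a trailing "*" is an exact (case-insensitive) match.
def prefix_search(index, query)
  folded = fold(query)
  if folded.end_with?("*")
    prefix = folded.chomp("*")
    index.select { |key, _| key.start_with?(prefix) }.values.flatten
  else
    index.fetch(folded, [])
  end
end

# Sample data, chosen only to show mixed case in Latin and Cyrillic.
index = build_index(%w[demo Demon Ыгыатта ыал table])

prefix_search(index, "dem*") # matches "demo" and "Demon"
prefix_search(index, "ы*")   # matches both Cyrillic words, whatever their case
```

This scans every key, so it only suits small word lists. With a database or a real index, the same folding would be applied once when the text is indexed and again to each query.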