I’m using Ferret on a Swedish website and I get some unexpected
behaviour on searches containing the swedish charchters Ã¥Ã¤Ã¶.
An exampel, if I index a string “VarfÃ¶r fungerar det inte” (“Why doesnt
it work” in swedish) and search for “fÃ¶r” I’ll get one (1) match. The
expected behaviour would be no matches since ‘fÃ¶r’ is part of the word
And if I do a search for “varfÃ¶*” it returns no matches. Expecting one.
My guess is that it has something to do with the UTF-8 encoding but I
can’t seem to figure out exactly what is is…
I’m using the StandardAnalyzer b.t.w.