Strange thing.
Minimal example :
Indexing a three accented words text like "aprèl après aprèt" and asking
for one of the three words, then two cases appear :
- plain indexing : all three give a hit,
- indexing with FULL_FRENCH_STOP_WORDS, only one ("après") gives a hit.
I made extensive checks : no clear pattern appears for what type of
accented words work and what do not : f.i. "Hélène" does not work,
"Jérôme" works...
By the way, the list of French stop words appearing in stopwords.c is
strange, as some of them do not exist in the French language (flexed
participles...).
on 2009-03-16 13:09
Please log in before posting. Registration is free and takes only a minute.
Existing account
(Switch to SSL-encrypted connection)
NEW: Do you have a Google/GoogleMail or Yahoo account? No registration required!
Log in with Google account | Log in with Yahoo account
Log in with Google account | Log in with Yahoo account
No account? Register here.