Is it possible with Ferret to find the location of the matches in a
document? For example imagine I have 100 documents and I search with
the phrase “bob~0.5” and that returns 3 matching documents. How can I
then find all locations in a specific document where it matched
“bob~0.5”. What I need is something like an array that contains the
start index and length for each match within a given document. Does
this exact? Should I break up my matching document into subdocuments
then search on that?
Also for my application I will be searching for fairly large pieces
of text ( many sentences long ) and doing fuzzy matching. I suppose
what I am doing is very similar to trying to find matching phrases
within an essay to catch people plagiarizing (that’s not what I’m
doing at all, but it’s close enough in terms of methods).
Are both of these possible with Ferret? Is there another technology I
should look at for doing this? I will have a relatively small index
size ( somewhere between 100 and 500 ) and so I’m not really
concerned with speed issues.
Thanks so much for any help!