Ferret+Lucene Index

We use Nutch and Lucene for our heavy-duty text analysis jobs, but I'm
trying to use Ferret to do some experiments. I understood that Ferret
used the same index format as Lucene, but I cannot look into a Lucene
index with Ferret and cannot read a Ferret index with Luke (the Lucene
index browser). Am I doing something wrong, or have the formats diverged?

On Jan 24, 2008, at 9:25 PM, Pat Ferrel wrote:

We use Nutch and Lucene for our heavy-duty text analysis jobs, but
I'm trying to use Ferret to do some experiments. I understood that
Ferret used the same index format as Lucene, but I cannot look into a
Lucene index with Ferret and cannot read a Ferret index with Luke
(the Lucene index browser). Am I doing something wrong, or have the
formats diverged?

The formats have diverged.

-ryan

Can I use an earlier version of ferret maybe? Does anyone know when the
formats diverged?

On Fri, Jan 25, 2008 at 08:12:24AM -0800, Pat Ferrel wrote:

Can I use an earlier version of ferret maybe? Does anyone know when the
formats diverged?

Yeah, versions 0.3.x should definitely work, and possibly also 0.9.x.

Afair there always were some substantial problems in terms of UTF8
character handling, so you might run into problems even with the older
versions.

Cheers,
Jens


Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk


Jens Krämer
webit! Gesellschaft für neue Medien mbH
Schnorrstraße 76 | 01069 Dresden
Telefon +49 351 46766-0 | Telefax +49 351 46766-66
[email protected] | www.webit.de

Amtsgericht Dresden | HRB 15422
GF Sven Haubold

I am a little new to Ruby so forgive if there is an obvious answer. But one
way to solve my problem might be to use jruby and manipulate the lucene
index through java.

I have never done the ruby->java->ruby thing but it seems it would be nicer
if there were a ruby class interface that hides some of the complexity of
the raw java one. Has anyone done this or know of such a ruby interface or
example code?

On Sun, Feb 03, 2008 at 01:01:10PM -0800, Pat Ferrel wrote:

I am a little new to Ruby so forgive if there is an obvious answer. But one
way to solve my problem might be to use jruby and manipulate the lucene
index through java.

I have never done the ruby->java->ruby thing but it seems it would be nicer
if there were a ruby class interface that hides some of the complexity of
the raw java one. Has anyone done this or know of such a ruby interface or
example code?

Yeah, a Ruby-friendly wrapper around Lucene, maybe even compatible with
Ferret’s API, would for sure be a nice thing.

But afaik there’s no such thing yet.

Cheers,
Jens


Jens Krämer
http://www.jkraemer.net/ - Blog
http://www.omdb.org/ - The new free film database
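
To make the wrapper idea from this thread concrete, here is a minimal
sketch of what a Ruby-friendly facade might look like. It is an
illustration only, not an existing library: SimpleIndex and MemoryBackend
are hypothetical names, and the in-memory backend is a stand-in so the
sketch runs on any Ruby. Under JRuby, a backend exposing the same two
methods (add and search) could presumably delegate to Lucene's Java
classes instead. The << and search_each methods only imitate the shape
of Ferret's index API; the "field:term" query handling is a toy, not
Lucene or Ferret query syntax.

```ruby
# Hypothetical sketch: a small Ferret-flavoured facade over a pluggable backend.
# None of these class names come from Ferret or Lucene.

# Stand-in backend so the sketch runs on any Ruby. Under JRuby, an object with
# the same two methods (add, search) could wrap Lucene's Java classes instead.
class MemoryBackend
  def initialize
    @docs = []
  end

  # Store a document (a Hash of field => text) and return its id.
  def add(fields)
    @docs << fields
    @docs.size - 1
  end

  # Return [id, score] pairs for documents whose field contains the term.
  # The score is simply the number of case-insensitive occurrences.
  def search(field, term)
    needle = term.downcase
    hits = []
    @docs.each_with_index do |doc, id|
      count = doc.fetch(field, "").to_s.downcase.scan(needle).size
      hits << [id, count] if count > 0
    end
    # Highest score first; ties broken by document id.
    hits.sort_by { |id, score| [-score, id] }
  end
end

class SimpleIndex
  def initialize(backend = MemoryBackend.new)
    @backend = backend
  end

  # Imitates the shape of Ferret's `index << {...}`; returns self for chaining.
  def <<(fields)
    @backend.add(fields)
    self
  end

  # Accepts a toy "field:term" query and yields id and score for each hit,
  # imitating the shape of Ferret's search_each. Returns the number of hits.
  def search_each(query)
    field, term = query.split(":", 2)
    raise ArgumentError, "expected field:term" if term.nil? || term.empty?
    hits = @backend.search(field.to_sym, term)
    hits.each { |id, score| yield id, score }
    hits.size
  end
end

index = SimpleIndex.new
index << { title: "Ferret", content: "Ferret is a Ruby search library" }
index << { title: "Lucene", content: "Lucene is a Java search library; Nutch builds on Lucene" }

index.search_each("content:lucene") do |id, score|
  puts "doc #{id} scored #{score}"
end
```

Keeping the backend behind two small methods means the calling code would
not need to change if the in-memory stand-in were later swapped for a
JRuby-based Lucene backend.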
