Forum: Ferret Chinese search support

Announcement (2017-05-07): www.ruby-forum.com is now read-only since I unfortunately do not have the time to support and maintain the forum any more. Please see rubyonrails.org/community and ruby-lang.org/en/community for other Rails- und Ruby-related community platforms.
B6a210fb79b3733db184fb7933d7363a?d=identicon&s=25 Jerry Liu (Guest)
on 2006-02-23 00:32
I need decide on if our site will go with Java or Ruby on Rails. The
major factor is that does Farret support Lucene's ChineseAnalyzer or
CJKAnalyzer or not.

Can anyboby shine some lights on Farret's Chinese search support?

Really appreciate.
B5e329ffa0cc78efbfc7ae2d084c149f?d=identicon&s=25 David Balmain (Guest)
on 2006-02-23 07:55
(Received via mailing list)
Hi Jerry,
Basically you'll have to write an analyzer that matches Chinese tokens
(words). If you can write a regular expression in Ruby that matches
Chinese tokens then it's very simple to write an Analyzer for Ferret.
I haven't looked at teh CJKAnalyzer in Lucene but I can't imagine it
would be too hard to port to Ruby.

Cheers,
Dave
4d6a47158a7c8a032e5f6a4da8976d7d?d=identicon&s=25 Erik Hatcher (Guest)
on 2006-02-23 09:20
(Received via mailing list)
There is nothing fancy about the CJKAnalyzer.... it chunks characters
into pairs.  So the phrase 你好� would be tokenized into two
tokens [你好] [好�].

	Erik
This topic is locked and can not be replied to.