swanand deodhar wrote:
I am focussing on building an ETL and Data mining framework in Ruby.
That is quite a tall order, IMHO. I would suggest having a look at Weka.
I used it last quarter in my graduate Data Mining class. It is quite
powerful and, more importantly, was written collaboratively by active
researchers in the field. It also has a big following in the academic
community.
The best part is that I didn’t have to put my programmer’s hat on every
time I wanted to play around with a statistical algorithm in the
textbook because, usually, it was already implemented in Weka.
Although Weka is written in Java, I think you could build a Ruby
interface atop it with JRuby. However, I am not familiar with the rules
of GSoC, so I don’t know whether a light abstraction layer over existing
software would be allowed.
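
To make the "light abstraction layer" idea a bit more concrete, here is a rough, untested-against-Weka sketch of what I have in mind. The names WekaFacade, j48_facade and the WEKA_JAR environment variable are made up for illustration. The only Weka pieces it assumes are the J48 class in weka.classifiers.trees and the buildClassifier / classifyInstance methods of Weka classifiers. The facade itself is plain Ruby and simply delegates to whatever backend object it is given.

```ruby
# Thin Ruby facade over a Weka-style classifier.
# Sketch only: WekaFacade and j48_facade are hypothetical names.
class WekaFacade
  # backend: any object responding to buildClassifier and classifyInstance,
  # e.g. a weka.classifiers.trees.J48 instance when running under JRuby.
  def initialize(backend)
    @backend = backend
  end

  # Train on a dataset (a weka.core.Instances object under JRuby).
  # Returns self so calls can be chained.
  def train(dataset)
    @backend.buildClassifier(dataset)
    self
  end

  # Returns whatever the backend returns; Weka returns the predicted
  # class index as a double.
  def predict(instance)
    @backend.classifyInstance(instance)
  end
end

# Only meaningful under JRuby with weka.jar available. Reading the jar
# location from WEKA_JAR is an assumption of this sketch.
def j48_facade
  require "java"
  require ENV.fetch("WEKA_JAR")
  WekaFacade.new(Java::WekaClassifiersTrees::J48.new)
end
```

Under JRuby, something like j48_facade.train(data).predict(instance) should then work, with data and instance loaded through Weka's own loaders. Keeping the Java-specific part in one small method means the rest of the framework only ever sees ordinary Ruby objects.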