Hi Everyone,
I just noticed that Japanese Ruby Home オブジェクト指向スクリプト言語 Ruby
announces availability of Ruby-1.8.4 preview 2 and in general contains
much more news and announcements that English Language Ruby Home Ruby Programming Language that has not been updated since early
October.
I hope it should be possible to keep both web sites in sync so that
those of us who do not speak Japanese can benefit from up-to-date
information regarding Ruby.
Hi Everyone,
I just noticed that Japanese Ruby Home オブジェクト指向スクリプト言語 Ruby
announces availability of Ruby-1.8.4 preview 2 and in general contains
much more news and announcements that English Language Ruby Home Ruby Programming Language that has not been updated since early October.
I hope it should be possible to keep both web sites in sync so that
those of us who do not speak Japanese can benefit from up-to-date
information regarding Ruby.
Of course it is possible, but it currently depends on native
English-speaking site editors to a) know of the Japanese additions, and
b) have the time to go add the new item to the English version, assuming
c) the English-speaking editor can figure out the Japanese.
It’s mainly a matter of time and energy, and yes, the announcement of
the 1.8.4 previews should have been put up on the English version site,
but sometimes thing slip by. It’s a volunteer world.
It’s there now, though, thanks to David A. Black.
James
(Now subscribing to the RDF feed for the Japanese version of the Ruby
home page …)
though a translator, and emits a new feed, to which I can subscribe.
Hmm …
I’ve got a framework running at “http://jacob.fugal.net/era/” (ERA =
English Ruby Annoucements). The only portion lacking is the
translation. I couldn’t find a web service that does japanese to
english and I didn’t want to do screenscraping. If you can point me at
an appropriate webservice or write your own bit of screen scraping
code, I’ll plug it in and you’ve got your feed!
I tried integrating that, but the 150 char limit meant that most – if
not all – of the bodies couldn’t be translated. Having translated
titles would be a slight improvement, but not enough. Thanks for the
direction though!
On Wednesday 14 December 2005 15:21, Wilson B. wrote:
code, I’ll plug it in and you’ve got your feed!
Does this lame little snippet help? Heh. Unfortunately, babelfish is
limited to 150 words.
Babelfish will also let you plug in a site and just translate that. So
why
not just plug in the site and then scrape the actual page with just
translated text?
What I could use is a tool that takes the Japanese RDF feed, runs it
though a translator, and emits a new feed, to which I can subscribe.
Hmm …
I’ve got a framework running at “http://jacob.fugal.net/era/” (ERA =
English Ruby Annoucements). The only portion lacking is the
translation. I couldn’t find a web service that does japanese to
english and I didn’t want to do screenscraping. If you can point me at
an appropriate webservice or write your own bit of screen scraping
code, I’ll plug it in and you’ve got your feed!
Does this lame little snippet help? Heh. Unfortunately, babelfish is
limited to 150 words.
Also, オブジェクト指向スクリプト言語 Ruby uses EUC encoding, so if you’re grabbing it
directly, rather than in Unicode form, you’ll want $KCODE=“E”
I tried integrating that, but the 150 char limit meant that most – if
not all – of the bodies couldn’t be translated. Having translated
titles would be a slight improvement, but not enough. Thanks for the
direction though!
You could split the strings into 150-char sections and loop over
them sending a new request for each. A Google translation might
not have such limits (quality of translation notwithstanding).
Babelfish will also let you plug in a site and just translate that. So why
not just plug in the site and then scrape the actual page with just
translated text?
FWIW, it appears that Google also supports pulling a URL and doing
Japanese->English translation (marked as Beta):
Babelfish will also let you plug in a site and just translate that. So why
not just plug in the site and then scrape the actual page with just
translated text?
Ah. Turns out this this is indeed fairly trivial with WWW::Mechanize.
Quite straightforward to pull the translation and generate a feed.
Of course it is possible, but it currently depends on native
English-speaking site editors to a) know of the Japanese additions, and
b) have the time to go add the new item to the English version, assuming
c) the English-speaking editor can figure out the Japanese.
How about creating a pool of people who can translate new items as
they pop up on the Japanese site? I’m willing to volunteer; there
just needs some kind of system to notify the translators that there’s
something to be dealt with (which could be as simple as a mailing list
to which each new item on the Japanese site is posted).
Wow, my innocent post generated 17 responses:-) It seems that the
issue of human language interoperability is quite important for Ruby
users. Unfortunately, most of discussion was along the lines of most
efficient English to Japanese translation.
I would rather touch a slightly different topic.
Ruby is already a truly international phenomenon with tens of thousand
users and scores of developers in all countries and continents
speaking dozens of various languages. English is the language that
majority of Ruby users and developers can understand even though
English might not be their mother tongue (like in may case). It is a
matter of fact that English (whether we like it or not) is a defacto
standard in ALL software projects with diverse user base.
On this basis I would suggest to make Ruby English home the MAIN
Ruby portal with Japanese and other languages translating from it. As
of now it is otherwise around – International (aka English) web site
is clearly a second class citizen. I have nothing against Japanese
language or Japanese culture. My rational for such suggestion is
strictly pragmatic.
–Leo–
P.S. It is not my intention to generate any flame wars. I apologize if
I inadvertently hurt somebody’s feelings.
I tried integrating that, but the 150 char limit meant that most – if
not all – of the bodies couldn’t be translated. Having translated
titles would be a slight improvement, but not enough. Thanks for the
direction though!
You could split the strings into 150-char sections and loop over
them sending a new request for each. A Google translation might
not have such limits (quality of translation notwithstanding).
Well, truth be told, I couldn’t even get the snippet working for me,
regardless of the 150ish-char limit[1]. The oddity is this: I can echo
out the URI I’m going to request, past that URI into my browser and
get a page with a decent translation. However, if I take that same URI
and run it through wget (or Ruby’s open-uri) I get a different result.
And the difference isn’t just in encoding, there are differences in
the returned HTML!
If I fetch that same URI via wget, the HTML returned has “Webrick DoS
^@ <86> ^@ Ã?± ^@ SECT. ^@ ^@ ^@ ^@ ^@ <84> ^@ ^@” instead (looking at
it in a non-unicode vim over putty). Even stranger, the page in my
browser has a hidden input with name=“q” and value=“About Webrick DoS
vulnerability”. The HTML returned by wget has a hidden input in the
same position (surrounding HTML identical) but with name=“kls” and
value=“0”.
Any one have any ideas what’s going on?
Jacob F.
[1] The limit imposed by bablefish is actually 150 words. However,
as a GET request, URI raises an exception about the URL being too long
far before 150 words are actually reached.