Re: Premature end of regular expression with non-ascii chara

unknown · January 31, 2006, 7:57pm

Dear Nick,

I’m glad things finally worked out for you.
Actually, I things like ‘\352’ do not to split any further (at least on
my
system (cygwin 6.7.0.0-6) on Windows XP),
this seems to be some encoding for “Ãª” etc. that is Windows-specific.
In particular, it is not equivalent to a string composed of '',
‘3’,‘5’,‘2’
(you would have to double the '' in the regexp if you were searching
for a string ‘\352’ in a text).

But the line

splitted_text=text.split(/(?=.)/)

should produce an Array with the individual letters in the string.
That involves a concept about regexps called zero-width positive
lookahead (see _http://www.regular-expressions.info/lookaround.html_
(Regex Tutorial - Lookahead and Lookbehind Zero-Length Assertions) ).

Please let us all know if you still encounter problems.

Best regards,

Axel

unknown · January 31, 2006, 9:07pm

On Jan 31, 2006, at 1:56 PM, [email protected] wrote:

(you would have to double the ‘’ in the regexp if you were searching

Please let us all know if you still encounter problems.

Best regards,

Axel

Zero width positive lookahead is a little overkill, split(//) works
just as well, AFAIK