I am not sure lookbehind is the proper means here. After all you want
to replace a sequence of dashes before a particular sequence
(according to your description above). So you would rather use
lookforward, wouldn’t you?
irb(main):008:0> s = sequence.dup
=> “-------abcde-gh----”
irb(main):009:0> while s.sub! /^(X*)-/, ‘\1X’; end
=> nil
irb(main):010:0> s
=> “XXXXXXXabcde-gh----”
But I’d rather use Pena’s solution or another block form
I believe the reason is that with lookbehind the regular expression
needs start matching at the same location (the beginning of the
sequence) multiple times. Even though lookbehind does not consume
characters I believe RX implementations prohibit matching at the same
location over and over again - partly for efficiency reasons but also to
avoid endless loops. The lookforward solution with replacement at the
end of the sequence moves the start position of the match one character
forward for every match. That’s how I explain myself why the straight
forward lookbehind does not work.
There’s another reason, why lookbehind won’t work: the docs state
“Subexp of look-behind must be fixed character length.”
At the moment I cannot think of a solution with lookbehind that would
avoid these issues because all lookbehinds must start matching at the
beginning of the sequence in order to fulfill your requirement that the
initial sequence must be replaced.
The problem with look ahead/forward is, that I have to make sure the
sequence starts with “-”. Otherwise something like this could happen:
Well, this won’t happen if the sequence “abcde” is known to appear only
after an initial sequence of dashes. (Note the difference between your
lookforward solution and mine: you created a character class while I
just matched the plain sequence). But of course it’s not the same as
“replace initial portion of dashes”.
I can recommend “Mastering regular expressions” if you want to dive
deeper into this: it’s pretty well written and does only have as much
theory of regular languages as necessary but explains very well
differences between regular expression implementations and also
considers efficiency aspects.
Thanks for the info. Just downloaded 1.9 to check it out. But I still
don’t comprehend why look behind is not working like I expected. If I
write the equivalent replacing characters at the end of the string using
look ahead it works just fine:
I am not sure lookbehind is the proper means here. After all you want
to replace a sequence of dashes before a particular sequence
(according to your description above). So you would rather use
lookforward, wouldn’t you?