I’m having trouble understanding Hpricot (thanks to an abominable lack
of documentation). I’m trying to parse HTML of the following nature:
<p>This is one line of text<br />
This is another line of text<br />
It keeps going on like this</p>
<p>Until a new paragraph is started<br />
Otherwise, it’s just more of the same</p>
I know it looks simple, but frankly I have no clue how to parse it
with Hpricot. In particular, I don’t know how to single out the lines of
text between the “br” tags. This is important because I need to know
where the line breaks are in the text, as well as where new paragraphs start.
Does anyone know how to do this with Hpricot?
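
One possible approach, as a rough sketch rather than a definitive answer: grab each paragraph with Hpricot, take its inner HTML, and split that string wherever a "br" tag appears. This assumes the paragraphs are wrapped in "p" tags and the line breaks are written as `<br>`, `<br/>` or `<br />`. The method names `split_on_br` and `paragraphs_with_lines` are made up for illustration; `Hpricot()`, `/` and `inner_html` are the only Hpricot calls used.

```ruby
# Split one paragraph's inner HTML into lines at each <br>, <br/> or <br />.
# Blank lines are dropped; any other inline tags are left in the strings as-is.
def split_on_br(inner_html)
  inner_html.split(/<br\s*\/?>/i)
            .map { |line| line.strip }
            .reject { |line| line.empty? }
end

# Return an array of paragraphs, each of which is an array of lines.
# Assumption: paragraphs are <p> elements and line breaks are <br> tags.
def paragraphs_with_lines(html)
  require 'hpricot' # loaded lazily so split_on_br can be used on its own
  doc = Hpricot(html)
  (doc / 'p').map { |para| split_on_br(para.inner_html) }
end
```

Calling `paragraphs_with_lines(html)` on markup like the sample should give one array per paragraph, so the paragraph boundaries and the line breaks are both preserved in the nesting. Entities such as `&amp;` are not decoded by this sketch, and if the lines can contain other inline tags you would need to strip or handle those separately.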