apie
July 19, 2007, 8:33am
1
I’m a total Hpricot newbie, so this is only out of curiosity…
Is the following expected behaviour?
I found this while struggling through a nasty xml parsing bug.
hstr = Hpricot('c ').to_s # or to_html
returns => 'c '
I have found that using to_original_html returns 'c ' as expected, but I was wondering why you would want to wrap the other contents when it obviously isn’t correct. Is this handy behaviour for scraping?
regards
apie
July 19, 2007, 2:30pm
2
On Jul 19, 2007, at 1:32 AM, Ivor P. wrote:
expected but I was wondering why you would want to wrap the other contents when it obviously isn’t correct? Is this handy behaviour for scraping?
regards
Hmm… if you want modern markup, it is correct.
Link’s text here
You’d do well to avoid in favor of
apie
July 19, 2007, 3:34pm
3
On 7/19/07, Ivor P. wrote:
but I was wondering why you would want to wrap the other contents when
it obviously isn’t correct? Is this handy behaviour for scraping?
Yeah, I never understood that either. But it’s easily fixed:
h = Hpricot(' c ', :xml => true)
h.to_s # => "c "
I think it’s because when you don’t set :xml to true, Hpricot
acts as an HTML processor and therefore tries to correct bad HTML…
and somehow ends up with the empty element wrapping the other contents.
Cheers,
Peter C.
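For anyone who wants to see the XML-side behaviour without Hpricot installed, here is a rough sketch using REXML as a stand-in. REXML is my substitution, not something from the thread, and the element names `root` and `empty` are made up for illustration. It only shows what a strict XML parse does with a self-closed element followed by text. It does not reproduce Hpricot's HTML-mode fix-up.

```ruby
require 'rexml/document'

# Parse a snippet as XML and report what ends up inside the empty element.
# `root` and `empty` are hypothetical names chosen for this sketch.
def describe_empty_element(xml)
  doc = REXML::Document.new(xml)
  empty = doc.root.elements['empty']
  {
    children_of_empty: empty.children.size, # 0 means nothing was wrapped
    text_after: doc.root.text               # first text node directly under root
  }
end

result = describe_empty_element('<root><empty/>c </root>')
puts result.inspect
```

If the parser treats the input as XML, the self-closed element should stay childless and the trailing text should remain its sibling, which matches what :xml => true is reported to do above.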
apie
July 19, 2007, 8:28pm
4
Thanks Peter, that is what I wanted to hear! I knew there was some reason like that.