Nokogiri 1.0.7 Released

nokogiri version 1.0.7 has been released!

Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser.

Changes:

1.0.7

  • Bugfixes

    • Fixed memory leak when using Dike
    • SAX parser now parses IO streams
    • Comment nodes have their own class
    • Nokogiri() should delegate to Nokogiri.parse()
    • Prepending rather than appending to ENV[‘PATH’] on windows
    • Fixed a bug in complex CSS negation selectors

FEATURES:

  • XPath support for document searching
  • CSS3 selector support for document searching
  • XML/HTML builder
  • Drop in replacement for Hpricot (though not bug for bug)

Nokogiri parses and searches XML/HTML very quickly, and also has
correctly implemented CSS3 selector support as well as XPath support.

Here is a speed test:

Nokogiri also features an Hpricot compatibility layer to help ease the
change
to using correct CSS and XPath.

SUPPORT:

The Nokogiri mailing list is available here:

The bug tracker is available here:

SYNOPSIS:

require ‘nokogiri’
require ‘open-uri’

doc =
Nokogiri::HTML(open(‘tenderlove - Google Search’))

Search for nodes by css

doc.css(‘h3.r a.l’).each do |link|
puts link.content
end

Search for nodes by xpath

doc.xpath(‘//h3/a[@class=“l”]’).each do |link|
puts link.content
end

Or mix and match.

doc.search(‘h3.r a.l’, ‘//h3/a[@class=“l”]’).each do |link|
puts link.content
end

On Wed, Dec 3, 2008 at 1:49 AM, Aaron P.
[email protected] wrote:

  • SAX parser now parses IO streams

Awesome.

I am so looking forward to using this.


Avdi

Home: http://avdi.org
Developer Blog: Avdi Grimm, Code Cleric
Twitter: http://twitter.com/avdi
Journal: http://avdi.livejournal.com