nokogiri version 1.0.5 has been released!
- http://nokogiri.rubyforge.org/
- http://github.com/tenderlove/nokogiri/wikis
- http://github.com/tenderlove/nokogiri/tree/master
- http://rubyforge.org/mailman/listinfo/nokogiri-talk
- http://nokogiri.lighthouseapp.com/projects/19607-nokogiri/overview
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser with CSS3 and
XPath search support.
Changes:
1.0.5
-
Bugfixes
- Added mailing list and ticket tracking information to the README.txt
- Sets ENV[‘PATH’] on windows if it doesn’t exist
- Caching results of NodeSet#[] on Document
== FEATURES:
- XPath support for document searching
- CSS3 selector support for document searching
- XML/HTML builder
- Drop in replacement for Hpricot (though not bug for bug)
Nokogiri parses and searches XML/HTML very quickly, and also has
correctly implemented CSS3 selector support as well as XPath support.
Here is a speed test:
Nokogiri also features an Hpricot compatibility layer to help ease the
change
to using correct CSS and XPath.
== SUPPORT:
The Nokogiri mailing list is available here:
The bug tracker is available here:
== SYNOPSIS:
require ‘nokogiri’
require ‘open-uri’
doc =
Nokogiri::HTML(open(‘http://www.google.com/search?q=tenderlove’))
Search for nodes by css
doc.css(‘h3.r a.l’).each do |link|
puts link.content
end
Search for nodes by xpath
doc.xpath(’//h3/a[@class=“l”]’).each do |link|
puts link.content
end
Or mix and match.
doc.search(‘h3.r a.l’, ‘//h3/a[@class=“l”]’).each do |link|
puts link.content
end