Forum: Ruby nokogiri 1.0.5 Released

Announcement (2017-05-07): www.ruby-forum.com is now read-only since I unfortunately do not have the time to support and maintain the forum any more. Please see rubyonrails.org/community and ruby-lang.org/en/community for other Rails- und Ruby-related community platforms.
Be30361bb0b0c495e3077db43ad84b56?d=identicon&s=25 Aaron Patterson (Guest)
on 2008-11-13 07:43
(Received via mailing list)
nokogiri version 1.0.5 has been released!

* <http://nokogiri.rubyforge.org/>
* <http://github.com/tenderlove/nokogiri/wikis>
* <http://github.com/tenderlove/nokogiri/tree/master>
* <http://rubyforge.org/mailman/listinfo/nokogiri-talk>
* <http://nokogiri.lighthouseapp.com/projects/19607-n...

Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser with CSS3 and
XPath search support.

Changes:

### 1.0.5

* Bugfixes

  * Added mailing list and ticket tracking information to the README.txt
  * Sets ENV['PATH'] on windows if it doesn't exist
  * Caching results of NodeSet#[] on Document

== FEATURES:

* XPath support for document searching
* CSS3 selector support for document searching
* XML/HTML builder
* Drop in replacement for Hpricot (though not bug for bug)

Nokogiri parses and searches XML/HTML very quickly, and also has
correctly implemented CSS3 selector support as well as XPath support.

Here is a speed test:

  * http://gist.github.com/22176

Nokogiri also features an Hpricot compatibility layer to help ease the
change
to using correct CSS and XPath.

== SUPPORT:

The Nokogiri mailing list is available here:

  * http://rubyforge.org/mailman/listinfo/nokogiri-talk

The bug tracker is available here:

  * http://nokogiri.lighthouseapp.com/projects/19607-n...

== SYNOPSIS:

  require 'nokogiri'
  require 'open-uri'

  doc =
Nokogiri::HTML(open('http://www.google.com/search?q=tenderlove'))

  ####
  # Search for nodes by css
  doc.css('h3.r a.l').each do |link|
    puts link.content
  end

  ####
  # Search for nodes by xpath
  doc.xpath('//h3/a[@class="l"]').each do |link|
    puts link.content
  end

  ####
  # Or mix and match.
  doc.search('h3.r a.l', '//h3/a[@class="l"]').each do |link|
    puts link.content
  end

* <http://nokogiri.rubyforge.org/>
* <http://github.com/tenderlove/nokogiri/wikis>
* <http://github.com/tenderlove/nokogiri/tree/master>
* <http://rubyforge.org/mailman/listinfo/nokogiri-talk>
* <http://nokogiri.lighthouseapp.com/projects/19607-n...
83c8cca11558f32207645ca270ae8475?d=identicon&s=25 Dingding Ye (Guest)
on 2008-11-13 09:48
(Received via mailing list)
hi.

Is nested search supported?

For example,

xml =
"<root><body><subbody>subbody1</subbody></body><body><subbody>subbody2</subbody></body></root>"
doc.xpath("//body").each do |body|
  puts body.at("subbody").content
end

I expected to see

subbody1
subbody2

but seems it always use the root element.

On Thu, Nov 13, 2008 at 2:40 PM, Aaron Patterson
<aaron@tenderlovemaking.com
Be30361bb0b0c495e3077db43ad84b56?d=identicon&s=25 Aaron Patterson (Guest)
on 2008-11-13 16:35
(Received via mailing list)
On Thu, Nov 13, 2008 at 05:44:50PM +0900, Dingding Ye wrote:
> end
>
> I expected to see
>
> subbody1
> subbody2
>
> but seems it always use the root element.

Yes.  Try using an xpath search as such:

  xml =
  "<root><body><subbody>subbody1</subbody></body><body><subbody>subbody2</subbody></body></root>"

  Nokogiri::XML(xml).xpath("//body").each do |body|
    puts body.at(".//subbody").content
  end
134ea397777886d6f0aa992672a50eaa?d=identicon&s=25 Mark Thomas (Guest)
on 2008-11-13 17:05
(Received via mailing list)
On Nov 13, 3:44 am, Dingding Ye <yedingd...@gmail.com> wrote:
> doc.xpath("//body").each do |body|
>   puts body.at("subbody").content
> end
>
> I expected to see
>
> subbody1
> subbody2
>
> but seems it always use the root element.

Let me try to clear up the confusion.

at() is from Hpricot, and it searches from the root node. It is
typically used as doc.at('element')

search() is also from Hpricot, and it searches from the current node.
This is perhaps the behavior you were expecting. An alias is '/', as
in doc/"html/body"

xpath() is like find() from libxml and you can nest those calls too.
Note that the Hpricot xpath(), which returns the path to the node, is
available in Nokogiri as path().

-- Mark.
6c202f0d87225ee60cf8f455431c17a7?d=identicon&s=25 Marcin Raczkowski (Guest)
on 2008-11-13 22:32
(Received via mailing list)
Great work!
This topic is locked and can not be replied to.