How do I use XPath in Nokogiri?
Your XPath is correct and you seem to have answered your own question's first part (almost):
doc.xpath('//table/tbody[@id="threadbits_forum_251"]/tr')
"the code above will get me any table table's tr, anywhere, that has a tbody child with the attribute id equal to threadbits_forum_251"
//
means the following element can appear anywhere in the document.
/tr
at the end means, get the tr
node of the matching element.
You dont need to extract each attribute one by one. Just get the entire node containing all four attributes in Nokogiri, and get the attributes using:
theNode['href']
theNode['src']
Where theNode
is your Nokogiri Node object.
Edit:
Sorry I haven't used these libraries, but I think the XPath evaluation and parsing is being done by Mechanize. So here's how you would get the entire element and its attributes in one go.
doc.xpath("td[3]/div[1]/a").each do |anchor|
puts anchor['href']
puts anchor['src']
...
end
Seems you need to read a XPath Tutorial
Your //table/tbody[@id="threadbits_forum_251"]/tr
expression means:
//
- Anywhere in your XML documenttable/tbody
- take a table element with a tbody child[@id="threadbits_forum_251"]
- where id attribute are equals to "threadbits_forum_251"tr
- and take itstr
elements
So, basically, you need to know:
- attributes begins with
@
- conditions go inside
[]
brackets
If I correcly understood that API, you can go with doc.xpath("td[3]/div[1]/a")["href"]
, or td[3]/div[1]/a/@href
if there is just one <a>
element.