ruby - 我将如何从锚点列表中搜索“/aems/dic/list”？

Question

我有以下代码，它是 html 的一部分：

<td><a href="http://youtube.com">YouTube</a></td>
<td><a data-category="news" href=http://kathack.com/party/aems/dic/list">Reddit</a></td>
<td><a href="http://kathack.com/party/aems">Kathack</a></td>
<td><a data-category="news" href="http://www.nytimes.com">New York Times</a></td>

现在我将如何搜索/aems/dic/list并获取完整的 url 存储？

score 1 · Accepted Answer

假设您有一个 Mechanize::Page 对象page：

page.at('a[href*="/aems/dic/list"]')[:href]
#=> "http://kathack.com/party/aems/dic/list"

更新

对于更长的示例：

require 'mechanize'
agent = Mechanize.new
page = agent.get 'http://www.example.com/'
page.at('a[href*="/aems/dic/list"]')[:href]
#=> "http://kathack.com/party/aems/dic/list"

score 1 · Accepted Answer

所以，有了nokogiri，像这样：

fragment = Nokogiri::HTML::DocumentFragment.parse text
fragment.css("a").each do |link|
  href = link['href']
  return href if href =~ /\/aems\/dic\/list/
end

ruby - 我将如何从锚点列表中搜索“/aems/dic/list”？

2 回答 2

Related

Reference