5

在 Ruby 上的 Mechanize 中,我必须为我来到的每个新页面分配一个新变量。例如:

  page2 = page1.link_with(:text => "Continue").click
  page3 = page2.link_with(:text => "About").click
  ...etc

有没有办法在没有变量保存每个页面状态的情况下运行 Mechanize?像

  my_only_page.link_with(:text => "Continue").click!
  my_only_page.link_with(:text => "About").click!
4

1 回答 1

10

我不知道我是否正确理解了您的问题,但如果是动态循环浏览大量页面并处理它们,您可以这样做:

    require 'mechanize'

    url = "http://example.com"
    agent = Mechanize.new
    page = agent.get(url) #Get the starting page

    loop do
      # What you want to do on the page - ex. extract something...
      item = page.parser.css('.some_item').text
      item.save

      if link = page.link_with(:text => "Continue") # As long as there is still a nextpage link...
        page = link.click
      else # If no link left, then break out of loop
        break
      end
    end
于 2011-07-19T20:27:22.537 回答