
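I am trying to write a Python script that fills in a form on a website and submits it. After submitting, I want to search the resulting web page for a keyword.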

More specifically, the form is: https://booking.elal.co.il/newBooking/changeOrder.jsp?LANG=EN&RESSYSTEMID=1

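When I fill in the form manually in a browser and press the "Continue" button, I first get a kind of "processing page", and only then the web page that I want to search for the keyword.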

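I tried using the script from here: http://stockrt.github.io/p/handling-html-forms-with-python-mechanize-and-BeautifulSoup/, but for some reason, after I submit the form, print br.response().geturl() gives me the URL of the "processing page" rather than the URL of the page I want to search.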

My code:

import mechanize
import cookielib
from BeautifulSoup import BeautifulSoup
import html2text

# Browser
br = mechanize.Browser()

# Cookie Jar
cj = cookielib.LWPCookieJar()
br.set_cookiejar(cj)

# Browser options
br.set_handle_equiv(True)
br.set_handle_gzip(True)
br.set_handle_redirect(True)
br.set_handle_referer(True)
br.set_handle_robots(False)

# Follow refresh 0 but do not hang on refresh > 0
br.set_handle_refresh(mechanize._http.HTTPRefreshProcessor(), max_time=1)

# User-Agent (this is cheating, ok?)
br.addheaders = [('User-agent', 'Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.0.1) Gecko/2008071615 Fedora/3.0.1-1.fc9 Firefox/3.0.1')]

# The site we will navigate into, handling its session
br.open('https://booking.elal.co.il/newBooking/changeOrder.jsp?LANG=EN&RESSYSTEMID=1')

# Select the first (index zero) form
br.select_form(nr=0)

# User credentials
br.form['REC_LOC'] = '...'
br.form['DIRECT_RETRIEVE_LASTNAME'] = '...'

# Login
br.submit()

# Try to print the resulting web page
html = br.response().read()
print html2text.html2text(html)

Is it possible to do what I want, and if so, how?
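
For reference, here is a minimal sketch of the two steps I think are missing. It rests on assumptions I have not verified: that the "processing page" moves on through a <meta http-equiv="refresh"> tag (if it uses JavaScript instead, this will not help), and, as far as I understand, that max_time=1 in my set_handle_refresh call makes mechanize skip any refresh with a delay longer than one second. The helper names find_refresh_target and contains_keyword and the page names in the usage note are hypothetical. It is written for Python 3 and uses only the standard library, so it is separate from the mechanize code above.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin


class MetaRefreshFinder(HTMLParser):
    """Records the target URL of the first <meta http-equiv="refresh"> tag."""

    def __init__(self):
        super().__init__()
        self.target = None

    def handle_starttag(self, tag, attrs):
        # Only the first meta refresh tag is of interest.
        if tag != "meta" or self.target is not None:
            return
        attrs = dict(attrs)
        if (attrs.get("http-equiv") or "").lower() != "refresh":
            return
        # The content attribute usually looks like "5; url=/next/page".
        content = attrs.get("content") or ""
        match = re.search(r"url\s*=\s*['\"]?([^'\"]+)", content, re.I)
        if match:
            self.target = match.group(1).strip()


def find_refresh_target(html, base_url):
    """Return the absolute URL the page refreshes to, or None if there is none."""
    finder = MetaRefreshFinder()
    finder.feed(html)
    if finder.target is None:
        return None
    return urljoin(base_url, finder.target)


def contains_keyword(html, keyword):
    """Case-insensitive substring search over the raw HTML."""
    return keyword.lower() in html.lower()
```

The idea would be to call find_refresh_target on the HTML of the processing page (with its URL as base_url, for example a hypothetical .../newBooking/processing.jsp), open the returned URL in the same session so the cookies are kept, and then run contains_keyword on the final page. Raising max_time in set_handle_refresh might achieve the same thing without any extra code, but I have not confirmed that.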
