我正在尝试使用 BeautifulSoup 从 sameip.org 中抓取域列表,我的代码如下:
import urllib, urllib2, cookielib, re, io, sys
from bs4 import BeautifulSoup
cj = cookielib.CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))
resp = opener.open('http://sameip.org/ip/141.101.125.122').read()
soup = BeautifulSoup(resp)
for tr in soup.find_all('tr'):
tds = tr.find_all('td')
for x in tds:
print x
工作蝙蝠抓取更多数据,我只需要抓取域名,例如:
tcjayfund.org
fjminc.com
amandabillyrock.com
fjmclinics.com
我怎样才能做到这一点?