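I am trying the script from this question: How can I convert this XPath expression to BeautifulSoup? However, I get an error. Can anyone help me understand why I get this error message: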

spider = self.crawler.spiders.create(spname, **opts.spargs)
File "C:\Python27\lib\site-packages\scrapy-0.16.5-py2.7.egg\scrapy\spidermanager.py", line 43, in create
  raise KeyError("Spider not found: %s" % spider_name) 
KeyError: 'Spider not found: app'

I have pyparsing installed.

Here is the code:

from pyparsing import makeHTMLTags, withAttribute, SkipTo 
import urllib

# get the HTML from your URL 
url = "http://www.whitecase.com/Attorneys/List.aspx?LastName=&FirstName="
page = urllib.urlopen(url) 
html = page.read() 
page.close()

# define opening and closing tag expressions for <td> and <a> tags
# (makeHTMLTags also comprehends tag variations, including attributes, 
# upper/lower case, etc.) 
tdStart,tdEnd = makeHTMLTags("td") 
aStart,aEnd = makeHTMLTags("a")

# only interested in tdStarts if they have "class=altRow" attribute 
tdStart.setParseAction(withAttribute(("class","altRow")))

# compose total matching pattern (add trailing tdStart to filter out 
# extraneous <td> matches) 
patt = tdStart + aStart("a") + SkipTo(aEnd)("text") + aEnd + tdEnd + tdStart

# scan input HTML source for matching refs, and print out the text and 
# href values 
for ref,s,e in patt.scanString(html):
    print ref.text, ref.a.href
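
To cross-check what the pattern above is meant to extract, without network access, Scrapy, or third-party packages, here is a minimal sketch. It rests on several assumptions: it targets Python 3 rather than the Python 2.7 used above, it swaps pyparsing for the standard library's html.parser, and the names AltRowLinkParser, extract_alt_row_links and SAMPLE_HTML, along with the sample markup itself, are made up for illustration. It also does not reproduce the trailing tdStart filter from the original pattern.

```python
from html.parser import HTMLParser


class AltRowLinkParser(HTMLParser):
    """Collect (text, href) for each <a href=...> inside <td class="altRow">."""

    def __init__(self):
        super().__init__()
        self.in_alt_td = False  # True while inside a <td class="altRow">
        self.href = None        # href of the link currently being read
        self.chunks = []        # text pieces of the current link
        self.links = []         # collected (text, href) pairs

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "td":
            # Exact match on the class attribute, as in the pyparsing filter.
            self.in_alt_td = attrs.get("class") == "altRow"
        elif tag == "a" and self.in_alt_td:
            # Links without an href are skipped (href stays None).
            self.href = attrs.get("href")
            self.chunks = []

    def handle_data(self, data):
        if self.href is not None:
            self.chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self.href is not None:
            self.links.append(("".join(self.chunks).strip(), self.href))
            self.href = None
        elif tag == "td":
            self.in_alt_td = False


def extract_alt_row_links(html):
    """Return a list of (link text, href) pairs found in altRow cells."""
    parser = AltRowLinkParser()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical sample markup, not taken from the real page.
SAMPLE_HTML = """
<table>
  <tr>
    <td class="altRow"><a href="/Attorneys/a-example">Example, A.</a></td>
    <td class="altRow">Partner</td>
  </tr>
  <tr>
    <td class="other"><a href="/ignored">Ignored</a></td>
  </tr>
</table>
"""

if __name__ == "__main__":
    for text, href in extract_alt_row_links(SAMPLE_HTML):
        print(text, href)
```

Saved as a standalone file and started with the plain Python interpreter, this prints one line per link found in an altRow cell.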

Thanks in advance! Floriano
