python - How do I convert output to a list so I can count its items?

Question

I wrote a script that parses a web page and gets the number of links ("a" tags) on it:

import urllib
import lxml.html
connection = urllib.urlopen('http://test.com')
dom =  lxml.html.fromstring(connection.read())
for link in dom.xpath('//a/@href'):
    print link

Output of the script:

./01.html
./52.html
./801.html
http://www.blablabla.com/1.html
#top

How can I convert this to a list so I can count the links? I tried link.split(), but it gives me:

['./01.html']
['./52.html']
['./801.html']
['http://www.blablabla.com/1.html']
['#top']

But I want to get:

[./01.html, ./52.html, ./801.html, http://www.blablabla.com/1.html, #top]

Thanks!

score 7 · Accepted Answer

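link.split() tries to split a single link string, so each call returns its own one-element list. Instead, you need to work with the object that represents all of the links. In your case that is dom.xpath('//a/@href').

So this should help: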

links = list(dom.xpath('//a/@href'))

Then use the built-in len function to get the number of links:

print len(links)
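To make the difference concrete, here is a minimal sketch of why split() inside the loop yields separate one-element lists, while collecting the hrefs once gives a single list to count. It uses Python 3 syntax (the question's code is Python 2), and the hard-coded `hrefs` list is an assumption standing in for the result of dom.xpath('//a/@href'), so the example runs without lxml or a network connection.

```python
# Stand-in for dom.xpath('//a/@href'); values copied from the question's output.
hrefs = ['./01.html', './52.html', './801.html',
         'http://www.blablabla.com/1.html', '#top']

# split() operates on one string at a time, so each call returns its own list.
per_link = [link.split() for link in hrefs]

# Collect every href into a single list, then count with len().
links = list(hrefs)
link_count = len(links)

print(per_link[0])   # a one-element list for the first link only
print(links)         # all hrefs in one list
print(link_count)    # number of links
```

With the real page, `hrefs` would simply be replaced by the dom.xpath(...) call from the question.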

score 3 · Answer

list(dom.xpath('//a/@href'))

This takes the iterable that dom.xpath returns and puts each of its items into a list.
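As a rough illustration of what list() does with an arbitrary iterable, the sketch below feeds it a generator. `iter_hrefs` is a hypothetical helper invented for this example, not part of lxml; it only plays the role of "something that yields href strings one at a time". Python 3 syntax is assumed.

```python
def iter_hrefs():
    """Hypothetical generator standing in for any iterable of href strings."""
    yield './01.html'
    yield './52.html'
    yield '#top'

# list() walks the iterable once and stores every item it produces.
links = list(iter_hrefs())

# A list has a length, which a bare generator does not.
count = len(links)

print(links)
print(count)
```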
