python - 在 Python 2.7 上使用 xpath 提取 href 值

Question

我有这个 HTML：

<a href="some content">Click here</a>

如何在 Python 2.7 上提取some content和click me使用？xpath

到目前为止，我有以下内容（仅从 href 结果中提取“一些内容”）：

import lxml.etree as LE
import requests

r = requests.get("http://localhost")
html = r.text
root = LH.fromstring(html)
print root.xpath('//a/@href')

score 5 · Accepted Answer

您只能使用 XPath 选择一个或另一个，但您可以选择所有<a>元素，然后选择href属性和文本内容，如下所示：

for elt in root.xpath('//a'):
    print(elt.attrib['href'], elt.text_content())

1 回答 1