indexing - urllib2: Indexing several keywords

Question

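I have been looking around for a while, but I can't seem to find a guide on using the index command in urllib2. Basically, the function finds the first word that is, or contains, the characters passed to index, and then returns its position.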

import urllib2

page_content = urllib2.urlopen(URL).read()  # Gets the website
i1 = page_content.index(keyword)  # Finds the first occurrence of keyword in the page

From here, i1 can be used for different things in urllib2, but that is not relevant here.

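My problem is that .index does not return every word that matches the keyword, only the first one. That is fine if you know there is only one on the website, but I would like to collect all of them in a list so that I can then look up and use their positions.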

score 0 · Accepted Answer

Use re.finditer:

import re
import urllib2

page_content = urllib2.urlopen(URL).read()  # Gets the website
# Start index of every match; wrap keyword in re.escape() to match it literally
idall = [m.start() for m in re.finditer(keyword, page_content)]

print idall
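
Note that .index is a method of Python strings, not of urllib2, so it works on any text once the page has been read. If the keyword is plain text rather than a regular expression, a loop over str.find is one possible alternative. The following is only a minimal sketch: the helper name find_all_positions and the sample string are made up for illustration, and it collects non-overlapping matches only.

```python
def find_all_positions(text, keyword):
    """Return the start index of every non-overlapping occurrence of keyword."""
    positions = []
    if not keyword:
        # An empty keyword would match everywhere; treat it as "no matches".
        return positions
    start = text.find(keyword)
    while start != -1:
        positions.append(start)
        # Resume the search just past the match that was found.
        start = text.find(keyword, start + len(keyword))
    return positions


# Hypothetical sample text standing in for the downloaded page content.
sample = "spam and eggs, spam and ham, spam"
positions = find_all_positions(sample, "spam")
print(positions)  # [0, 15, 29]
```

Unlike .index, str.find returns -1 instead of raising ValueError when nothing is found, which is what lets the loop end cleanly.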
