python - Python：从列表中获取 URL 并访问它

Question

我正在尝试从列表（~1500 个条目）中获取一个 URL，并使用 python 的斜纹库一个一个地访问它们。我使用斜纹布的原因是因为我喜欢它，而且我以后可能需要执行基本的表单填写。

我遇到的问题是声明循环的内容。我确信这实际上很容易解决，但目前我还没有想到这个解决方案。

            from twill.commands import *
            CONTAINER = open('urls.txt') #opening file
            CONTAINER_CONTENTS = CONTAINER.readlines() #reading
            CONTAINER_CONTENTS = map(lambda s: s.strip, CONTAINER_CONTENTS) #this is just to remove the ^N (newline) that was appended to each URL

            for i in CONTAINER_CONTENTS:
                  <educate me>
                  ..
                  go(url)
                  etc.

提前致谢。

score 0 · Accepted Answer

from twill.commands import *

with open('urls.txt') as inf:
    urls = (line.strip() for line in inf)
    for url in urls:
        go(url)
        # now do something with the page

python - Python：从列表中获取 URL 并访问它

1 回答 1

Related

Reference