0

JS/Html 的初学者,我试图通过 Pyppeteer0.2.5(Python3.6.9/Chromium 87.0.4280.66)点击链接后使用以下代码获取页面内容:

import asyncio, pyppeteer, time
from pyppeteer import launch

async def main():
    browser = await launch(executablePath='/usr/bin/chromium-browser', headless=False)
    page = await browser.newPage()
    await page.goto('https://en.wikipedia.org')
    time.sleep(10)
    cont = await page.content()
    print(cont)
asyncio.get_event_loop().run_until_complete(main())
  1. 如果我在 期间不单击或不执行任何操作time.sleep(10),则 10 秒后将按预期打印维基百科主页面内容。
  2. 如果我(手动)在 期间单击任何链接time.sleep(10),新页面将按预期打开,但 10 秒后我收到 Python 错误(见下文)。我在不同的网页上以及通过 Pyppeteer 而不是手动点击时也发生了同样的情况。page.content()单击链接后失败是否有原因?
    Traceback (most recent call last):
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/execution_context.py", line 105, in evaluateHandle
        'userGesture': True,
    pyppeteer.errors.NetworkError: Protocol error (Runtime.callFunctionOn): Cannot find context with specified id
    
    During handling of the above exception, another exception occurred:
    
    Traceback (most recent call last):
      File "To_delete/pythonProject/testtt.py", line 11, in <module>
        asyncio.get_event_loop().run_until_complete(main())
      File "/usr/lib/python3.6/asyncio/base_events.py", line 484, in run_until_complete
        return future.result()
      File "To_delete/pythonProject/testtt.py", line 9, in main
        cont = await page.content()
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/page.py", line 803, in content
        return await frame.content()
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/frame_manager.py", line 393, in content
        '''.strip())
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/frame_manager.py", line 309, in evaluate
        pageFunction, *args, force_expr=force_expr)
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/execution_context.py", line 54, in evaluate
        pageFunction, *args, force_expr=force_expr)
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/execution_context.py", line 108, in evaluateHandle
        _rewriteError(e)
      File "/home/martin/.local/lib/python3.6/site-packages/pyppeteer/execution_context.py", line 237, in _rewriteError
        raise type(error)(msg)
    pyppeteer.errors.NetworkError: Execution context was destroyed, most likely because of a navigation.
4

1 回答 1

0

事实证明,我需要在await page.waitForNavigation()上面添加await page.content()才能使其工作。

于 2021-02-11T17:21:45.147 回答