Ghost.py 应该运行 JS:http: //jeanphix.me/Ghost.py/
require.js 通过 http 获取,但据我所知,它没有运行,因为“js/main.built”永远不会被获取,并且没有任何指定的 JS 文件被加载。这一切都可以在真正的浏览器中完美运行。
In [51]: ghost = Ghost(wait_timeout=60)
In [52]: page, resources = ghost.open(url)
In [53]: [r.url for r in resources]
Out[53]:
[PyQt4.QtCore.QUrl(u'https://example.com/#consume/283e6571bcecf34143cbd60f35e0464b'),
PyQt4.QtCore.QUrl(u'https://example.com/css/ui.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/colorpicker.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/selectize.default.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/datepicker.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/site.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/jquery.fileupload-ui.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/style.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/bootstrap.css'),
PyQt4.QtCore.QUrl(u'https://example.com/css/redactor.css'),
PyQt4.QtCore.QUrl(u'https://example.com/js/lib/require.js')]
In [54]: ghost.con
ghost.confirm ghost.content
In [54]: ghost.content
Out[54]: u'<!DOCTYPE html><html lang="en"><head>\n <meta charset="utf-8">\n <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">\n <title>Ving</title>\n\n <link rel="stylesheet" href="css/ui.css">\n <link rel="stylesheet" href="css/bootstrap.css">\n <link rel="stylesheet" href="css/colorpicker.css">\n <link rel="stylesheet" href="css/redactor.css">\n <link rel="stylesheet" href="css/site.css">\n <link rel="stylesheet" href="css/selectize.default.css">\n <!-- My Bug fixes / overrides to work with real app -->\n <link rel="stylesheet" href="css/style.css">\n <link rel="stylesheet" href="css/datepicker.css">\n <!-- Theme for uploader -->\n <link rel="stylesheet" href="css/jquery.fileupload-ui.css">\n <!-- Shims for IE support -->\n <!--[if lte IE 8]>\n <script src="js/lib/html5shiv.js"></script>\n <script src="js/lib/r2d3.min.js" charset="utf-8"></script>\n <![endif]-->\n </head>\n\n <body>\n <div id="wrapper">\n <header id="header" class="container"></header>\n <div id="content" class="container"></div>\n </div>\n <footer id="footer" class="container"></footer>\n <div id="modal" class="modal modal-box hide fade"></div>\n <script data-main="js/main.built" src="js/lib/require.js"></script>\n\n \n\n</body></html>'
In [55]:
我还尝试加载“ https://mail.google.com/ ”:
In [7]: url='https://mail.google.com/'
In [8]: ghost = Ghost(wait_timeout=60)
In [9]: page, resources = ghost.open(url)
In [10]: ghost.content
Out[10]: u'<html><head><meta http-equiv="Refresh" content="0;URL=https://mail.google.com/mail/"></head><body><script type="text/javascript" language="javascript"><!--\nlocation.replace("https://mail.google.com/mail/")\n--></script></body></html>'
In [11]: