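What I am doing:

  1. Google for some LinkedIn-specific links
  2. Log in to linkedin.com (succeeds)
  3. Revisit the home page (fails here)
  4. Extract the required information from the links found in step 1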

My Scrapy bot fails at step 3. So my question is: is there a way to get around this Content Security Policy? If so, please share your thoughts in detail. Thanks.

One of the related issues Google pointed me to is "CSP header Eval Issue", but it gives no details about proxy settings, so it did not help.

2017-12-29 10:59:47 [scrapy_splash.middleware] WARNING: Bad request to Splash: {u'info': {u'source': u'[string "..."]', u'message': u'Lua error: [string "..."]:32: JavaScript error: EvalError: Refused to evaluate a string as JavaScript because \'unsafe-eval\' is not an allowed source of script in the following Content Security Policy directive: "script-src \'report-sample\' \'sha256-[redacted...]=\' \'unsafe-inline\' static.licdn.com sclnkd.licdn.com static-fstl.licdn.com static-src.linkedin.com https://www.linkedin.com/voyager/service-worker-push.js https://platform.linkedin.com/js/analytics.js static-exp1.licdn.com static-exp2.licdn.com scexp1.licdn.com scexp2.licdn.com static-lcdn.licdn.com sclcdn.licdn.com https://www.linkedin.com/sc/ https://www.linkedin.com/scds/ https://qprod.www.linkedin.com/sc/ https://www.linkedin.com/sw.js https://www.linkedin.com/voyager/abp-detection.js".\n', u'type': u'LUA_ERROR', u'line_number': 32, u'error': u'JavaScript error: EvalError: Refused to evaluate a string as JavaScript because \'unsafe-eval\' is not an allowed source of script in the following Content Security Policy directive: "script-src \'report-sample\' \'sha256-[redacted...]=\' \'unsafe-inline\' static.licdn.com sclnkd.licdn.com static-fstl.licdn.com static-src.linkedin.com https://www.linkedin.com/voyager/service-worker-push.js https://platform.linkedin.com/js/analytics.js static-exp1.licdn.com static-exp2.licdn.com scexp1.licdn.com scexp2.licdn.com static-lcdn.licdn.com sclcdn.licdn.com https://www.linkedin.com/sc/ https://www.linkedin.com/scds/ https://qprod.www.linkedin.com/sc/ https://www.linkedin.com/sw.js https://www.linkedin.com/voyager/abp-detection.js".'}, u'type': u'ScriptError', u'description': u'Error happened while executing Lua script', u'error': 400}
2017-12-29 10:59:47 [scrapy.core.engine] DEBUG: Crawled (400) <GET https://www.linkedin.com/feed/ via http://localhost:8050/execute> (referer: None)
2017-12-29 10:59:47 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <400 https://www.linkedin.com/feed/>: HTTP status code is not handled or not allowed
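
To make the failure in the log concrete, here is a minimal sketch of how I read it: the page's script-src directive has no 'unsafe-eval' source, so any string-to-code evaluation on that page is refused. The helper names parse_csp and allows_eval are my own (hypothetical, not part of Scrapy or Splash), and LOGGED_POLICY is an abbreviated copy of the directive from the log, not the full header. This only diagnoses the policy; it does not bypass it.

```python
def parse_csp(header):
    """Split a Content-Security-Policy value into {directive: [sources]}."""
    policy = {}
    for part in header.split(";"):
        tokens = part.split()
        if not tokens:
            continue
        name = tokens[0].lower()
        # If a directive is repeated, only the first occurrence is kept.
        policy.setdefault(name, tokens[1:])
    return policy


def allows_eval(header):
    """Return True if the policy would permit eval()-style script evaluation."""
    policy = parse_csp(header)
    # script-src falls back to default-src when it is not present.
    sources = policy.get("script-src", policy.get("default-src"))
    if sources is None:
        return True  # nothing in the policy restricts scripts
    return any(src.lower() == "'unsafe-eval'" for src in sources)


# Abbreviated (assumed) version of the directive shown in the log above.
LOGGED_POLICY = (
    "script-src 'report-sample' 'unsafe-inline' static.licdn.com "
    "https://www.linkedin.com/sc/ https://www.linkedin.com/sw.js"
)

print(allows_eval(LOGGED_POLICY))  # False
```

Since allows_eval returns False for this directive, the EvalError at line 32 of my Lua script is consistent with the policy as logged.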
