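Is there a size limit for Facebook's URL scraper? We have several books available on our website. The ones whose HTML files are below a certain size (~390KB) get scraped and read properly, but the 4 larger ones do not. These larger items still get a 200 response code, and the canonical URL opens.
All of these pages are built from the same template; the only differences are the size of each book's content and the number of links each book has to other pages on the website.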
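- Click the canonical URL
- Open Firebug in Firefox, or the developer tools in Chrome, to the Network tab
- For the failures listed, the *.html size is >~390KB; for the successes it is <~390KB
- Click "See exactly what our scraper sees for your URL"
- Failures show a blank page; successes show HTML
Failures: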
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftapom.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftbgpu.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fttjc.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftbdse.html
Successes:
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fthogtc.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Faabibp.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftww.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftsosw.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fsyottc.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fttigtio.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Faadac.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fsiud.html
- https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftuyc.html
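In case it helps anyone reproduce this without clicking through the network tab by hand, here is a minimal Python sketch of the same size check. The ~390KB cutoff is only the figure I observed, not a documented Facebook limit, and the names `target_url`, `body_size`, `exceeds_threshold` and the `User-Agent` string are made up for this example. Note that `body_size` measures the uncompressed body that `urllib` downloads, which may differ from the size the browser tools report if the server compresses responses.

```python
from urllib.parse import parse_qs, urlparse
from urllib.request import Request, urlopen

# Assumption: this cutoff is just the ~390KB figure observed above,
# not a limit documented by Facebook.
THRESHOLD_BYTES = 390 * 1024


def target_url(debugger_url: str) -> str:
    """Extract the page URL from the `q` parameter of a debugger link."""
    query = parse_qs(urlparse(debugger_url).query)
    return query["q"][0]


def body_size(url: str, timeout: float = 30.0) -> int:
    """Download the page and return the size of its body in bytes."""
    # The User-Agent value is a placeholder for this sketch.
    request = Request(url, headers={"User-Agent": "size-check/0.1"})
    with urlopen(request, timeout=timeout) as response:
        return len(response.read())


def exceeds_threshold(size_bytes: int, threshold: int = THRESHOLD_BYTES) -> bool:
    """Return True when a page is larger than the observed cutoff."""
    return size_bytes > threshold
```

Calling `exceeds_threshold(body_size(target_url(link)))` for each debugger link above should, if the size theory holds, return `True` for the failures and `False` for the successes.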