python - CiteULike API '禁止'

Question

我正在尝试查询 CiteULike 中研究论文的书签数量。我正在使用“ http://www.citeulike.org/api/posts/for/doi/ ” URL 来请求（使用 Python 的 urllib2 库）对包含有关书签信息的 XML 文档给定的 DOI（论文的唯一标识符）。但是我不断收到 HTTP 403 错误：禁止访问。

有谁知道我为什么会收到这个错误？我尝试将带有 DOI 的 URL 放在浏览器中，并且返回 XML 就好了，所以问题似乎与我的自动请求有关。

谢谢，纳撒尼尔

score 3 · Accepted Answer

您应该阅读http://wiki.citeulike.org/index.php/Importing_and_Exporting#Scripting_CiteULike

如果您通过自动化流程访问 CiteULike，您必须提供一种通过 User-Agent 字符串识别您自己的方法。请使用“<username>/<email> <application>”，例如“fred/fred@wilma.com myscraper/1.0”。任何无法识别您身份的网站脚本都可能导致阻止。

python - CiteULike API '禁止'

1 回答 1

Related

Reference