r - 带有 r 的谷歌趋势阅读线

Question

我有一个简单的网址 http://www.google.com/trends/fetchComponent?q=amazon&date=today%203-m&cid=TIMESERIES_GRAPH_0&export=3

把它放进去readLines给我一个错误

readLines("http://www.google.com/trends/fetchComponent?q=amazon&date=today%203-m&cid=TIMESERIES_GRAPH_0&export=3")

Error in file(con, "r") : cannot open the connection
In addition: Warning message:
In file(con, "r") :
  cannot open: HTTP status was '203 Non-Authoritative Information'

我确定我必须使用 RCurl 之类的东西并登录谷歌（就像其他帖子中建议的那样）。但我的问题是，为什么我的方法不起作用？我在没有登录的情况下在几个浏览器中尝试了这个 url，但是将它与 R 一起使用是行不通的。谁能给我一些指示？

编辑

我认为问题是因为我的 IP 或其他东西被阻止了。我在几台计算机上尝试过，结果总是一样的：它在 R 中工作了 3 到 4 次，但没有，现在仍然没有。尽管如此，它在浏览器中仍然有效。

score 0 · Accepted Answer

如果您愿意尝试不同的方法来阅读网站，您可以使用 XML 库。例如，

library(XML)
url <-"http://www.google.com/trends/fetchComponent?q=amazon&date=today%203 m&cid=TIMESERIES_GRAPH_0&export=3"

htmlParse(url)

给了我想要的输出。

r - 带有 r 的谷歌趋势阅读线

1 回答 1

Related

Reference