我正在使用该twitteR
软件包R
来收集一些推文。但是,我注意到该searchTwitter
函数返回的推文文本不是完整的推文文本,而是被删减到正好等于 140 个字符,其余文本被网络上推文的链接替换。
以我找到的一条推文为例:
require(twitteR)
require(ROAuth)
# authorize twitter with consmuer and access key/secret
setup_twitter_oauth(AAA, BBB, CCC, DDD) # actual secret codes go here...
# get sample tweet
tweet <- searchTwitter("When I was driving around earlier this afternoon I only saw two Hunters",
n=500,
since = "2017-11-04",
until = "2017-11-05",
retryOnRateLimit=5000)
# print tweet
tweet[[1]]
[1] "_TooCrazyFox_: When I was driving around earlier this afternoon I only saw two Hunters but it was during the midday break. I didn'… *SHORTENEDURL*"
# the *SHORTENEDURL* is actually a link that brings you to the tweet; stackoverflow didn't want me to a put shortened urls in here
# convert to data frame
df <- twListToDF(tweet)
# output text and ID
df$text
[1] "When I was driving around earlier this afternoon I only saw two Hunters but it was during the midday break. I didn'… *SHORTENEDURL*"
df$id
[1] "926943636641763328"
如果我通过我的网络浏览器访问这条推文,很明显twitteR
将文本缩短为 140 个字符,并包含指向包含整个文本的推文的链接。
twitteR
我在文档中没有看到任何提及这一点。有没有办法在搜索过程中保留整个推文文本?
我的假设是,这与此处引用的 Twitter 字符长度的变化有关:https ://developer.twitter.com/en/docs/tweets/tweet-updates (在“兼容模式 JSON 渲染”中)。这意味着我需要检索该full_text
字段,而不是该text
字段。但是,这似乎不是由twitteR
.