python - 在 praw 中，我正在尝试打印评论正文，但是如果遇到空评论怎么办？

Question

我正在尝试打印 subreddit 顶部帖子中的所有评论，以便我的机器人可以分析它们。我在当天早些时候运行它，但我现在尝试运行它，但遇到了一个错误。

这是我的代码：

r = praw.Reddit('Comment crawler v1.0 by /u/...')
r.login('username', 'password')
subreddit=r.get_subreddit('subreddit')
post_limit = 25
subreddit_posts = subreddit.get_hot(limit=post_limit)
subids = set()
for submission in subreddit_posts:
    subids.add(submission.id)
subid = list(subids)

i=0
while i < post_limit:
    submission = r.get_submission(submission_id=subid[i])
    flat_comments = praw.helpers.flatten_tree(submission.comments)
    with open('alreadydone.txt', 'r') as f:
        already_done = [line.strip() for line in f]
    f.close()
    for comment in flat_comments:
        if "Cricketbot, give me Australian news" in **comment.body** and comment.id not in already_done:
            info = feedparser.parse(Australia) #Australia gives a link to an RSS feed.

加星标的部分是我遇到问题的地方。我正在尝试查看写有“Cricketbot，给我澳大利亚新闻”的评论。不幸的是，如果注释的正文不存在，即注释为空，代码将返回一个属性错误并说注释没有属性“正文”。

如何解决这个问题？

score 15 · Accepted Answer

它通常有助于添加堆栈跟踪，以便人们可以看到实际的错误。但是，作为 PRAW 维护者，我知道错误类似于MoreComments type has no attribute body.

有三种简单的方法可以解决您的问题。第一种是简单地将if "Cricketbot"语句包装在 try/except 中并忽略属性错误。

try:
    if "Cricketbot..."
        ...
except AttributeError:
    pass

不过，这并不是非常令人兴奋。第二种方法是确保您实际使用的对象具有body可以通过两种方式完成的属性：

首先是显式检查属性是否存在：

for comment in flat_comments:
    if not hasattr(comment, 'body'):
        continue
    ...

第二个是验证您实际上是在使用Comment对象而不是MoreComments对象：

for comment in flat_comments:
    if not isinstance(comment, praw.objects.Comment):
        continue
    ...

但是，在运行上述任何解决方案时，您实际上并没有处理提交的所有评论，因为您缺少隐藏在MoreComments对象 [ ref ] 后面的任何内容。用一些注释替换MoreComments对象（全部替换可能非常低效）需要在展平树之前使用该函数：replace_more_comments

submission = r.get_submission(submission_id=subid[i])
submission.replace_more_comments(limit=16, threshold=10)
flat_comments = praw.helpers.flatten_tree(submission.comments)

设置limit=16和threshold=10意味着提出不超过 16 个额外的请求，并且只提出将导致至少 10 个额外评论的请求。您可以根据需要使用这些值，但请注意，每次替换都需要一个额外的请求（2 秒），有些只产生一条评论。

我希望这会有所帮助。

python - 在 praw 中，我正在尝试打印评论正文，但是如果遇到空评论怎么办？

1 回答 1

Related

Reference