python - 从列表中解析 Ijson

Question

我有一个列表，其中每个项目都包含 JSON 数据，因此我尝试使用解析数据，Ijson因为数据负载会很大。

列表的图像

这就是我想要实现的目标：

article_data=#variable which contains the list
parser = ijson.parse(article_data)
for id in ijson.items(parser, 'item'):
    if(id['article_type'] != "Monthly Briefing" and id['article_type']!="Conference"):
        data_article_id.append(id['article_id'])
        data_article_short_desc.append(id['short_desc'])
        data_article_long_desc.append(id['long_desc'])

这是我得到的错误：

AttributeError：“生成器”对象没有“读取”属性

我想将转换list为string然后尝试解析Ijson，但它失败并给了我同样的错误。

请问有什么建议吗？

data_article_id=[] 
data_article_short_desc=[] 
data_article_long_desc=[] 

for index in article_data: 
    parser = ijson.parse(index)
    for id in ijson.items(parser, 'item'):
        if(id['article_type'] != "Monthly Briefing" and id['article_type']!="Conference"):
            data_article_id.append(id['article_id'])
            data_article_short_desc.append(id['short_desc'])
            data_article_long_desc.append(id['long_desc'])

因为它在列表中，我也尝试了这个..但它给了我同样的错误。

“生成器”对象没有“读取”属性

score 0 · Accepted Answer

我假设您有一个要解析的字节字符串 json 对象列表。

ijson.items(JSON, prefix) 将可读的字节对象作为输入。也就是说，它将打开的文件或类似文件的对象作为输入。具体来说，输入应该是字节类文件对象。

如果您使用的是 Python 3，则可以使用io带有 io.BytesIO的模块来创建内存中的二进制流。

例子

假设输入是 [b'{"id": "ab"}', b'{"id": "cd"}']

list_json = [b'{"id": "ab"}', b'{"id": "cd"}']
for json in list_json:
    item = ijson.items(io.BytesIO(json), "")
    for i in item:
        print(i['id'])
Output: 
    ab
    cd

python - 从列表中解析 Ijson

1 回答 1

例子

Related

Reference