我有一个 .gz 类型的文件,里面有 JSON 对象,例如:
input:
{ "name":"John", "age":21, "gender":"male" }
{ "name":"Mike", "age":29, "gender":"male" }
{ "name":"Tim", "age":20, "gender":"male" }
{ "name":"Kim", "age":39, "gender":"female" }
注意:请注意,每个 JSON obj 的末尾没有逗号。
我使用以下内容将其保存到数据框:
import pandas as pd
data_location = 's3://myBucket/myFolder'
raw_json_data = pd.read_json(data_location, lines=True)
raw_json_data.head(2)
问题:我想将其转换为 CSV,可能是这样的:
expected output:
name, age, gender
John, 21, male
Mike, 29, male
Tim, 20, male
Kim, 39, female
我使用了这个,但没有提供预期的输出 - 我错过了什么吗?
df=pd.read_json(raw_json_data)
df.to_csv('results.csv')