我有一个字典,我想在使用 pandas-profiling 进行分析后提取数据。我正在尝试获取 gfcid 的数据?我试图看看keys() 是什么,它返回4 个键。
dict_keys(['table', 'variables', 'freq', 'correlations'])
'variables': count distinct_count p_missing n_missing p_infinite n_infinite \
prf_product 200 2 0 0 0 0
gfcid 61 2 0.695 139 0 0
arrg_id 200 182 0 0 0 0
例如,我想获取其值为 139 的 gfcid p_missing,我无法计算的问题是我得到了许多 gfcid 列,我应该采取哪些最佳方法来获取这些数据。
{'table': {'n': 200,
'nvar': 3,
'total_missing': 0.23166666666666666,
'n_duplicates': 18,
'memsize': '5.0 KiB',
'recordsize': '25.6 B',
'NUM': 1,
'DATE': 0,
'CONST': 0,
'CAT': 1,
'UNIQUE': 0,
'CORR': 0,
'RECODED': 0,
'BOOL': 1,
'UNSUPPORTED': 0,
'REJECTED': 0},
'variables': count distinct_count p_missing n_missing p_infinite n_infinite \
prf_product 200 2 0 0 0 0
gfcid 61 2 0.695 139 0 0
arrg_id 200 182 0 0 0 0
is_unique mode p_unique memorysize ... \
prf_product False Overdraft 0.01 1728 ...
gfcid False 1022506923 0.01 1928 ...
arrg_id False 458040000000328871 0.91 1728 ...
iqr kurtosis skewness sum \
prf_product NaN NaN NaN NaN
gfcid NaN NaN NaN NaN
arrg_id 1.97939e+18 -2.01933 0.000718585 -5533519320767099939
mad cv n_zeros p_zeros \
prf_product NaN NaN NaN NaN
gfcid NaN NaN NaN NaN
arrg_id 9.90032e+17 0.685487 0 0
histogram \
prf_product NaN
gfcid NaN
arrg_id data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAA...
mini_histogram
prf_product NaN
gfcid NaN
arrg_id data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAA...
[3 rows x 34 columns],
'freq': {'prf_product': Overdraft 100
Retails Cards 100
Name: prf_product, dtype: int64, 'gfcid': 1022506923 61
Name: gfcid, dtype: int64, 'arrg_id': 458040000001206947 2
458040000003802902 2
458040000003898582 2
458040000003488662 2
2409124554515908929 2
..
458040000000500916 1
2422373310444855111 1
458040000002710689 1
2459484652984972989 1
458040000000940444 1
Name: arrg_id, Length: 182, dtype: int64},
'correlations': {'pearson': gfcid arrg_id
gfcid NaN NaN
arrg_id NaN 1.0, 'spearman': gfcid arrg_id
gfcid NaN NaN
arrg_id NaN 1.0}}
我这样做是为了获取并意识到它在第一行返回
desc = profile.get_description()
result = []
for i in desc.values():
print(i.values)
这就是它返回的内容,有没有办法我们可以提取这些数据?我做了 ai[0] 并且它返回错误
<built-in method values of dict object at 0x7f2db8e31318>
[[200 2 0.0 0 0.0 0 False 'Overdraft' 0.01 1728 'Overdraft' 100 'CAT' nan
nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan nan
nan nan]
[61 2 0.6950000000000001 139 0.0 0 False 1022506923 0.01 1928 1022506923
61 'BOOL' 1022506923.0 nan nan nan nan nan nan nan nan nan nan nan nan
nan nan nan nan nan nan nan nan]
[200 182 0.0 0 0.0 0 False 458040000000328871 0.91 1728 nan nan 'NUM'
1.4480719292929288e+18 9.926344464280993e+17 9.853231442356191e+35']]
<built-in method values of dict object at 0x7f2db8e31e10>
<built-in method values of dict object at 0x7f2db8e4e1f8>