1

我正在处理这个日期集:

https://www.kaggle.com/ronitf/heart-disease-uci?select=heart.csv

我正在查看的结果,pandas profiling它表明该age列具有HIGH CORRELATIONthalach列。

我检查了这些字段之间的 3 种相关性:

print(f"pearson = ",df['age'].corr(df['thalach'], method='pearson'))
print(f"spearman = ",df['age'].corr(df['thalach'], method='spearman'))
print(f"kendall = ",df['age'].corr(df['thalach'], method='kendall'))

我得到:

pearson =  -0.39852193812106734
spearman =  -0.3980524371044455
kendall =  -0.28000884141748783

3 种相关性显示出较低的相关性。

我错过了什么?有没有办法熊猫分析是错误的?

4

0 回答 0