我正在处理这个日期集:
https://www.kaggle.com/ronitf/heart-disease-uci?select=heart.csv
我正在查看的结果,pandas profiling
它表明该age
列具有HIGH CORRELATION
与thalach
列。
我检查了这些字段之间的 3 种相关性:
print(f"pearson = ",df['age'].corr(df['thalach'], method='pearson'))
print(f"spearman = ",df['age'].corr(df['thalach'], method='spearman'))
print(f"kendall = ",df['age'].corr(df['thalach'], method='kendall'))
我得到:
pearson = -0.39852193812106734
spearman = -0.3980524371044455
kendall = -0.28000884141748783
3 种相关性显示出较低的相关性。
我错过了什么?有没有办法熊猫分析是错误的?