2

我正在使用 reticulate 将 python 保存的 .pkl 转换为 R 数据帧。我看到分类 python 类型保存在 R 数据框中。我猜 R 不知道如何处理 python 分类变量。如何将这些转换为 R 中的因子变量?

#Looking at the class of some of the variables in the R dataframe, I get

class(df$variable)

$variable
[1] "pandas.core.arrays.categorical.Categorical" 
"pandas.core.arrays.base.ExtensionArray"    
[3] "pandas.core.base.PandasObject"              
"pandas.core.base.StringMixin"              
[5] "pandas.core.accessor.DirNamesMixin"         "python.builtin.object"      

#I get the right levels, but this doesn't seem to be R-friendly 

> table(levels(factor(as.character(df$varibale))))

[level 1, level 2, level 3, level 4, level 5, ..., NaN, NaN, NaN, NaN, 
NaN]\nLength: 464121\nCategories (6, object): [level 1 < level 2 < level 3 
< level 4 < level 5 < level 6] 
4

0 回答 0