4

rpart几天来我一直在努力(试图为我拥有的这个数据集制作分类树),我认为现在是时候询问生命线了 :-) 我确信这很愚蠢我没有看到,但这是我一直在做的事情:

EuropeWater <- read.csv(file=paste("/Users/artessaniccola/Documents/",
                       "Magic Briefcase/CityTypology/Europe_water.csv",sep=""))
library(rpart)
attach(EuropeWater)
names(EuropeWater)
[1] "City"          "waterpercapita_m3" "water_class"       "population"       
[5] "GDPpercapita"  "area_km2"          "populationdensity" "climate"            
EuropeWater$water_class <- factor(EuropeWater$water_class, levels=1:3, 
                                  labels=c("Low", "Medium", "High"))
EuropeWater$climate <- factor(EuropeWater$climate, levels=2:4, 
                              labels=c("Arid", "Warm temperate", "Snow"))
EuropeWater_tree <- rpart(EuropeWater$water_class ~ 
               population+GDPpercapita + area_km2 + populationdensity + 
               EuropeWater$climate, 
               data=EuropeWater, method=class)   
Error in as.character(x) : 
          cannot coerce type 'builtin' to vector of type 'character'

对于我的一生,我无法弄清楚错误是关于什么的。

4

2 回答 2

19

这行得通吗?

EuropeWater_tree <- rpart(EuropeWater$water_class ~ 
 population+GDPpercapita + area_km2 + populationdensity + EuropeWater$climate, 
 data=EuropeWater, method="class")

我认为你应该引用方法类型。

当您使用class而不是"class"then R 尝试转换为自己的字符时:

as.character(class)
Error in as.character(class) : 
    cannot coerce type 'builtin' to vector of type 'character'

Causeclass是一个类型为 的函数buildin

typeof(class)
[1] "builtin"
于 2010-02-25T20:01:03.247 回答
3

我将从修正公式开始:删除多余EuropeWater的,因为您已经提供了data=参数:

res <- rpart(water_class ~ population + GDPpercapita + area_km2 + 
                           populationdensity + climate, 
             data=EuropeWater, method="class")

此外,请确保您的所有列data.frame都是适当的类型。也许从 csv 文件中读取的某些数据被错误地读取为一个因素?快速summary(EuropeWater)可能会揭示这一点。

于 2010-02-25T19:46:15.993 回答