我正在尝试使用 PROC ARBOR 为连续变量定义 bin。生成的树效果很好,我可以通过视觉探索找到 bin 限制,但我想提取这些 bin 并使用它们以自动方式离散化原始变量。这可能吗?
我的代码是:
%macro INTINPUTS;
l_G_MERGE6_t1_monto6
%mend INTINPUTS;
proc arbor data=labo2.J_TABLA_MODELO_LOGS
Leafsize=5 Mincatsize = 5 Maxbranch=2 Maxdepth=6 alpha = 0.2
Padjust= CHAIDBEFORE DEPTH MAXRULES=1 MAXSURRS=0 Missing=USEINSEARCH Exhaustive=5000 ;
input %INTINPUTS
/ level = interval;
target A_C_0804_flag_compro / level=INTERVAL
Criterion=PROBF;;
Performance DISK
NodeSize=20000;
Assess NoValidata measure=ASE;
SUBTREE BEST ;
MAKEMACRO NLEAVES=nleaves;
save
NODESTAT=work.Tree_OUTNODES
SUMMARY=work.Tree_OUTSUMMARY
code file="C:\labo2\EMPUBLISHSCORE.sas"
group=Tree;
code file="C:\labo2\EMFLOWSCORE.sas"
group=Tree
residual;
run;
quit;
谢谢!