我正在尝试运行 ELKI 以在数据集上以 arff 文件的形式实现 k-medoids(对于 k=3)(使用 ELKI 中的 ARFFParser):
数据集有 7 个维度,但是我获得的聚类结果仅显示一维级别的聚类,并且仅针对 3 个属性执行此操作,而忽略了其余属性。像这样:
谁能帮助我如何获得所有维度的聚类可视化?
我正在尝试运行 ELKI 以在数据集上以 arff 文件的形式实现 k-medoids(对于 k=3)(使用 ELKI 中的 ARFFParser):
数据集有 7 个维度,但是我获得的聚类结果仅显示一维级别的聚类,并且仅针对 3 个属性执行此操作,而忽略了其余属性。像这样:
谁能帮助我如何获得所有维度的聚类可视化?
ELKI is mostly used with numerical data.
Currently, ELKI does not have a "mixed" data type, unfortunately.
The ARFF parser will split your data set into multiple relations:
age
sex
and region
salary
married
children
car
Apparently it has messed up the relation labels, though. But other than that, this approach works perfectly well with arff data sets that consist of numerical data + a class label, for example - the use case this parser was written for. It is a well-defined and consistent behaviour, though not what you expected it to do.
The algorithm then ran on the first relation it could work with, i.e. age
only.
So here is what you need to do:
Alternatively, you could write a script to encode your data in a numerical data set, then it will work fine. But in my opinion, the results of one-hot-encoding etc. are not very convincing usually.