1

我需要从所有命名空间的子集中选择特征组合(二次和三次)。我想避免单一的功能。例子:

1 1 '1-1-48112-c|m m_18 |o o_15 |s s_100390 |t t_20 |n n_5283 |d d_10023 |p p_2606 |w w_1 |f f_6.0 |g g_113119

我想选择-q m与命名空间'm'组合的所有功能(根据我使用选项的大众命令行)。重要的是,我想避免所有单一功能,如t_20, m_18,o_15等。

使用--ignoreparams 也会删除我不想要的组合。

在源代码中,似乎忽略名称空间在组合之前很早就被删除了。如果需要,我愿意更改 VW 的源代码。

4

1 回答 1

1

If you want to use the interactions between 2 namespaces, while simultaneously ignoring the 2 namespaces on their own, you are going to need to construct the interaction features on your own and pass them to the model.

It shouldn't be too hard: just contruct a new namespace with the concatenation of features you want, e.g: 1 1 '1-1-48112-c|mynewnamespace t_20_m_18_o_15

However, I'm curious why you want to do this? Including the base features plus their quadratic interactions will probably yield similar, if not better, predictive accuracy as taking out the base features.

于 2014-03-31T17:01:32.957 回答