Questions tagged [regularized]



184 questions

0 votes

1 answer

564 views

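caffe - Does caffe apply the regularization parameter to the biases?

I have several questions about how regularization and biases work in caffe.

First, are biases present in the network by default? Or do I need to ask caffe to add them?

Second, regularization is not taken into account when the loss value is computed. Is that right? I mean the loss contains only the loss function value. As far as I understand, regularization is only taken into account in the gradient computation. Is that right?

Third, when caffe computes the gradient, does it also include the bias values in the regularization? Or does it only include the network's weights in the regularization?

Thanks in advance,

Afshin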
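To make the second and third questions concrete, here is a minimal NumPy sketch of L2 weight decay applied at the gradient step, with a per-parameter multiplier that can switch the decay off for the biases. The function name `l2_regularized_gradients` and the `decay_mults` argument are hypothetical and only loosely modeled on caffe's `weight_decay` and `decay_mult` settings; this is an illustration of the idea, not caffe's actual solver code.

```python
import numpy as np

def l2_regularized_gradients(params, grads, weight_decay, decay_mults):
    """Add an L2 decay term to each gradient (hypothetical helper).

    The data loss itself is left untouched; the penalty only shows up
    in the gradients. A multiplier of 0.0 excludes a parameter
    (for example a bias vector) from regularization.
    """
    return [g + weight_decay * m * p
            for p, g, m in zip(params, grads, decay_mults)]

weights = np.array([[1.0, -2.0], [0.5, 3.0]])
bias = np.array([0.1, -0.2])

# Pretend the data-loss gradients are zero so only the decay term is visible.
grad_w = np.zeros_like(weights)
grad_b = np.zeros_like(bias)

new_grad_w, new_grad_b = l2_regularized_gradients(
    [weights, bias], [grad_w, grad_b],
    weight_decay=0.01, decay_mults=[1.0, 0.0],
)
```

With `decay_mults=[1.0, 1.0]` the bias would be decayed as well, so whether biases are regularized in this sketch depends entirely on the multiplier chosen for them.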

caffe regularized

2016-08-25T13:38:06.060

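0 votes

1 answer

4847 views

tensorflow - How to use regularization in TensorFlow-Slim?

I want to use regularization in my code. I use slim to create a conv2d as follows:

How can I add regularization to this? And how do I use it to regularize my loss?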

2016-09-01T06:55:36.113

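0 votes

1 answer

46 views

r - Assign location values based on date and time from a second dataset

I have 2 data frames: one is a sequence of GPS locations with associated datetimes (POSIXct).

The other is a sequence of depths with associated datetimes (POSIXct).

For each depth record, I want to assign a location (latitude and longitude) based on where the interpolated track from the location data frame suggests it should be. That is, if the position goes from point A to point B, at which point along that line does the depth record lie, given its datetime and assuming uniform speed between points?

The end product would be 2 vectors in the data frame that assign a latitude and a longitude to each depth value.

Thank you.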
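The uniform-speed assumption in the question amounts to linear interpolation of latitude and longitude over time. A minimal sketch of that idea follows, written in Python with NumPy rather than R, and assuming the datetimes have already been converted to numeric seconds (in R, `as.numeric()` on a POSIXct vector). The function name `interpolate_positions` and the sample values are hypothetical.

```python
import numpy as np

def interpolate_positions(gps_times, gps_lat, gps_lon, depth_times):
    """Linearly interpolate latitude/longitude at each depth timestamp.

    Times are numeric (e.g. seconds since the epoch). Depth times that
    fall outside the GPS time range are clamped to the first or last
    fix, which is how np.interp behaves by default.
    """
    gps_times = np.asarray(gps_times, dtype=float)
    order = np.argsort(gps_times)  # np.interp needs increasing x values
    t = gps_times[order]
    lat = np.interp(depth_times, t, np.asarray(gps_lat, dtype=float)[order])
    lon = np.interp(depth_times, t, np.asarray(gps_lon, dtype=float)[order])
    return lat, lon

# Two GPS fixes 100 seconds apart, and two depth records in between.
lat, lon = interpolate_positions(
    gps_times=[0, 100], gps_lat=[10.0, 20.0], gps_lon=[-5.0, -3.0],
    depth_times=[25, 50],
)
```

This interpolates latitude and longitude independently along a straight line in coordinate space, which is a reasonable approximation for closely spaced fixes but ignores great-circle geometry and tracks that cross the antimeridian.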

r dataframe gis interpolation regularized

2016-11-10T14:15:28.073

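0 votes

1 answer

3316 views

keras - In Keras, what is the difference between weight_regularizer and activity_regularizer?

I understand that regularization usually adds k*w^2 to the loss to penalize large weights. But in Keras there are two regularizer parameters: weight_regularizer and activity_regularizer. What is the difference?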
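As a rough illustration of the distinction being asked about, the sketch below computes both kinds of penalty by hand in NumPy, assuming that a weight regularizer penalizes the layer's weight matrix while an activity regularizer penalizes the layer's output. The helper `l2_penalty`, the constant `k`, and the toy layer are hypothetical, and no Keras API is called here.

```python
import numpy as np

def l2_penalty(values, k):
    """Return k * sum(values^2), the k*w^2 term from the question."""
    return k * np.sum(np.square(values))

k = 0.01
x = np.array([[1.0, 2.0]])                # one input sample
W = np.array([[0.5, -1.0], [0.0, 2.0]])   # weights of a toy dense layer
b = np.zeros(2)

output = x @ W + b                        # layer activations for this sample

weight_term = l2_penalty(W, k)            # depends only on the parameters
activity_term = l2_penalty(output, k)     # depends on the input as well
```

The weight term is the same for every batch, whereas the activity term changes with the data that passes through the layer.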

keras regularized

2016-12-22T21:39:21.543

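0 votes

1 answer

2392 views

deep-learning - L2 regularization in caffe, converting from Lasagne

I have some Lasagne code. I want to create the same network using caffe. I can convert the network, but I need help with the Lasagne hyperparameters. The hyperparameters in Lasagne look like this:

How do I perform the L2 regularization part in caffe? Do I have to add a layer for regularization after each convolution/inner product layer? The relevant part of my solver.prototxt is as follows:

Also posted on http://datascience.stackexchange.com. Waiting for an answer.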

deep-learning caffe lasagne regularized

2017-01-11T07:40:28.567

0 votes

1 answer

80 views

machine-learning - Regularization on Sample vs Full Dataset for Machine Learning

I recently watched a video explaining that for Deep Learning, if you add more data, you don't need as much regularization, which sort of makes sense.

That being said, does this statement hold for "normal" Machine Learning algorithms such as Random Forest? If so, when searching for the best hyper-parameters for the algorithm, in theory your input dataset (which of course gets further divided into cross-validation sets, etc.) should be all the data you have, not just a sample of it. This of course means a much longer training time, since for every combination of hyper-parameters you have X cross-validation sets that need to be trained, and so on.

So basically, is it fair to assume that the parameters found for a decently sized sample of your dataset are the "best" ones to use for the entire dataset, or not?

machine-learning deep-learning random-forest regularized

2017-02-01T14:54:52.130

0 votes

2 answers

1168 views