r - 使用循环在 R 中创建矩阵

Question

我试图在一个相对较小的数据集（n = 22，p = 17）上对由 LARS 算法生成的线性回归进行留一交叉验证。本质上，我需要创建 n 个标准化数据矩阵（每列由以平均值为中心并由列的 SD 标准化的条目组成）。

我以前从未使用过列表，但只要可以操作/标准化不同矩阵的列，我就愿意制作列表。

这是我在 R 中尝试过的：

for (i in 1:n)
{
  x.standardized.i <- matrix(data = NA, nrow = (n-1), ncol = p)  #creates n matrices, all n-1 x p
  for (j in 1:p)
  {
    x.standardized.i[,j] <- ((x[-i,j]-mean(x[-i,j]))/sd(x[-i,j])) #and standardizes the p variables with the ith row missing in each n matrix (i increments from 1 to n)
  }
}

我不确定我是否可以共享数据，因为它与班级的成绩有关，但是当我运行代码时，它会通过循环并通过分配一个标准化矩阵来停止，最后一行缺失为 x.standardized.i .

score 0 · Accepted Answer

sapply您可以使用and非常简单地做到这一点scale：

# Create dummy data
m <- matrix(runif(200), ncol=10)

# Leave each row out in turn, and scale each column
A <- sapply(seq_len(nrow(m)), function(i) scale(m[-i, ]), simplify='array')

默认情况下，scale每列以其平均值为中心，并除以其 sd。

对于上面的示例，您最终将得到一个包含 19 行、10 列和 20 个切片的数组。

要访问特定的切片（即交叉验证训练折叠），您可以像这样子集：

A[,, 1] # all rows, all cols, first slice
A[,, 10] # all rows, all cols, tenth slice

要确认列以它们的平均值为中心并按 1 sd 标准化：

apply(A, c(2, 3), mean)
apply(A, c(2, 3), sd)

r - 使用循环在 R 中创建矩阵

1 回答 1

Related

Reference