0

的输出dput通常比用户为创建相同对象而键入的内容要复杂得多。我知道这可能是保证 100% 可重复性所必需的(包括例如当不同的用户使用不同的默认设置时)。但是,它并没有使示例尽可能具有可读性,而且我经常花一些时间来简化输出。

例如,考虑:

dput(data.frame(a=1:10))
> structure(list(a = 1:10), .Names = "a", row.names = c(NA, -10L), class = "data.frame")

难道没有替代方案dput可以简单地返回data.frame(a=1:10)吗?

4

1 回答 1

3

这是 data.frames 的部分实现(没有 row.names 或其他属性):

dput2 <- function(x, ...) UseMethod("dput2")
dput2.data.frame <- function(x, ...) {
    fun <- function(nm) paste(nm, "=", 
            paste(capture.output(dput(x[[nm]], file = stdout())), collapse = ""))
    L <- lapply(names(x), fun)
    cat(paste("data.frame(", paste(unlist(L), collapse = ",\n"), ")"), "\n")
}

例如,

> dput2(BOD)
data.frame( Time = c(1, 2, 3, 4, 5, 7),
demand = c(8.3, 10.3, 19, 16, 15.6, 19.8) ) 

> dput2(anscombe)
data.frame( x1 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x2 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x3 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x4 = c(8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8),
y1 = c(8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68),
y2 = c(9.14, 8.14, 8.74, 8.77, 9.26, 8.1, 6.13, 3.1, 9.13, 7.26, 4.74),
y3 = c(7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73),
y4 = c(6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.5, 5.56, 7.91, 6.89) ) 
于 2014-07-28T13:27:02.390 回答