r - 森林图面网格比较多个模型的回归模型系数

Question

我目前正在处理 30 个具有相同列名但数字数据不同的数据集。我需要将线性混合模型和广义线性模型应用于数据集的每个实例，并在森林图上绘制得到的固定效应系数。

数据当前的结构如下（为简单起见，对每个列表元素使用相同的数据集）：

library(lme4)

data_list <- list()

# There's definitely a better way of doing this through lapply(), I just can't figure out how
for (i in 1:30){
     data_list[[i]] <- tibble::as_tibble(mtcars) # this would originally load different data at every instance
}

compute_model_lmm <- function(data){
     lmer("mpg ~ hp + disp + drat + (1|cyl)", data = data)
}

result_list_lmm <- lapply(data_list, compute_model_lmm)

我目前正在做的是

library(modelsummary)

modelplot(result_list_lmm)+
     facet_wrap(~model) #modelplot() takes arguments/functions from ggplot2

这需要花费大量时间，但它确实有效。

现在，我想在同一个图上比较另一个模型，如

compute_model_glm <- function(data){
     glm("mpg ~ hp + disp + drat + cyl", data = data)
}

result_list_glm <- lapply(data_list, compute_model_glm)

modelplot(list(result_list_lmm[[1]], result_list_glm[[1]]))

但对于情节的每个实例。

我该如何指定它modelplot()？

提前致谢！

score 0 · Accepted Answer

该modelplot函数为您提供了一些绘制系数和区间的基本方法（facet例如，检查参数）。

然而，函数的真正威力来自于使用draw=FALSE参数。在这种情况下，modelplot很难在方便的数据框中为您提供估计值，包括所有重命名、稳健的标准错误和函数的其他实用程序modelplot。然后，您可以使用该数据框自己进行绘图以ggplot2进行无限自定义。

library(modelsummary)
library(ggplot2)

results_lm <- lapply(1:10, function(x) lm(hp ~ mpg, data = mtcars)) |>
    modelplot(draw = FALSE) |>
    transform("Function" = "lm()")
results_glm <- lapply(1:10, function(x) glm(hp ~ mpg, data = mtcars)) |>
    modelplot(draw = FALSE) |>
    transform("Function" = "glm()")
results <- rbind(results_lm, results_glm) 

head(results)
          term   model estimate std.error conf.low conf.high Function
1  (Intercept) Model 1 324.0823   27.4333  268.056  380.1086     lm()
3  (Intercept) Model 2 324.0823   27.4333  268.056  380.1086     lm()
5  (Intercept) Model 3 324.0823   27.4333  268.056  380.1086     lm()
7  (Intercept) Model 4 324.0823   27.4333  268.056  380.1086     lm()
9  (Intercept) Model 5 324.0823   27.4333  268.056  380.1086     lm()
11 (Intercept) Model 6 324.0823   27.4333  268.056  380.1086     lm()

ggplot(results, aes(y = term, x = estimate, xmin = conf.low, xmax = conf.high)) +
    geom_pointrange(aes(color = Function), position = position_dodge(width = .5)) +
    facet_wrap(~model)

r - 森林图面网格比较多个模型的回归模型系数

1 回答 1

Related

Reference