plot - 线性混合效应模型中的显着交互，但绘图显示重叠置信区间？

Question

我试图尽可能多地向您展示数据的结构和我产生的结果。

数据结构如下：

   GroupID Person Factor2 Factor1 Rating
    <int>  <int>  <fctr>  <fctr>  <int>
1       2    109       2       0      1
2       2    109       2       1     -2
3       2    104       1       0      4
4       2    236       1       1      1
5       2    279       1       1      2
6       2    179       2       1      0

Person 是参与者 ID，GroupID 是被评定的刺激类型，Factor 1（0 级和 1 级）和 Factor 2（1 级和 2 级）是固定因素，Ratings 是结果变量。

我正在尝试在线性混合效应模型中打印一个显着交互的图。我使用包 lme4 和 lmerTest 来分析数据。

这是我们运行的模型：

> model_interaction <- lmer(Rating ~ Factor1 * Factor2 + ( 1 | Person) +
(1 | GroupID), data)
> model_interaction

Linear mixed model fit by REML ['merModLmerTest']
Formula: Rating ~ Factor1 * Factor2 + (1 | Person) + (1 | GroupID)
Data: data
REML criterion at convergence: 207223.9

Random effects:
 Groups   Name        Std.Dev.
 Person   (Intercept) 1.036   
 GroupID  (Intercept) 1.786   
 Residual             1.880   
Number of obs: 50240, groups:  Person, 157; GroupID, 80
Fixed Effects:
  (Intercept)           Factor11           Factor22  Factor11:Factor22  
     -0.43823            0.01313            0.08568            0.12440

当我使用 summary() 函数时，R 返回以下输出

> summary(model_interaction)

Linear mixed model fit by REML 
t-tests use  Satterthwaite approximations to degrees of freedom
['lmerMod']
Formula: Rating ~ Factor1 * Factor2 + (1 | Person) + (1 | GroupID)
   Data: data

REML criterion at convergence: 207223.9

Scaled residuals: 
    Min      1Q  Median      3Q     Max 
-4.8476 -0.6546 -0.0213  0.6516  4.2284 

Random effects:
 Groups   Name        Variance Std.Dev.
 Person   (Intercept) 1.074    1.036   
 GroupID  (Intercept) 3.191    1.786   
 Residual             3.533    1.880   
Number of obs: 50240, groups:  Person, 157; GroupID, 80

Fixed effects:
                    Estimate Std. Error         df t value Pr(>|t|)    
(Intercept)       -4.382e-01  2.185e-01  1.110e+02  -2.006 0.047336 *  
Factor11           1.313e-02  2.332e-02  5.004e+04   0.563 0.573419    
Factor22           8.568e-02  6.275e-02  9.793e+03   1.365 0.172138    
Factor11:Factor22  1.244e-01  3.385e-02  5.002e+04   3.675 0.000238 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Correlation of Fixed Effects:
            (Intr) Fctr11 Fctr22
Factor11    -0.047              
Factor22    -0.135  0.141       
Fctr11:Fc22  0.034 -0.694 -0.249

我知道无法解释线性混合效应模型的 p 值。所以我运行了一个额外的方差分析，将交互模型与仅具有 Factor1 和 Factor2 主效应的模型进行比较

> model_Factor1_Factor2 = lmer(Rating ~ Factor1 + Factor2 +
  ( 1 | Person) + (1 | GroupID), data)
> anova(model_Factor1_Factor2, model_interaction)

Data: data
Models:
object: Rating ~ Factor1 + Factor2 + (1 | Person) + (1 | GroupID)
..1: Rating ~ Factor1 * Factor2 + (1 | Person) + (1 | GroupID)
       Df    AIC    BIC  logLik deviance  Chisq Chi Df Pr(>Chisq)    
object  6 207233 207286 -103611   207221                             
..1     7 207222 207283 -103604   207208 13.502      1  0.0002384 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

我将此输出解释为：与仅具有因子 1 和因子 2 主效应的模型相比，因子 1 和因子 2 的交互作用解释了我的结果测量中的额外差异。

由于很难解释线性混合效应模型的输出，我想打印一张图表，显示 Factor1 和 Factor2 的相互作用。我使用 lsmeans 包这样做（首先我使用了 plot(allEffects) 但在阅读了如何在混合效应模型中获取系数及其置信区间？问题后我意识到这不是为线性混合效应模型打印图形的正确方法）。

所以这就是我所做的（按照这个网站http://rcompanion.org/handbook/G_06.html）

> leastsquare = lsmeans(model_interaction, pairwise ~ Factor2:Factor1,
 adjust="bon")
> CLD = cld(leastsquare, alpha=0.05, Letters=letters, adjust="bon")
> CLD$.group=gsub(" ", "", CLD$.group)
> CLD
 Factor2 Factor1     lsmean        SE     df   lower.CL  upper.CL .group
 1       0       -0.4382331 0.2185106 111.05 -0.9930408 0.1165746  a    
 1       1       -0.4251015 0.2186664 111.36 -0.9803048 0.1301018  a    
 2       0       -0.3525561 0.2190264 112.09 -0.9086735 0.2035612  a    
 2       1       -0.2150234 0.2189592 111.95 -0.7709700 0.3409233   b   

Degrees-of-freedom method: satterthwaite 
Confidence level used: 0.95 
Conf-level adjustment: bonferroni method for 4 estimates 
P value adjustment: bonferroni method for 6 tests 
significance level used: alpha = 0.05

这是我使用的绘图功能

> ggplot(CLD, aes(`Factor1`, y = lsmean, ymax = upper.CL,
  ymin = lower.CL, colour = `Factor2`, group = `Factor2`)) +
  geom_pointrange(stat = "identity",
  position = position_dodge(width = 0.1)) +
  geom_line(position = position_dodge(width = 0.1))

可以使用此链接找到该情节（我还不允许发布图像，请原谅解决方法）

Factor1 和 Factor2 的交互作用

现在我的问题如下：为什么我有一个显着的交互作用和这种交互作用的大量解释方差，但我在情节中的置信区间重叠？我想我对置信区间做错了什么？还是因为无法解释线性混合效应模型的显着性指数？

score 4 · Accepted Answer

因为它是苹果和橘子。

苹果：均值的置信区间。

橙子：均值差异检验。

均值和均值差是不同的统计量，它们具有不同的标准误差和其他分布特性。尤其是在混合模型中，它们可能完全不同，因为当您采取差异时，某些变化来源可能会抵消。

不要尝试使用置信区间进行比较。这就像试图用汉堡包做鸡汤。

plot - 线性混合效应模型中的显着交互，但绘图显示重叠置信区间？

1 回答 1

Related

Reference