r - 汇总数据时丢失 R 因子组织 (dplyr)

Question

我试图通过几个分类因素以及日期来总结一个数字响应变量（地上生物量 [AGB]）作为更大项目的一部分。日期被读取为一个字符，并被组织为 2020 年 4 月 10 日、2020 年 4 月 8 日、2020 年 4 月 9 日。此外，还有一列，Shoot.Plot，编号为 1-11，按顺序排列：1、10、11、2... 等等，因为它被读取为字符串（这在大多数情况下都很好）从奇怪的顺序）。我已将因素重新调整为我想要的，但是当我使用get_summary_stats()from rstatix()package 或 using总结数据时summarize()，级别组织会丢失。

这是我尝试过的：

df %>% 
  mutate(Date.Coll, factor(Date.Coll, levels = c("4/8/2020","4/9/2020","4/10/2020")), 
         Shoot.Plot, factor(Shoot.Plot, levels = 
                              c("1","2","3","4","5","6","7","8","9","10","11"))) %>%
  group_by(Date.Coll, Site, Eelgrass, Oyster, Shoot.Plot) %>%
  filter(is.na(BGB),
         Date.Coll=="4/8/2020" | Date.Coll=="4/9/2020" | Date.Coll=="4/10/2020") %>% 
  select(AGB) %>% 
  get_summary_stats(type="mean_se")

当我检查get_summary_stats()行前的数据框时，数据按照我在 mutate 函数中指定的方式组织。只有在总结之后，这两个才走出窗外。

有什么建议么？谢谢！

score 0 · Accepted Answer

当您放置时select(AGB)，您将删除数据框中的所有其他列。如果要指定要汇总的变量，请将其放入get_summary_stats. 没有数据，我无法检查它是否有效。尝试这个：

df %>% 
  mutate(Date.Coll, factor(Date.Coll, levels = c("4/8/2020","4/9/2020","4/10/2020")), 
         Shoot.Plot, factor(Shoot.Plot, levels = c("1","2","3","4","5","6","7","8","9","10","11"))) %>%
  group_by(Date.Coll, Site, Eelgrass, Oyster, Shoot.Plot) %>%
  filter(is.na(BGB),
         Date.Coll=="4/8/2020" | Date.Coll=="4/9/2020" | Date.Coll=="4/10/2020") %>% 
  get_summary_stats(AGB, type="mean_se")

r - 汇总数据时丢失 R 因子组织 (dplyr)

1 回答 1

Related

Reference