r - 使用 ggplot/plyr 通过条形图的总和对条形图重新排序

Question

我需要将以下堆叠条形图中的 11 个条按每个条的前两个段的总和重新排序，即按图中的（红色+绿色）段排序。

> dput(q1m.bl)
structure(list(ItemA = structure(c(1L, 2L, 3L, 4L, 1L, 2L, 3L, 
4L, 1L, 2L, 3L, 4L, 2L, 3L, 4L, 1L, 2L, 3L, 4L, 1L, 2L, 3L, 4L, 
1L, 2L, 3L, 4L, 2L, 3L, 4L, 1L, 2L, 3L, 4L, 1L, 2L, 3L, 4L, 1L, 
2L, 3L, 4L), .Label = c("sehr wichtig", "wichtig", "unwichtig", 
"keine Angabe"), class = "factor"), ItemQ = structure(c(1L, 1L, 
1L, 1L, 2L, 2L, 2L, 2L, 3L, 3L, 3L, 3L, 4L, 4L, 4L, 5L, 5L, 5L, 
5L, 6L, 6L, 6L, 6L, 7L, 7L, 7L, 7L, 8L, 8L, 8L, 9L, 9L, 9L, 9L, 
10L, 10L, 10L, 10L, 11L, 11L, 11L, 11L), .Label = c("PUSHERS_AA", 
"PUSHERS_COM", "PUSHERS_BED", "PUSHERS_SEC", "PUSHERS_STAB", 
"PUSHERS_COST", "PUSHERS_INNO", "PUSHERS_VAL", "PUSHERS_INDEP", 
"PUSHERS_STDS", "PUSHERS_SRC"), class = "factor"), Counts = c(1L, 
3L, 4L, 1L, 3L, 3L, 2L, 1L, 4L, 2L, 2L, 1L, 3L, 5L, 1L, 1L, 1L, 
6L, 1L, 5L, 1L, 2L, 1L, 1L, 1L, 6L, 1L, 2L, 6L, 1L, 2L, 4L, 2L, 
1L, 3L, 3L, 2L, 1L, 2L, 1L, 5L, 1L), blpos = c(0.111111111111111, 
0.444444444444444, 0.888888888888889, 1, 0.333333333333333,   0.666666666666667, 
0.888888888888889, 1, 0.444444444444444, 0.666666666666667, 0.888888888888889, 
1, 0.333333333333333, 0.888888888888889, 1, 0.111111111111111, 
0.222222222222222, 0.888888888888889, 1, 0.555555555555556, 0.666666666666667, 
0.888888888888889, 1, 0.111111111111111, 0.222222222222222, 0.888888888888889, 
1, 0.222222222222222, 0.888888888888889, 1, 0.222222222222222, 
0.666666666666667, 0.888888888888889, 1, 0.333333333333333, 0.666666666666667, 
0.888888888888889, 1, 0.222222222222222, 0.333333333333333, 0.888888888888889, 
1)), .Names = c("ItemA", "ItemQ", "Counts", "blpos"), row.names = c(NA, 
-42L), class = "data.frame")

剧情 ...

ggplot(q1m.bl, aes(x = ItemQ, y = Counts, fill = ItemA)) + 
geom_bar(stat="identity", position="fill") + 
geom_text(aes(y = blpos, label = Counts), hjust = 1) +
theme(axis.text.x=element_text(angle=90, hjust = 0), text = element_text(size=10)) +
coord_flip()

呃，没有足够的代表点来嵌入图像。带来不便敬请谅解。情节在这里：http: //i.stack.imgur.com/am0Ud.png

我玩了安排（），在检查了数据框本身之后，我认为下面的排序应该可以解决问题。（注意：blpos 表示“条形标签位置”，是图中各种数字的位置。）但是绘制这个“排序”数据框会导致与上面相同的图。我不明白要更改哪些信息来更改 ItemQ 列的绘图顺序。

q1m.bl.s <- arrange(q1m.bl, ItemA, desc(blpos))
ggplot(q1m.bl.s, ....

无论如何，最好的方法是什么？我应该在绘图之前操纵 df（使用 ddply/arrange/reorder/等）吗？因为我倾向于认为这是一个演示问题，应该在ggplot 中完成。这还重要吗？我在 SO 上找到的“ggplot 有序条形图”问题似乎同时使用了这两种方法；但我发现没有一个是指堆叠条形段并使用因子数据......因此这个新问题。

非常感谢您给我的启发！

score 1 · Accepted Answer

这都是关于重新排序ItemQ变量的因子水平。

d <- subset(q1m.bl, ItemA %in% c("sehr wichtig", "wichtig"))
totals <- aggregate(d$Counts, list(ItemQ = d$ItemQ), sum)
ItemQ.order <- as.character(totals[order(-totals$x), ]$ItemQ)
q1m.bl$ItemQ <- factor(q1m.bl$ItemQ, levels = ItemQ.order)

然后您应该能够完全按照您提供的方式运行代码，它将产生以下内容：在此处输入图像描述

编辑（digisus）：konvas，我只是重新添加您的第一个答案，显示 ddply 的使用，因为即使我对它感到不舒服/不完全理解它，我相信其他人可以从中受益。:-) 所以，在你允许的情况下，我在这里重新发布：

library(plyr)
ItemQ.order <- q1m.bl %>%
  group_by(ItemQ) %>% 
  filter(ItemA %in% c("sehr wichtig", "wichtig")) %>% 
  summarise(total = sum(Counts)) %>%
  arrange(-total) %>% 
  select(ItemQ) %>%
  unlist %>%
  as.character

q1m.bl$ItemQ <- factor(q1m.bl$ItemQ, levels = ItemQ.order)

score 0 · Accepted Answer

library(ggplot2)

fac_ord <- function(seed){
  set.seed(seed)
  return(sample(letters[1:4]))
}

# this seed simulates arbitrary sortings
seed <- 2
fac_ord(seed)

val = c(1,2,3,4,2,2,2,2)
fac = factor(c("a","b","c","d","a","b","c","d"), 
             levels=fac_ord(seed), 
             labels=fac_ord(seed), 
             ordered=FALSE)
dif = c(rep("x",4),rep("y",4))

df  = data.frame(val = val, fac = fac)

ggplot(df, aes(x=fac, y=val, fill=dif)) + 
  geom_bar(stat="identity") + 
  labs(title = sprintf("seed = %d / %s", seed, paste(fac_ord(seed),collapse=",")))

如示例所示 - ggplot 将fac在图中使用与fac. 因此，要影响绘制的顺序，您必须编写一个返回预期顺序的函数 - 取决于任何事实和值 - 并使用它来创建因子fac- 然后使用这个正确排序的因子进行绘图。

在此处输入图像描述

通过应用reorder()对因子的水平进行重新排序，也可以达到预期的结果。

r - 使用 ggplot/plyr 通过条形图的总和对条形图重新排序

2 回答 2

Related

Reference