7

将不胜感激有关如何使我的代码或 binwidth 行为的任何想法。我的数据集包含每隔几个小时“4”和每天“24”收集的时间戳数据点。我正在尝试在左侧绘制 4 小时堆叠直方图,在右侧绘制 24 小时堆叠直方图。因此,我希望右边的 binwidth 比左边的 binwidth 宽 6 倍。但是我用 binwidth 尝试过的一切都没有奏效。x 轴数据 data3$dts 似乎是连续的而不是离散的,但也许我做的不对。

关于数据的重要说明:在右侧绘制的数据,即 hours=24 数据,具有始终为整数的 dts 值。左边的数据,hours=4 数据,具有非整数 dts 值。

             "dts"  "Yes" "No" "Maybe" "hours"  "days"
"258"   15627.668   8       0   1       4   "7 Days"
"259"   15627.832   13      11  18      4   "7 Days"
"260"   15628       34      47  89      4   "7 Days"
"261"   15628       37      47  90      24  "7 Days"
"262"   15628.168   3       0   1       4   "7 Days"
"40"    15571       345     419 674     24  "90 Days"
"41"    15571.5     91      145 130     4   "90 Days"
"42"    15571.668   158     149 284     4   "90 Days"
"43"    15571.832   96      125 260     4   "90 Days"
"44"    15572       55      33  137     4   "90 Days"
"45"    15572       1050    1119 2660   24  "90 Days"

从 pastebin 中提取数据的代码:

library (ggplot2)
library (scales)
library(grid)
library(gridExtra)

color3 <- c("mediumspringgreen","red","grey44")
titles.days <-  c( "7 Days", "90 Days") 
names.facetby <- c ("dts", "hours", "days")

data3 <- read.table ("http://pastebin.com/download.php?i=wUQQUXP4", header=TRUE)
data3.melt <- melt (data3 , id = names.facetby )   
data3.melt$days <- factor (data3.melt$days, levels = titles.days)   #  put the factor in the right order, so the graphs are in the right order

 a <- ggplot     ( data3.melt 
        , aes (       x =  dts  #as.Date( dts , date1970) 
                , y =  value 
                , fill = variable)) +
        opts (axis.text.x=theme_text(angle=0, hjust=1)) +
        scale_fill_manual(values = color3) +
        scale_x_date(labels = date_format("%m/%d\n    %a") ) +
        geom_histogram (stat = "identity", position = "stack", binwidth=6.2) +  
        facet_wrap( days ~ hours, ncol=2, scales="free")            

print(a)        

当前结果,显示右侧图表的 binwidth 太窄:

在此处输入图像描述

4

2 回答 2

3

@justin指向 Hadley Wickham 帖子的链接有答案,即在不同的层中绘制左右图。

更新了在 ggplot 中使用 2 条新的 geom_histogram 线正确绘制的代码:

库(ggplot2) 库(尺度) 库(网格) 库(gridExtra)

color3 <- c("mediumspringgreen","red","grey44")
titles.days <-  c( "7 Days", "90 Days") 
names.facetby <- c ("dts", "hours", "days")

data3 <- read.table ("http://pastebin.com/download.php?i=wUQQUXP4", header=TRUE)
data3.melt <- melt (data3 , id = names.facetby )   
data3.melt$days <- factor (data3.melt$days, levels = titles.days)   #  put the factor in the right order, so the graphs are in the right order



 a <- ggplot     ( data3.melt 
        , aes (       x =  dts  #as.Date( dts , date1970) 
                , y =  value 
                , fill = variable)) +
        opts (axis.text.x=theme_text(angle=0, hjust=1)) +
        scale_fill_manual(values = color3) +
        scale_x_date(labels = date_format("%m/%d\n%a") ) +

    # bad idea, good ideas follow   geom_histogram (stat = "identity", position = "stack", binwidth=6.2) +  #, breaks = breaks.x
        geom_histogram (data =  subset(data3.melt, hours == 4),  stat = "identity", position = "stack", binwidth=0.3) + #, breaks = breaks.x
        geom_histogram (data =  subset(data3.melt, hours == 24),  stat = "identity", position = "stack", binwidth=0.9) +    #, breaks = breaks.x

        facet_wrap( days ~ hours, ncol=2, scales="free")            

print(a)        # plot the thing

修正图: http://imgur.com/9j1Xz

于 2012-10-19T15:06:59.777 回答
1

这些垃圾箱实际上是相同的宽度。不同之处在于 90 天的地块中有更多的 bin。

您可以通过设置看到这scales="free_y"一点facet_wrap

你也可以看看这篇文章,它描述了一种潜在的技术来做你正在寻找的事情。

于 2012-10-19T14:12:47.877 回答