0

我有一个 Oracle 表,其中包含类似于以下基本示例的数据:

+--------+----------+
| SERIES | CATEGORY |
+--------+----------+
| green  | apple    |
| green  | pear     |
| green  | pear     |
| yellow | apple    |
| yellow | apple    |
| yellow | pear     |
| yellow | pear     |
| yellow | pear     |
| yellow | banana   |
| yellow | banana   |
| yellow | banana   |
| red    | apple    |
+--------+----------+

我想为这些数据生成一个类似于 Pareto 的图表,它应该看起来像 Stacked Pareto Chart,

在此处输入图像描述

要创建此图,我想运行 SQL 查询并获得以下输出:

+----------+--------+-------+
| CATEGORY | SERIES | COUNT |
+----------+--------+-------+
| pear     | green  |     2 |
| pear     | yellow |     3 |
| apple    | green  |     1 |
| apple    | yellow |     2 |
| apple    | red    |     1 |
| banana   | yellow |     3 |
+----------+--------+-------+

实际表有数百万个条目,目前查询数据库需要大量时间,因为我正在使用的当前过程效率不高:

按每个类别中的条目数量对类别进行排序:

SELECT CATEGORY, COUNT(CATEGORY) FROM FRUIT GROUP BY CATEGORY ORDER BY COUNT(CATEGORY);

然后对于每个类别,我按系列的顺序列出相关系列:

SELECT SERIES, COUNT(SERIES) FROM FRUIT WHERE CATEGORY = [current category] GROUP BY SERIES ORDER BY SERIES;

查询数据库(最好是单个 SQL 语句)以获得所需输出的最有效方法是什么?

4

2 回答 2

0

CATEGORY您可以通过对和进行分组来获得所需的结果SERIES

SELECT 
    CATEGORY, SERIES, COUNT(*) 
FROM FRUIT 
GROUP BY CATEGORY, SERIES 
ORDER BY COUNT(*);

更新:

要先按总数排序,CATEGORY然后按绿色、黄色、红色排序,就像您的预期输出一样:

SELECT t1.*
FROM (
    SELECT 
        CATEGORY, SERIES, COUNT(*) AS CNT
    FROM FRUIT 
    GROUP BY CATEGORY, SERIES 
) t1
INNER JOIN (
    SELECT
        CATEGORY, COUNT(*) AS CNT
    FROM FRUIT
    GROUP BY CATEGORY
) t2
    ON t1.CATEGORY = t2.CATEGORY
ORDER BY 
    t2.CNT DESC,
    CASE t1.SERIES
        WHEN 'green' THEN 1
        WHEN 'yellow' THEN 2
        WHEN 'red' THEN 3
    END
于 2016-05-26T07:26:09.527 回答
0

一些较短的版本:

select category, series, CntS 
from (
  select  distinct count(category) over (partition by category) cntC,
   count(series)  over (partition by category, series ) cntS,
   category, series 
from fruit   ) Tab
order by CntC desc, cntS desc; 
于 2016-05-26T07:59:18.320 回答