1

我有具有以下结构的输入表 - ID、日期、值。

我正在尝试为数据集中的每条记录计算过去 10 个月的最小值。为此,我正在使用range between interval.

下面的代码在 SPARK SQL 中运行良好,但由于某种原因,我不能在雪花 SQL 中使用相同的代码。感谢有人可以指导我如何修改以下代码以在 Snowflake SQL 中运行

select *,
min(avg_Value) OVER (
        PARTITION BY ID 
        ORDER BY CAST(Date AS timestamp)  
        RANGE BETWEEN INTERVAL 10 MONTHS PRECEDING AND CURRENT ROW) as min_value_in_last_10_months
from        
(
select  ID,
        Date,
        avg(Value) as avg_Value
from table
group by ID,Date
)
4

2 回答 2

2

Snowflake 支持横向连接,因此一种方法是:

select . . .
from t cross join lateral
     (select avg(t2.value) as avg_value
      from t t2
      where t2.id = t.id and
            t2.date >= t.date - interval 10 month and
            t2.date <= t.date
     ) a
于 2020-10-13T17:38:19.717 回答
0

如果你的结果表中有所有月份,那么你也可以试试这个

select *,
min(avg_Value) OVER (
        PARTITION BY ID 
        ORDER BY CAST(Date AS timestamp)  
        ROWS BETWEEN 9 PRECEDING AND CURRENT ROW) as min_value_in_last_10_months
from        
(
select  ID,
        Date,
        avg(Value) as avg_Value
from table
group by ID,Date
)
于 2021-06-11T20:31:54.287 回答