
I am running this query on Treasure-data.com (HiveQL):

  SELECT
    ngrams (
      sentences (
        LOWER( v[ 'text' ] )
      )
      ,1
      ,100
    )
  FROM
    item
  WHERE
    TD_TIME_RANGE (
      TIME
      ,"2013-08-06"
      ,TD_TIME_ADD (
        "2013-08-06"
        ,"1d"
      )
    )

When the result is supposed to be dumped into PostgreSQL, I get this error:

started at 2013-08-06T17:42:52Z
Hive history file=/mnt/hive/tmp/2851/hive_job_log__1663452325.txt
finished at 2013-08-06T17:43:27Z
Exception in thread "main" java.lang.RuntimeException: invalid schema: unexpected '>' but got: <ngram:array<string>,estfrequency:double>>
    at com.treasure_data.type.spec.TypeSpecs.scanString(TypeSpecs.java:98)
    at com.treasure_data.type.spec.TypeSpecs.fromString(TypeSpecs.java:72)
    at com.treasure_data.type.spec.TypeSpecs.fromString(TypeSpecs.java:64)
    at com.treasure_data.type.spec.TypeSpecs.fromString(TypeSpecs.java:57)
    at com.treasure_data.hadoop.worker.result.ResultWorker.<init>(ResultWorker.java:59)
    at com.treasure_data.hadoop.worker.result.ResultWorker.main(ResultWorker.java:154)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:208)

I guess I need to modify my SELECT statement somehow.


1 Answer


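Explode! ngrams() returns a single array of structs (the ngram / estfrequency schema shown in the error message), which the result export apparently cannot parse. explode() turns that array into one row per element: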

SELECT explode(ngrams(sentences(lower(val)), 2, 10)) AS x FROM kafka;
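
Applied to the query from the question, the same idea might look like the sketch below. It is untested and rests on a few assumptions: that the struct fields are named ngram and estfrequency (taken from the error message), that concat_ws() accepts an array<string> in the Hive version Treasure Data runs, and that flattening every column to a plain string or double is enough for the PostgreSQL export. The aliases x and t are placeholders.

```sql
-- Sketch only: flatten the ngrams() result into plain scalar columns.
-- Field names ngram / estfrequency are assumed from the error message.
SELECT
  concat_ws(' ', t.x.ngram) AS ngram,        -- array<string> joined into one string
  t.x.estfrequency          AS estfrequency  -- double
FROM (
  SELECT
    explode(ngrams(sentences(LOWER(v['text'])), 1, 100)) AS x
  FROM
    item
  WHERE
    TD_TIME_RANGE(time, "2013-08-06", TD_TIME_ADD("2013-08-06", "1d"))
) t;
```

The inner query produces one struct per row; the outer query then picks the struct apart so that no nested types remain in the result schema.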

Answered 2013-08-07T04:07:28.783