Questions tagged [amazon-redshift-spectrum]

247 questions

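0 votes

2 answers

9115 views

amazon-web-services - How to generate a 12-digit unique number in Redshift?

I have three columns in a table: email_id, rid, and final_id.

Rules for rid and final_id:

If an email_id has a corresponding rid, that rid is used as the final_id.
If an email_id has no corresponding rid (i.e., rid is null), a unique 12-digit number is generated and inserted into the final_id field.

How can I generate a 12-digit unique number in Redshift?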
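The two rules above can be sketched in plain Python to make the intended behaviour concrete. This is only an illustration of the logic, not a Redshift solution from the thread: the function name `assign_final_ids`, the constants, and the choice of a sequential counter starting at the smallest 12-digit number are all assumptions made for the example.

```python
from itertools import count

FIRST_ID = 100_000_000_000  # smallest 12-digit number (assumed starting point)
LAST_ID = 999_999_999_999   # largest 12-digit number


def assign_final_ids(rows, start=FIRST_ID):
    """Return (email_id, rid, final_id) tuples for an iterable of (email_id, rid).

    Rule 1: if rid is present, it becomes final_id.
    Rule 2: if rid is None, the next unused 12-digit number becomes final_id.
    """
    rows = list(rows)
    # Existing rids must never be handed out again as generated ids.
    taken = {rid for _, rid in rows if rid is not None}
    candidates = count(start)
    result = []
    for email_id, rid in rows:
        if rid is not None:
            final_id = rid
        else:
            # The counter only moves forward, so generated ids never repeat.
            final_id = next(c for c in candidates if c not in taken)
            if final_id > LAST_ID:
                raise OverflowError("ran out of 12-digit numbers")
        result.append((email_id, rid, final_id))
    return result
```

In SQL the same idea would usually be expressed by combining the existing rid with a generated sequence offset by the smallest 12-digit number; the exact statement depends on the table and is not given in the question.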

user8147906

2017-10-05T06:06:35.037

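0 votes

1 answer

953 views

amazon-s3 - Redshift Spectrum: partitioning a table by two date fields

I am looking for best practices for creating partitions by date with amazon-redshift-spectrum, but the examples only show the problem solved by partitioning the table on a single date. What should I do if I have multiple date fields?

For example: mobile events with user_install_date and event_date.

How well does partitioning your S3 data like this perform:

Will it kill my SELECT performance? What is the best strategy in this case?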

amazon-s3 amazon-redshift amazon-redshift-spectrum

2017-10-06T21:36:44.037

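0 votes

1 answer

3201 views

python-3.x - Unloading multiple files from Redshift to S3

Hi, I am trying to unload multiple tables from Redshift to a specific S3 bucket, but I get the following error:

If I add the "allowoverwrite" option to unload_function, it overwrites the earlier tables, so only the last table is left unloaded in S3.

Here is my code: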
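One plausible cause of the overwriting described above is that every table is unloaded to the same S3 prefix. The sketch below is not the poster's code; it only shows how each table could get its own prefix. The helper name `build_unload_sql`, the `unload` base prefix, and the bucket and role values in the usage example are hypothetical.

```python
import re

# Accept only plain "table" or "schema.table" names, since the name is
# interpolated directly into the SQL text.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)?$")


def build_unload_sql(table, bucket, iam_role, base_prefix="unload"):
    """Build an UNLOAD statement that writes `table` under its own S3 prefix."""
    if not _IDENTIFIER.match(table):
        raise ValueError(f"unexpected table name: {table!r}")
    # One prefix per table, so ALLOWOVERWRITE only replaces that table's files.
    target = f"s3://{bucket}/{base_prefix}/{table}/"
    return (
        f"UNLOAD ('SELECT * FROM {table}') "
        f"TO '{target}' "
        f"IAM_ROLE '{iam_role}' "
        "ALLOWOVERWRITE;"
    )


if __name__ == "__main__":
    # Hypothetical bucket and role, for illustration only.
    for name in ("orders", "customers"):
        print(build_unload_sql(name, "my-bucket", "arn:aws:iam::123456789012:role/demo"))
```

Each generated statement can then be passed to whatever cursor the existing unload function already uses.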

python-3.x amazon-s3 amazon-redshift amazon-redshift-spectrum

2017-10-10T16:30:03.460

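0 votes

2 answers

1161 views

python - How to add partitions in Redshift Spectrum using Psycopg2

We have a Redshift Spectrum table built on top of S3 data, and we are trying to automate adding partitions to that table. I can run the following ALTER statement from a Redshift client or the psql shell:

But it cannot be executed through psycopg2.

With psycopg2, the query is not even sent to Redshift; execution fails during query parsing.

For now I have implemented this with subprocess.Popen to execute the ALTER statement, but I would like to switch back to psycopg2.

Suggestions/ideas?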

Thanks, Hussain Bohra
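A commonly reported cause of this kind of failure is that psycopg2 opens a transaction implicitly, while Redshift does not allow ALTER TABLE on an external table inside a transaction block. Whether that is what happened here is an assumption, since the statement and error are not shown. The sketch below runs a statement with autocommit enabled; the helper name `run_outside_transaction` is hypothetical, and it works with any connection object exposing psycopg2's `autocommit` attribute and `cursor()` method.

```python
def run_outside_transaction(conn, sql):
    """Execute `sql` with autocommit on, then restore the previous setting.

    `conn` is expected to behave like a psycopg2 connection. Call this on a
    connection with no transaction in progress, because psycopg2 refuses to
    change autocommit in the middle of a transaction.
    """
    previous = conn.autocommit
    conn.autocommit = True  # no implicit BEGIN is sent before the statement
    try:
        with conn.cursor() as cur:
            cur.execute(sql)
    finally:
        conn.autocommit = previous
```

With a real connection this would be called as `run_outside_transaction(psycopg2.connect(...), alter_statement)`, where the connection parameters and the ALTER statement are whatever the existing job already uses.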

python psycopg2 amazon-redshift-spectrum

2017-10-19T16:12:46.103

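0 votes

3 answers

1943 views

database - I need to copy a database from one Redshift cluster to another

I need help moving a database from one Redshift cluster to another Redshift cluster. I am not copying individual tables here; I want to copy an entire database. Can someone help me with this?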

database amazon-redshift amazon-redshift-spectrum

2017-10-20T15:38:45.443

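0 votes

1 answer

1635 views

amazon-athena - Getting 0 rows when querying an external table in Redshift

We created a schema as follows:

and a table as follows:

Access is set up as follows:

A role with AthenaQuickSight access, full Athena access, and full S3 access is attached to the Redshift cluster.

However, when we query as follows, we get 0 records. Please help.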

amazon-athena amazon-redshift-spectrum

2017-10-31T13:13:46.863

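0 votes

2 answers

499 views

amazon-redshift - How to convert a varchar field to a timestamp with time zone field in Redshift?

I have a table in which a timestamp is stored as varchar. I need to convert it to timestamp with time zone, but every time I get an "Invalid operation" error.

The field has the following format:

I tried the following:

All of them gave an error like this:

Can someone help?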

amazon-redshift timestamp-with-timezone amazon-redshift-spectrum

2017-11-07T11:11:09.970

0 votes

2 answers

12436 views

amazon-s3 - Redshift Spectrum: Automatically partition tables by date/folder

We currently generate a daily CSV export that we upload to an S3 bucket, into the following structure:

We want to be able to run reports partitioned by daily export.

According to this page, you can partition data in Redshift Spectrum by a key which is based on the source S3 folder where your Spectrum table sources its data. However, from the example, it looks like you need an ALTER statement for each partition:

Is there any way to set the table up so that data is automatically partitioned by the folder it comes from, or do we need a daily job to ALTER the table to add that day's partition?
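If a daily job does turn out to be necessary, the statements it needs can be generated from dates. The sketch below is an illustration only: the helper name `partition_statements`, the one-folder-per-ISO-date layout, and the table, column, and bucket names in the usage example are assumptions, because the actual S3 structure is not shown in the question.

```python
from datetime import date, timedelta


def partition_statements(table, column, base_location, first_day, last_day):
    """Return one ALTER TABLE ... ADD PARTITION statement per day, inclusive.

    Assumes each day's export lives in a folder named after the ISO date
    directly under `base_location` (a hypothetical layout).
    """
    statements = []
    day = first_day
    while day <= last_day:
        stamp = day.isoformat()
        location = f"{base_location.rstrip('/')}/{stamp}/"
        # IF NOT EXISTS keeps the job safe to re-run for the same day.
        statements.append(
            f"ALTER TABLE {table} ADD IF NOT EXISTS "
            f"PARTITION ({column}='{stamp}') LOCATION '{location}';"
        )
        day += timedelta(days=1)
    return statements


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    for stmt in partition_statements(
        "spectrum.daily_export", "export_date", "s3://my-bucket/exports",
        date(2017, 11, 7), date(2017, 11, 8),
    ):
        print(stmt)
```

Passing the same date for `first_day` and `last_day` gives the single statement a daily job would run; a wider range can backfill partitions for folders that already exist.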

amazon-s3 amazon-redshift amazon-redshift-spectrum

2017-11-08T16:16:01.213

0 votes

1 answer

610 views