问题标签 [pybigquery]

问问题

For questions regarding programming in ECMAScript (JavaScript/JS) and its various dialects/implementations (excluding ActionScript). Note JavaScript is NOT the same as Java! Please include all relevant tags on your question; e.g., [node.js], [jquery], [json], [reactjs], [angular], [ember.js], [vue.js], [typescript], [svelte], etc.

15 问题

0 投票

1 回答

94 浏览

google-bigquery - How to know the number of queries per user in Bigquery?

I want to know:

how to get the number of queries a user makes per month in BigQuery?
how many queries a specific user makes, how many per year.

I found the command

with that bring the number of jobs and with the comand bq show get the detail, but it does not bring me all the elements that I need, I would like to know if someone knows another way to achieve this.

google-bigquery pybigquery

2021-08-18T23:44:20.547

0 投票

1 回答

154 浏览

google-cloud-platform - 向表中添加新列后，BigQuery 批量更新 json 文件中的现有表行

我有一个 bigQuery 表，其中包含从 JSON 文件加载的大约 2M 行。JSON 文件中的实际字段为 10，但根据初始 DDL，表有 7 列。现在我更改了表格并添加了剩余的三列。更改后，新添加的列中的值用 NULL 填充。

现在我想回填现有 2M 行中的数据，但只有这三个新添加的列包含来自 json 文件的实际数据。我如何批量更新表，以便现有的列值保持不变并且只更新新的列值。

注意：表启用了流缓冲区，并且表未分区。

2021-08-27T14:44:10.810

0 投票

1 回答

157 浏览

python - 引用变量 'ro_sub_ros.$is_not_null' 的级别为 1，而 Parquet 列的对应字段路径有 0 个重复字段

BigQuery Python: google.api_core.exceptions.BadRequest: 400 Error while reading data, error message: Schema mismatch: referenced variable 'ro_sub_ros.$is_not_null' 的数组级别为 1，而 Parquet 列的对应字段路径有 0 个重复字段。

我的原始数据如下所示：

以下是我创建 BQ Schema 的方式：

当我尝试使用上传此数据时，bigquery client library出现此错误：

抛出：

不知道这里出了什么问题，如果我的架构太大而无法分析，有人可以展示一个REPEATED RECORD使用客户端库和熊猫数据框在谷歌大查询中上传的最小示例吗？

python google-bigquery pybigquery

2021-10-09T13:42:42.380

0 投票

0 回答

56 浏览

google-bigquery - Apache 超集连接到 BigQuery - BigQuery 数据库未显示在 UI 上

我想将 BigQuery 连接到 Apache 超集。我安装了 pybigquery (pip install pybigquery) 并重新启动了服务器，但我仍然无法在超集 UI 上看到 BigQuery（在数据库下）

google-bigquery apache-superset pybigquery

2021-12-23T04:59:14.917

0 投票

0 回答

68 浏览

google-bigquery - BigQuery 不匹配列

我在 Google 云存储中有多个 .CSV 文件，但我无法将数据推送到表中，因为 .CSV 文件中的列数不同。

假设我有 10 个带有 columns 的 .CSV 文件A, B, C, D，我还有另外 10 个带有 columns 的 .CSV 文件A, B, D。

当我将 .CSV 文件推送到 Big Query 表中时，我想创建A column to A,和B column To B, 。C column to Cmaking NULL for other 10 .CSVD column to D

假设 10 个 .CSV 文件有列：

另一个 10 .CSV 文件有列：

但在大查询表最终结果中我想要：

谁能帮我这个？

google-bigquery pybigquery

2022-01-12T20:02:09.320

1 2 3 4 5 6 7 8 9 10

问题标签 [pybigquery]

google-bigquery - How to know the number of queries per user in Bigquery?

google-cloud-platform - 向表中添加新列后，BigQuery 批量更新 json 文件中的现有表行

python - 引用变量 'ro_sub_ros.$is_not_null' 的级别为 1，而 Parquet 列的对应字段路径有 0 个重复字段

google-bigquery - Apache 超集连接到 BigQuery - BigQuery 数据库未显示在 UI 上

google-bigquery - BigQuery 不匹配列

Reference