0

我正在尝试使用示例架构和一些示例数据从 Perl 上传到 BigQuery。在他们提供的文档之后,我遇到了死胡同,所以现在我试图模仿bq命令行客户端成功完成的工作。

我正在bq通过print (method, uri, headers, body)request. httplib2我正在通过对响应执行 a 来跟踪我的 Perl 库正在做什么Dumper,其中还包括_request我发送的响应。其中的模式bq是他们POST到一个上传 URL,然后返回一个数据到locationPUT通过一系列GET请求监控相应的作业,最后它们做出响应。

在 Perl 中,我POST成功了,而我GET失败了Invalid Upload Request(但没有提示为什么它无效)。我试图弄清楚两者之间的差异可以解释我的失败。但我找不到它。

这是(省略了 access_token、IP 地址和 project_id)我得到的跟踪。

对于POST来自 Python 的信息是:

(
    u'POST',
    u'https://www.googleapis.com/upload/bigquery/v2/projects/<project ID>/jobs?uploadType=resumable&alt=json',
    {
        'content-length': '442',
        'accept-encoding': 'gzip, deflate',
        'accept': 'application/json',
        'user-agent': u'bq/2.0 google-api-python-client/1.0',
        'X-Upload-Content-Length': '84',
        'X-Upload-Content-Type': 'application/octet-stream',
        'content-type': 'application/json',
        'Authorization': u'Bearer <access token>'
    },
    '{"configuration": {"load": {"sourceFormat": "NEWLINE_DELIMITED_JSON", "destinationTable": {"projectId": "<project id>", "tableId": "demo_api", "datasetId": "tmp_bt"}, "maxBadRecords": 0, "schema": {"fields": [{"type": "STRING", "mode": "required", "name": "demo_string"}, {"type": "INTEGER", "mode": "required", "name": "demo_integer"}]}}}, "jobReference": {"projectId": "<project id>", "jobId": "bqjob_r139e633b7e522cf7_0000014031d9fb49_1"}}'
)

相应的 Perl 得到一个明显成功的响应对象(您可以在其中看到_request):

$VAR1 = bless( {
    '_protocol' => 'HTTP/1.1',
    '_content' => '',
    '_rc' => '200',
    '_headers' => bless( {
        'connection' => 'close',
        'client-response-num' => 1,
        'location' => 'https://www.googleapis.com/upload/bigquery/v2/projects/<project id>/jobs?uploadType=resumable&upload_id=AEnB2Ur0mdwmZpMot6ftkgj1IkqK0f7oPbZrXWQekUDHK_E2o2HKznJO6DK2xPYCB-nhUGrMrEJJ7z1Tz9Crnka9e5EYGP1lWQ',
        'date' => 'Tue, 06 Aug 2013 20:46:05 GMT',
        'client-ssl-cert-issuer' => '/C=US/O=Google Inc/CN=Google Internet Authority',
        'client-ssl-cipher' => 'RC4-SHA',
        'client-peer' => '<some ip>:443',
        'content-length' => '0',
        'client-date' => 'Tue, 06 Aug 2013 20:46:05 GMT',
        'content-type' => 'text/html; charset=UTF-8',
        'client-ssl-cert-subject' => '/C=US/ST=California/L=Mountain View/O=Google Inc/CN=*.googleapis.com',
        'server' => 'HTTP Upload Server Built on Jul 24 2013 17:20:01 (1374711601)',
        'client-ssl-socket-class' => 'IO::Socket::SSL'
    }, 'HTTP::Headers' ),
    '_msg' => 'OK',
    '_request' => bless( {
        '_content' => '{"configuration":{"load":{"maxBadRecords":0,"destinationTable":{"datasetId":"tmp_bt","tableId":"perl","projectId":<project id>},"sourceFormat":"NEWLINE_DELIMITED_JSON","schema":{"fields":[{"mode":"required","name":"demo_string","type":"STRING"},{"mode":"required","name":"demo_integer","type":"INTEGER"}]}}},"jobReference":{"projectId":<project id>,"jobId":"perlapi_1375821964"}}',
        '_uri' => bless( do{\(my $o = 'https://www.googleapis.com/upload/bigquery/v2/projects/<project id>/jobs?uploadType=resumable')}, 'URI::https' ),
        '_headers' => bless( {
            'user-agent' => 'libwww-perl/6.05',
            'content-type' => 'application/json',
            'accept' => 'application/json',
            ':X-Upload-Content-Type' => 'application/octet-stream',
            'content-length' => 379,
            ':X-Upload-Content-Length' => '84',
            'authorization' => 'Bearer <access token>'
        }, 'HTTP::Headers' ),
        '_method' => 'POST',
        '_uri_canonical' => $VAR1->{'_request'}{'_uri'}
    }, 'HTTP::Request' )
}, 'HTTP::Response' );

然后我们有一个PUT. 在 Python 方面,我们发送了:

(
    'PUT',
    'https://www.googleapis.com/upload/bigquery/v2/projects/<project id>/jobs?uploadType=resumable&alt=json&upload_id=AEnB2UpWMRCAOffqyR0d7zvGVtD-KWhrC9jGB-q_igecJgoyz_mIHgEFfs9cYoPxUwUxuflQScMzGxDsKKJ_CJPQq4Os-AkdZA',
     {
         'Content-Range': 'bytes 0-83/84',
         'Content-Length': '84',
         'Authorization': u'Bearer <access token>',
         'user-agent': u'bq/2.0'
    },
    <apiclient.http._StreamSlice object at 0x10ce11150>
)

(我已经验证了流切片对象与 Perl 具有相同的 84 个字节。)这是 Perl 的失败:

$VAR1 = bless( {
    '_protocol' => 'HTTP/1.1',
    '_content' => '{
 "error": {
  "errors": [
   {
    "domain": "global",
    "reason": "badRequest",
    "message": "Invalid Upload Request"
   }
  ],
  "code": 400,
  "message": "Invalid Upload Request"
 }
}
',
    '_rc' => '400',
    '_headers' => bless( {
        'connection' => 'close',
        'client-response-num' => 1,
        'date' => 'Tue, 06 Aug 2013 20:46:07 GMT',
        'client-ssl-cert-issuer' => '/C=US/O=Google Inc/CN=Google Internet Authority',
        'client-ssl-cipher' => 'RC4-SHA',
        'client-peer' => '<some IP address>:443',
        'content-length' => '193',
        'client-date' => 'Tue, 06 Aug 2013 20:46:07 GMT',
        'content-type' => 'application/json',
        'client-ssl-cert-subject' => '/C=US/ST=California/L=Mountain View/O=Google Inc/CN=*.googleapis.com',
        'server' => 'HTTP Upload Server Built on Jul 24 2013 17:20:01 (1374711601)',
        'client-ssl-socket-class' => 'IO::Socket::SSL'
    }, 'HTTP::Headers' ),
    '_msg' => 'Bad Request',
    '_request' => bless( {
        '_content' => '{"demo_string":"foo", "demo_integer":"2"}
{"demo_string":"bar", "demo_integer":"3"}
',
        '_uri' => bless( do{\(my $o = 'https://www.googleapis.com/upload/bigquery/v2/projects/<project id>/jobs?uploadType=resumable&upload_id=AEnB2Ur0mdwmZpMot6ftkgj1IkqK0f7oPbZrXWQekUDHK_E2o2HKznJO6DK2xPYCB-nhUGrMrEJJ7z1Tz9Crnka9e5EYGP1lWQ')}, 'URI::https' ),
        '_headers' => bless( {
            'user-agent' => 'libwww-perl/6.05',
            ':Content-Length' => '84',
            ':Content-Range' => '0-83/84',
            'content-length' => 84,
            'authorization' => 'Bearer <access token>'
        }, 'HTTP::Headers' ),
        '_method' => 'PUT',
        '_uri_canonical' => $VAR1->{'_request'}{'_uri'}
    }, 'HTTP::Request' )
}, 'HTTP::Response' );

我应该尝试在 Perl 方面进行哪些更改以使 BigQuery 像它一样响应我bq

4

2 回答 2

1

您的一些 PUT 标头前面有冒号,而 Python 没有:

':Content-Length' => '84',
':Content-Range' => '0-83/84',
于 2013-08-07T12:33:44.020 回答
0

我怀疑分段上传请求中存在格式错误。错误“上载请求无效”是对尝试将数据有效负载从多部分 mime 消息中拆分出来的响应。您的日志记录不包含请求正文的详细信息,因此我们无法并排比较它们是否存在意外差异。

为确保问题出在分段上传,您可以尝试从 Google Storage 加载数据的加载请求,而不是将数据包含在请求负载本身中。这将验证 perl api 请求路径是否适合您。

仅供参考:有一个 alpha Perl Google APIs 客户端可以帮助您。我没有尝试过,不知道它是否正在积极开发中,但你可能会在那里找到一些有用的提示。查看https://code.google.com/p/google-api-perl-client/

于 2013-08-26T19:13:36.067 回答