我正在尝试从 Google Cloud Natural Language API Python Samples 运行 Python 脚本
我没有做任何修改,所以我希望它会起作用。具体来说,我想对文本文件/文档进行实体分析。代码的相关部分如下。
def entities_file(gcs_uri):
"""Detects entities in the file located in Google Cloud Storage."""
client = language_v1beta2.LanguageServiceClient()
# Instantiates a plain text document.
document = types.Document(
gcs_content_uri=gcs_uri,
type=enums.Document.Type.PLAIN_TEXT)
# Detects sentiment in the document. You can also analyze HTML with:
# document.type == enums.Document.Type.HTML
entities = client.analyze_entities(document).entities
# entity types from enums.Entity.Type
entity_type = ('UNKNOWN', 'PERSON', 'LOCATION', 'ORGANIZATION',
'EVENT', 'WORK_OF_ART', 'CONSUMER_GOOD', 'OTHER')
for entity in entities:
print('=' * 20)
print(u'{:<16}: {}'.format('name', entity.name))
print(u'{:<16}: {}'.format('type', entity_type[entity.type]))
print(u'{:<16}: {}'.format('metadata', entity.metadata))
print(u'{:<16}: {}'.format('salience', entity.salience))
print(u'{:<16}: {}'.format('wikipedia_url',
entity.metadata.get('wikipedia_url', '-')))
我已将我的文本文件(utf-8 编码)放在 gs://neotokyo-cloud-bucket/TXT/TTS-01.txt 的云存储中
我正在谷歌云外壳中运行脚本。当我运行文件时:
python snippets.py entities-file gs://neotokyo-cloud-bucket/TXT/TTS-01.txt
我收到以下错误,这似乎与 protobuf 相关。
[libprotobuf ERROR google/protobuf/wire_format_lite.cc:629].
String field 'google.cloud.language.v1beta2.TextSpan.content'
contains invalid UTF-8 data when parsing a protocol buffer.
Use the 'bytes' type if you intend to send raw bytes.
ERROR:root:Exception deserializing message!
Traceback (most recent call last):
File "/usr/local/lib/python2.7/dist-packages/grpc/_common.py", line 87, in _transform
return transformer(message)
DecodeError: Error parsing message
Traceback (most recent call last):
File "snippets.py", line 336, in <module>
entities_file(args.gcs_uri)
File "snippets.py", line 114, in entities_file
entities = client.analyze_entities(document).entities
File "/usr/local/lib/python2.7/dist- packages/google/cloud/language_v1beta2/gapic/language_service_client.py", line 226, in analyze_entities
return self._analyze_entities(request, retry=retry, timeout=timeout)
File "/usr/local/lib/python2.7/dist-packages/google/api_core/gapic_v1/method.py", line 139, in __call__
return wrapped_func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/google/api_core/retry.py", line 260, in retry_wrapped_func
on_error=on_error,
File "/usr/local/lib/python2.7/dist-packages/google/api_core/retry.py", line 177, in retry_target
return target()
File "/usr/local/lib/python2.7/dist-packages/google/api_core/timeout.py", line 206, in func_with_timeout
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/google/api_core/grpc_helpers.py", line 56, in error_remapped_callable
six.raise_from(exceptions.from_grpc_error(exc), exc)
File "/usr/local/lib/python2.7/dist-packages/six.py", line 737, in raise_from
raise value
google.api_core.exceptions.InternalServerError: 500 Exception deserializing response!
我不知道protobuf,任何帮助表示赞赏!