我想使用 HuggingFace 使用预训练"xlm-mlm-xnli15-1024"
模型的转换器将中文翻译成英文。本教程展示了如何从英语到德语。
我尝试按照教程进行操作,但没有详细说明如何手动更改语言或解码结果。我不知道从哪里开始。抱歉,这个问题再具体不过了。
这是我尝试过的:
from transformers import AutoModelWithLMHead, AutoTokenizer
base_model = "xlm-mlm-xnli15-1024"
model = AutoModelWithLMHead.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model)
inputs = tokenizer.encode("translate English to Chinese: Hugging Face is a technology company based in New York and Paris", return_tensors="pt")
outputs = model.generate(inputs, max_length=40, num_beams=4, early_stopping=True)
print(tokenizer.decode(outputs.tolist()[0]))
'<s>translate english to chinese : hugging face is a technology company based in new york and paris </s>china hug ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™ ™'