8

Is there a module for handling TMX(Translation Memory eXchange) files in python, if not, what would be another way to do it?

As it stands, I have a giant 2gb file with French-English subtitles. Would it be possible to even handle such a file or would I have to break it down?

4

2 回答 2

5

正如@hurrial 所说,您可以使用translate-toolkit

安装

此工具包仅可使用 pip。要安装它,请运行:

pip install translate-toolkit

用法

假设您有以下简单sample.tmx文件:

<tmx version="1.4">
  <header
    creationtool="XYZTool" creationtoolversion="1.01-023"
    datatype="PlainText" segtype="sentence"
    adminlang="en-us" srclang="en"
    o-tmf="ABCTransMem"/>
  <body>
    <tu>
      <tuv xml:lang="en">
        <seg>Hello world!</seg>
      </tuv>
      <tuv xml:lang="ar">
        <seg>اهلا بالعالم!</seg>
      </tuv>
    </tu>
  </body>
</tmx>

您可以像这样解析这个简单的文件:

>>> from translate.storage.tmx import tmxfile
>>>
>>> with open("sample.tmx", 'rb') as fin:
...     tmx_file = tmxfile(fin, 'en', 'ar')
>>>
>>> for node in tmx_file.unit_iter():
...     print(node.source, node.target)
Hello world! اهلا بالعالم!

有关更多信息,请从此处查看官方文档。

于 2019-07-09T10:19:52.353 回答
2

您可以查看以下链接:

干杯,

于 2014-09-11T09:05:11.213 回答