encoding - 将 ms word 文档的特殊字符转换为 html

翻译自：https://stackoverflow.com/questions/10811758 2012-05-30T07:25:14.363

886 次

我有使用 Apache POI Api 将 doc 转换为 html 的 word 文档和以下代码。

   serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");

   serializer.setOutputProperty(OutputKeys.METHOD, "html");

   serializer.transform(domSource, streamResult);

但是编号 ie 1)、a)、i) 等和项目符号字符没有被正确解析，我得到像 1 这样的垃圾字符？当我在编辑器中打开 html 文件时，我得到带有不需要的框的数字。我已经尝试了很多，但我没有得到适当的解决方案。

请帮我解决这个编码问题。

谢谢

encoding - 将 ms word 文档的特殊字符转换为 html

0 回答 0

Related

Reference