3

I have been looking for a solution, but there does not seem to be much written on this topic. I found this suggested approach:

    String unicodeString = new String("utf8 here");
    byte[] bytes = unicodeString.getBytes("UTF8");
    String converted = new String(bytes, "UTF16");

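That is meant for converting from UTF-8 to UTF-16. However, Java does not handle "UTF32", which makes this approach unworkable for UTF-32. Does anyone know another way to achieve this?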

4

3 Answers

3

Java does handle UTF-32. Try this test:

    byte[] a = "1".getBytes("UTF-32");
    System.out.println(a.length);

It prints 4, the length of the array: the single character "1" is encoded as one 4-byte UTF-32 code unit.
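To go one step further than the length check, here is a minimal round-trip sketch. It assumes the JVM provides the "UTF-32BE" charset (OpenJDK does, although it is not one of the charsets every Java platform is required to support). The class and method names are made up for illustration.

```java
import java.nio.charset.Charset;

class Utf32RoundTrip {
    // Assumption: the running JVM ships a UTF-32BE charset (true for OpenJDK).
    private static final Charset UTF_32BE = Charset.forName("UTF-32BE");

    // Encode a String (UTF-16 internally) as big-endian UTF-32 bytes, 4 bytes per code point.
    public static byte[] toUtf32(String s) {
        return s.getBytes(UTF_32BE);
    }

    // Decode big-endian UTF-32 bytes back into a String.
    public static String fromUtf32(byte[] bytes) {
        return new String(bytes, UTF_32BE);
    }

    public static void main(String[] args) {
        // 'a' followed by U+1F600, which needs a surrogate pair in UTF-16.
        String s = "a\uD83D\uDE00";
        byte[] utf32 = toUtf32(s);
        System.out.println(utf32.length);               // prints 8 (2 code points * 4 bytes)
        System.out.println(fromUtf32(utf32).equals(s)); // prints true
    }
}
```

Naming the byte order explicitly keeps a byte-order mark out of the output, so the array length is exactly four times the number of code points.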

answered 2016-04-04T03:25:30.403
3

After some searching, I got this to work:

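    public static String convert16to32(String toConvert) {
        StringBuilder utf32 = new StringBuilder();
        for (int i = 0; i < toConvert.length(); ) {
            // Read one full code point; a surrogate pair counts as a single code point.
            int codePoint = Character.codePointAt(toConvert, i);
            // Advance by 1 or 2 chars depending on whether a surrogate pair was consumed.
            i += Character.charCount(codePoint);
            // Append each code point as hex, one per line, instead of returning after the first.
            utf32.append(String.format("0x%x%n", codePoint));
        }
        return utf32.toString();
    }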
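If the goal is to get the UTF-32 values themselves rather than a formatted string, the same code-point walk can be sketched with `String.codePoints()` (Java 8 and later), which yields one `int` per code point. This is only an illustrative alternative, not the method used above; the class and method names are hypothetical.

```java
class CodePointsDemo {
    // Each int in the result is one Unicode code point, i.e. one UTF-32 code unit.
    public static int[] toCodePoints(String s) {
        return s.codePoints().toArray();
    }

    // Rebuild a String (UTF-16 internally) from an array of code points.
    public static String fromCodePoints(int[] codePoints) {
        return new String(codePoints, 0, codePoints.length);
    }

    public static void main(String[] args) {
        // 'a' followed by U+1F600, which occupies two chars in UTF-16.
        int[] cps = toCodePoints("a\uD83D\uDE00");
        for (int cp : cps) {
            System.out.printf("0x%x%n", cp); // prints 0x61, then 0x1f600
        }
    }
}
```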
answered 2016-04-04T04:36:20.433
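1

    // Hex digit lookup table. Its definition was not shown in the original post,
    // so lowercase digits are an assumption.
    private static final char[] kDigits = "0123456789abcdef".toCharArray();

    public static char[] bytesToHex(byte[] raw) {
        int length = raw.length;
        char[] hex = new char[length * 2];
        for (int i = 0; i < length; i++) {
            // Map the signed byte (-128..127) to its unsigned value (0..255).
            int value = (raw[i] + 256) % 256;
            int highIndex = value >> 4;
            int lowIndex = value & 0x0f;
            hex[i * 2 + 0] = kDigits[highIndex];
            hex[i * 2 + 1] = kDigits[lowIndex];
        }
        return hex;
    }

    // "converted" is the String from the question.
    byte[] bytearr = converted.getBytes("UTF-32");
    System.out.println("With UTF-32 encoding:\t" + String.valueOf(bytesToHex(bytearr)));
    System.out.println("With UTF-32 decoding:\t" + new String(bytearr, "UTF-32"));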
answered 2016-04-04T02:55:38.590