java - LZW 压缩 - 修改版

Question

我有一个 LZW 算法 -

private void start(int maxNumBits) throws IOException{
    System.out.println("Beginning");
    /** Compress a string to a list of output symbols. */
    // Build the dictionary.
    for (int i = 0; i < 256; i++)
        dict.put("" + (char)i, i);
    int i;
    String w = "";
    int bitsRead = 0;
    int bitsOutput = 0;
    int trieLength = 0;
    float lastCr = 0f;
    while((i = reader.read()) != EOF){
        bitsRead += 8;
        float currentCr = (float)bitsRead / (float)bitsOutput;
        if(bytesRead % 1024 == 0)
            System.out.println(currentCr);
        String wi = w + (char)i;
        if (dict.containsKey(wi) && ((currentCr >= lastCr) || (trieLength < maxNumBits))){
            w = wi;
            trieLength += 8;
        }
        else {
            fos.write(dict.get(w));
            bitsOutput += 8;
            // Add wi to the dictionary.
            dict.put(wi, mapSize++);
            w = "" + (char)i;
            trieLength = 0;
        }
        lastCr = currentCr;
    }
    // Output the code for w.
    if (!w.equals("")){
        fos.write(dict.get(w));
        bitsOutput += 8;
    }
}

wheremaxNumBits应该是 trie 的最大大小。maxNumBits假设异常在传递参数的主类中被捕获。假设dict是 a HashMap，reader是 aFileInputStream并且fos是 a FileOutputStream。

在我的版本中，如果 trie 变满（即trieLength > maxNumBits），则继续压缩，直到当前压缩率（currentCr）小于上一个压缩率（lastCr）。

我已经在一个 ~8mb 文件上运行了它，并且更改 trie 长度对累积压缩率没有任何影响。这是代码吗

if(dict.containsKey(wi) && ((currentCr >= lastCr)||(trieLength < maxNumBits)))

是否符合所描述的要求？

谢谢你的帮助，

山姆

编辑 - 感谢您在格式化方面的帮助，爱德华

score 0 · Accepted Answer

事实证明，在检查下一次迭代之前没有检查 trieLength，这意味着当它变满时没有生成新的 trie。

java - LZW 压缩 - 修改版

1 回答 1

Related

Reference