java - 为什么散列信息字典结果错误？

Question

多年来，我一直在尝试让 BitTorrent 在 Java 中使用这种散列方法，但它总是出错。

我已将其缩小到几行代码，我 99% 确定问题是：

Bencode bencode = new Bencode(Charset.forName("UTF-8"));
byte[] fileBytes = new byte[33237];
Map<String, Object> dict = bencode.decode(fileBytes, Type.DICTIONARY);
Map infoMap = (Map) object.get("info");
ByteArrayOutputStream baos = new ByteArrayOutputStream();
BencodeOutputStream bos = new BencodeOutputStream(baos);
bos.writeDictionary(infoMap);
byte[] hash = DigestUtils.sha1(baos.toByteArray());

我对数组的大小进行了硬编码，以确保问题不是由一堆零引起的。

我已经尝试过UTF-8和US-ASCII。

我已经尝试使用两个不同的库进行编码，所以它可能不存在问题所在。

编辑：从规范看来，信息字典应该被 urlencoded 为 info_hash。所以我尝试将字典写成 a ByteArrayOutputStream，然后对所持有的进行 sha1byte[]散列ByteArrayOutPutStream。

该DigestUtils.sha1方法会提供 URL 编码器吗？找不到这方面的任何信息。

score 1 · Accepted Answer

正如 Encombe 所指出的，问题在于编码。在 Bencode 规范中，它谈到了字节字符串，这似乎表明它只是一个没有任何编码的数据流。

我查看的两个库都将所有字节字符串转换为某种编码，因此我编写了一个 Bencode 库，该库仅在特别要求时才进行转换。

上面的代码基本上是正确的，但这是我现在使用的客户端代码：

public void readManifest() throws IOException, Exception {
    byte[] fileBytes = FileUtils.readFileToByteArray(file);
    ByteArrayInputStream bis = new ByteArrayInputStream(fileBytes);
    BDecoder decoder = new BDecoder(bis, "UTF-8");
    BDict dict = decoder.decodeDict();
    Map<String, Object> valueMap = dict.getValue();
    infoMap = (Map<String, Object>) valueMap.get("info");
}

public String hash() throws Exception {
    if (hash == null) {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        BEncoder encoder = new BEncoder(baos, "UTF-8");
        encoder.encodeDict(infoMap);
        hash = DigestUtils.sha1Hex(baos.toByteArray());
    }
    return hash;
}

java - 为什么散列信息字典结果错误？

1 回答 1

Related

Reference