3

我正在编写一个简单的工具,可以让我快速检查下载的 ISO 文件的 MD5 哈希值。这是我的算法:

import sys
import hashlib

def main():
    filename = sys.argv[1] # Takes the ISO 'file' as an argument in the command line
    testFile = open(filename, "r") # Opens and reads the ISO 'file'

    # Use hashlib here to find MD5 hash of the ISO 'file'. This is where I'm having problems
    hashedMd5 = hashlib.md5(testFile).hexdigest()

    realMd5 = input("Enter the valid MD5 hash: ") # Promt the user for the valid MD5 hash

    if (realMd5 == hashedMd5): # Check if valid
        print("GOOD!")
    else:
        print("BAD!!")

main()

当我尝试获取文件的 MD5 哈希时,我的问题在第 9 行。我收到类型错误:支持所需缓冲区 API 的对象。任何人都可以阐明如何使此功能起作用吗?

4

2 回答 2

8

创建的对象hashlib.md5不接受文件对象。您需要一次向其提供一段数据,然后请求哈希摘要。

import hashlib

testFile = open(filename, "rb")
hash = hashlib.md5()

while True:
    piece = testFile.read(1024)

    if piece:
        hash.update(piece)
    else: # we're at end of file
        hex_hash = hash.hexdigest()
        break

print hex_hash # will produce what you're looking for
于 2011-07-18T01:30:15.527 回答
3

您需要阅读该文件:

import sys
import hashlib

def main():
    filename = sys.argv[1] # Takes the ISO 'file' as an argument in the command line
    testFile = open(filename, "rb") # Opens and reads the ISO 'file'

    # Use hashlib here to find MD5 hash of the ISO 'file'. This is where I'm having problems
    m = hashlib.md5()
    while True:
        data = testFile.read(4*1024*1024)
        if not data: break
        m.update(data)
    hashedMd5 = m.hexdigest()
    realMd5 = input("Enter the valid MD5 hash: ") # Promt the user for the valid MD5 hash

    if (realMd5 == hashedMd5): # Check if valid
        print("GOOD!")
    else:
        print("BAD!!")

main()

而且您可能需要以二进制(“rb”)打开文件并以块的形式读取数据块。ISO 文件可能太大而无法放入内存。

于 2011-07-18T01:29:48.907 回答