Python中MD5和SHA2的碰撞

网友

1楼 · 编辑于 2024-09-26 21:51:19

如果几个不同的哈希算法都返回相同的哈希结果，或者您的实现中存在错误，那么您遇到问题的文件几乎肯定是相同的。在

作为一个健全的测试，编写您自己的“hash”，它只返回整个文件的内容，并查看这个是否生成相同的“hash”。在

网友

2楼 · 编辑于 2024-09-26 21:51:19

正如其他人所说，除非文件是相同的，否则单个哈希冲突不太可能，而多个哈希几乎不可能发生。我建议使用外部实用程序生成总和，作为一种健全性检查。例如，在Ubuntu（以及大多数/所有其他Linux发行版）中：

blair@blair-eeepc:~$ md5sum Bandwagon.mp3
b87cbc2c17cd46789cb3a3c51a350557  Bandwagon.mp3
blair@blair-eeepc:~$ sha256sum Bandwagon.mp3 
b909b027271b4c3a918ec19fc85602233a4c5f418e8456648c426403526e7bc0  Bandwagon.mp3

谷歌快速搜索显示，在Windows机器上也有类似的实用程序。如果看到与外部实用程序的冲突，则文件是相同的。如果没有碰撞，说明你做错了什么。我怀疑Python的实现是错误的，因为在Python中进行散列时得到的结果是相同的：

^{pr2}$

网友

3楼 · 编辑于 2024-09-26 21:51:19

我有一种感觉，你正在读一个比预期要小的数据块，而这两个文件恰好是相同的。我不知道为什么，但是试着用'rb'打开二进制文件。read（）应该读取到文件末尾，但windows的行为不同。从文件中

On Windows, 'b' appended to the mode opens the file in binary mode, so there are also modes like 'rb', 'wb', and 'r+b'. Python on Windows makes a distinction between text and binary files; the end-of-line characters in text files are automatically altered slightly when data is read or written. This behind-the-scenes modification to file data is fine for ASCII text files, but it’ll corrupt binary data like that in JPEG or EXE files. Be very careful to use binary mode when reading and writing such files. On Unix, it doesn’t hurt to append a 'b' to the mode, so you can use it platform-independently for all binary files.

相关问题更多 >

编程相关推荐

热门问题

热门文章