md5 and large files

N

Nelson Minar

Tobias Pfeiffer said:
I have no clue about what the md5 algorithm works like, but I'd
think one could prove that with an number large enough, every hash
occurs twice. At last, md5 is not random.

MD5 is random, and a very strong form of random at that. But yes,
the same 128 bit hash will occur for different inputs. The trick is
that it's nearly impossible to construct an input that produces a
given hash, or even to produce collisions. For more, see here:
http://en.wikipedia.org/wiki/Md5

PS: I was wrong when I said that a native md5sum would be
significantly faster than Python version. It is a bit faster, but not
much. Both the native program and the Python program are going to
spend all their CPU time in the native-code MD5 calculation function.
 
K

Kirk Job-Sluder

Nelson said:
MD5 is random, and a very strong form of random at that. But yes,
the same 128 bit hash will occur for different inputs. The trick is
that it's nearly impossible to construct an input that produces a
given hash, or even to produce collisions. For more, see here:
http://en.wikipedia.org/wiki/Md5


Actually, it turns out to be much easier than it should be to create
collisions. It is possible that this analytical attack might lead to
broader attacks in the future. This is one big reason to transition
from MD5 to SHA1.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,209
Messages
2,571,088
Members
47,686
Latest member
scamivo

Latest Threads

Top