[EP-tech] hashes in EPrints



Happy New Year!

I am reaching out to the list about this issue of MD5 vs SHA256 hashes in EPrints.
https://github.com/eprintsug/EPrintsArchivematica/issues/6

Based on digging into our database:

* EPrints generates file.hash values with file.hash_type MD5 for the following derivative files:
    - indexcodes.txt
    - lightbox.jpg
    - preview.jpg
    - medium.jpg
    - small.jpg

* EPrints generated a file.hash value with file.hash_type MD5 for most (but not all) of the uploaded files (such as PDF files).  Some of the older PDF files do not have a file.hash value stored in our database.

* EPrints did not generate any file.hash values for the dataobj.xml files of the history objects.

The EPrints source code includes a function that can generate both MD5 and SHA256 hashes, but it looks like only MD5 is ever called by default.
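
For anyone who wants to compare stored file.hash values against the files on disk, or to see what producing both digests would involve, here is a minimal sketch in Python. It is not EPrints code: the function name file_digests and its parameters are made up for illustration, and it only assumes the standard hashlib module. It reads the file once and feeds each chunk to both hashers.

```python
import hashlib


def file_digests(path, algorithms=("md5", "sha256"), chunk_size=1 << 20):
    """Return {algorithm: hex digest} for the file at `path`.

    Hypothetical helper for illustration only; not part of EPrints.
    The file is read once in chunks, so large files are not loaded
    into memory in full.
    """
    hashers = {name: hashlib.new(name) for name in algorithms}
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

The "md5" value returned for a file could then be compared with the file.hash stored for it, assuming the stored value is the plain hexadecimal digest.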

Are these findings consistent with what you have in your EPrints instance?
Since we have MD5 by default in EPrints, do you agree that MD5 will be sufficient for the export to Archivematica?
Does anyone know why some of our uploaded files would have no file.hash?  Could that have been caused by a bug in EPrints that prevented hashes from being generated, and that was fixed at some point?

Tomasz



________________________________________________
Tomasz Neugebauer
Digital Projects & Systems Development Librarian / Bibliothécaire des Projets Numériques & Développement de Systèmes
Library / Bibliothèque
Concordia University / Université Concordia
Tel. / Tél. 514-848-2424 ext. / poste 7738
Email / courriel: tomasz.neugebauer at concordia.ca
www.concordia.ca/faculty/tomasz-neugebauer.html
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
Street address / adresse municipale: 1400 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
library.concordia.ca
