[EP-tech] apostrophe in file names of uploaded/deposited file
Good afternoon!
I'm trying to troubleshoot an issue with exporting a deposited file that has an apostrophe in its filename.
This is the issue: https://github.com/eprintsug/EPrintsArchivematica/issues/40
Does EPrints replace apostrophes in filenames on disk with =0027?
Where in the code does that happen?
The URL of the file has the apostrophe, for example:
https://spectrum.library.concordia.ca/id/eprint/7066/1/Services_techniques_a_l'Universite_Concordia.pdf
But unlike other Unicode characters, the apostrophe doesn't make it into the file name on disk, and is substituted with =0027.
I'm looking for confirmation that this is how it is "supposed" to work, and for a pointer to where this happens in the code, so that I can ultimately work out which OTHER characters are replaced in the filename in this way.
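To make the observed substitution concrete: =0027 is "=" followed by the four-digit hexadecimal code point of the apostrophe (U+0027). Below is a minimal Python sketch of that pattern. It is not EPrints code (EPrints is written in Perl); the function names and the set of escaped characters (ASSUMED_UNSAFE, here only the apostrophe) are assumptions for illustration, since the real list is exactly what I'm asking about.

```python
import re

# Assumption: only the apostrophe is escaped. The real EPrints list of
# escaped characters is unknown here and may well be longer.
ASSUMED_UNSAFE = "'"


def escape_filename(name: str, unsafe: str = ASSUMED_UNSAFE) -> str:
    """Replace each character in `unsafe` with '=' plus its 4-digit hex code point."""
    pattern = "[" + re.escape(unsafe) + "]"
    return re.sub(pattern, lambda m: "=%04X" % ord(m.group(0)), name)


def unescape_filename(name: str) -> str:
    """Reverse the =XXXX substitution (assumes '=' is never stored literally)."""
    return re.sub(r"=([0-9A-Fa-f]{4})", lambda m: chr(int(m.group(1), 16)), name)


on_disk = escape_filename("Services_techniques_a_l'Universite_Concordia.pdf")
print(on_disk)  # Services_techniques_a_l=0027Universite_Concordia.pdf
```

If the real scheme works along these lines, any export tool that maps the URL filename to the on-disk filename would need to apply the same escaping, or the reverse, before looking up the file.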
Tomasz