[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] format migrations/conversions in EPrints



I am working on a digital preservation policy document, and I wanted to summarize the format migrations that take place by default in EPrints.

I am aware of these migrations, derivative files are generated as access copies for audio and video documents:


*         Audio documents ->

o    Audio.ogg (Format: Vorbis | Bitrate: 96K | Sampling Rate:44.1 kH | Container: OGG)

o   Audio.mp4 (Format: AAC (Advanced Audio Codec) | Bitrate: 96K | Sampling Rate: 44.1 kH | Container: MPEG-4)

*         Video documents ->

o   Video.ogg  (Video Format: Theora | Audio Format: Vorbis| Video Bitrate: 500 KB/s | Video Size: 640*360 (16:9) | Video Frame Rate: 10 FPS | Audio Sampling Rate: 44.1 kHz | Audio Bitrate: 500 kb/s | Container: OGG )

o   Video.mp4 (Video Format:  AVC (Advanced Video Codec ) | Audio Format: AAC (Advanced Audio Codec)  | Video Bitrate: 500 KB/s | Video Size: 640*360 (16:9) | Video Frame Rate: 10 FPS | Audio Sampling Rate: 44.1 kHz | Audio Bitrate: 500 kb/s | Container: MPEG-4 )
(see: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/Thumbnails.pm  )

I am also aware of this:

*         DOC files -> indexcodes.txt (see: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/IndexCodes.pm)
It extracts indexing terms from DOC files and places these in a TEXT file.  So this isn't a migration, but a derivative text file which gets stored. It is similar to the thumbnail "preview.jpg" image for Video and PDF documents:

*         Video or PDF file -> preview.jpg

Are there more migrations/format conversions/normalizations that take place in EPrints by default?
For example, does EPrints ever convert .DOC files to PDF?, using this: https://github.com/eprints/eprints/blob/392474eec1b8125a66ed2d3e12b02aeb67dc07c4/perl_lib/EPrints/Plugin/Convert/DocPDF.pm ?

Best wishes,
Tomasz



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20170815/6e36c16a/attachment.html