Tech List



Re: [EP-tech] storage capacity

From: "Richard M. Davis" <r.davis AT ulcc.ac.uk>
Date: Tue, 24 Jun 2008 14:12:06 +0100


Threading: [EP-tech] storage capacity from a.m.sutton AT reading.ac.uk
      • This Message

Alison Sutton wrote:
> Please can anyone advise how best to calculate this? I've been asked for
> typical storage for different categories of full text publications - journal
> articles, conference proceedings, books, book chapters and PhD theses.

Hi Alison

This question is a close relative of that one about the string ;)

I think it really is just guesswork, as there are so many variables.

Thinking only of single PDF type submissions: a short document might 
still be a large file if it is full of images, formulas or heavy 
formatting; a long document might be a small file if it isn't. The size 
range will probably be from 100KB to 10MB.

IMO you might as well just estimate 1 file = 1MB, and therefore that 
100GB of disk space will hold 100,000 files. What you actually get will 
be somewhere between one-tenth and ten times that: a negligible degree 
of imprecision!
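
For anyone who wants to redo that arithmetic with their own disk size, here is a minimal Python sketch. It assumes decimal units (1MB = 1,000,000 bytes) and the function name is just made up for illustration; it is the same back-of-envelope sum, not a sizing tool.

```python
# Back-of-envelope capacity estimate, assuming decimal units.
KB = 1000
MB = 1000 ** 2
GB = 1000 ** 3


def files_that_fit(disk_bytes, avg_file_bytes):
    """Return how many files of the given average size fit on the disk."""
    return disk_bytes // avg_file_bytes


disk = 100 * GB

typical = files_that_fit(disk, 1 * MB)     # the "1 file = 1MB" guess
fewest = files_that_fit(disk, 10 * MB)     # every file at the large end
most = files_that_fit(disk, 100 * KB)      # every file at the small end

print(typical, fewest, most)
```

The two extremes are the one-tenth and ten-times cases mentioned above.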

But somewhere with a well-established repository, like Soton ECS, might 
be able to suggest a sounder formula based on a statistical analysis of 
their holdings, perhaps broken down by type (articles, theses, etc.): if 
so, I'd love to see it too.
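
If someone does have a list of their holdings to hand, the breakdown could be done along these lines. This is only a sketch: it assumes you can already export (type, size in bytes) pairs from your repository by some means, which is not shown here, and the function name is hypothetical.

```python
from collections import defaultdict
from statistics import mean, median


def summarise_by_type(records):
    """Group (doc_type, size_bytes) pairs and summarise each group.

    `records` is any iterable of (doc_type, size_bytes) tuples; how you
    obtain them from your repository is left open (an assumption here).
    """
    sizes = defaultdict(list)
    for doc_type, size_bytes in records:
        sizes[doc_type].append(size_bytes)

    return {
        doc_type: {
            "count": len(values),
            "mean": mean(values),
            "median": median(values),
            "total": sum(values),
        }
        for doc_type, values in sizes.items()
    }
```

The median is probably the more useful figure for planning, since a handful of very large files can pull the mean up a long way.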

Hope this helps

Richard