[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] archive statistics



CAUTION: This e-mail originated outside the University of Southampton.

wow, great work!

I'm still on 3.3 and I just need this info for a migration, to get some
data to put in a report.

Il 22/06/21 10:49, David R Newman via Eprints-tech ha scritto:
> Hi Yuri,
>
> I was just saying to my colleague Justin, I wonder if you were a plant
> (to ask this question).  He has just been working on a tool for this
> very purpose, so that we can analyse repositories that are running out
> of disk space to see if they genuinely need more space or if things
> could be tidied up to free up sufficient space.  The tool is a bit rough
> around the edges but he is happy to make it available as a 3.4
> ingredient in an EPrints GitHub repository, when he has had a chance to
> tidy it up (over the next few days).   If you are still on 3.3 it may be
> possible to map the various files into directories in your archive and
> enable as a plugin but that is not something either Justin or I have
> tried to do.
>
> The tool uses a cronjob to produce monthly disk reports rather than a
> live status.  If there is a wider interest in the tool we could look to
> making the tool customisable to allow a greater reporting frequency.
> Unfortunately, it is not just a case of running the cron job more
> frequently, although the adaptation required to the tool as a whole
> should be fairly minor.
>
> I have deployed this rough version on tryme.demo.eprints-hosting.org if
> you want to take a look.  The disk reports are only available under the
> Admin menu, so you would need to give me the username you used for the
> account you create on tryme, so I can up this account to a repository
> admin one.
>
> Regards
>
> David Newman
>
> On 22/06/2021 07:52, Yuri via Eprints-tech wrote:
>> CAUTION: This e-mail originated outside the University of Southampton.
>>
>> Hi!
>>
>>     what is the best way to get archive statistics, like how many records
>> in the archive, or to have some size hint on them (for example to find
>> how many objects uses at least 10MB of space, 20MB and so on), how much
>> total space the archive use (for example only record on archive status),
>> maybe grouping them by type (for example thesys uses 10GB, articles uses
>> 40GB and so on)?
>>
>> I've done rough statistics using du and some unix tools but I would like
>> to refine them better.
>>
>>
>> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>> *** Archive: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.eprints.org%2Ftech.php%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C09971ce8cd9f44c37e0308d9355cd046%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637599495011748123%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=XNp4tor%2F5K5%2F26Fh3TrPkuJC4SbvHOU18Ae%2BXzrnZMA%3D&reserved=0
>> *** EPrints community wiki: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwiki.eprints.org%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C09971ce8cd9f44c37e0308d9355cd046%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637599495011748123%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=BLs492fu5Hah%2BZl6pUfbb03bC7nYCgFTXCk34ZHJgpg%3D&reserved=0