
[EP-tech] Re: OAI harvesting / records moving from live to buffer (or other non-deletion datasets).

If anyone's interested:
http://bazaar.eprints.org/411/ and/or https://github.com/eprintsug/oai-pmh
I think it's my first (public) Bazaar package - let me know if there are any issues with it (it is fairly simple though)!


-----Original Message-----
From: eprints-tech-bounces at ecs.soton.ac.uk [mailto:eprints-tech-bounces at ecs.soton.ac.uk] On Behalf Of John Salter
Sent: 15 June 2015 14:13
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] OAI harvesting / records moving from live to buffer (or other non-deletion datasets).

I'm trying to work out an elegant solution to this issue:
1. EPrint is made live.
2. EPrint is harvested over OAI-PMH.
3. EPrint is moved to a non-live dataset (e.g. buffer - or in this specific case dark-archive).
4. EPrint is no longer available publicly, but an OAI-PMH harvest will not see the record as deleted, so the harvester will not remove its copy.
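
To make the failure mode above concrete, here is a minimal Python simulation of an incremental harvest. It is only a sketch: the `Repo` class, the `harvest` function, and the identifier `oai:example:1` are hypothetical stand-ins, not EPrints or OAI-PMH library code, and it assumes the repository exposes only the 'archive' and 'deletion' datasets over OAI-PMH.

```python
from dataclasses import dataclass, field


@dataclass
class Repo:
    # identifier -> (dataset, last_modified); hypothetical in-memory stand-in
    records: dict = field(default_factory=dict)

    def list_records(self, since):
        """Return (identifier, status) pairs modified at or after `since`.

        Assumption: only 'archive' and 'deletion' are visible over OAI-PMH.
        """
        out = []
        for ident, (dataset, modified) in self.records.items():
            if modified < since:
                continue
            if dataset == "archive":
                out.append((ident, "live"))
            elif dataset == "deletion":
                out.append((ident, "deleted"))
            # buffer, dark-archive etc. are silently skipped
        return out


def harvest(repo, local, since):
    """Incremental harvest: add live records, drop records flagged deleted."""
    for ident, status in repo.list_records(since):
        if status == "deleted":
            local.discard(ident)
        else:
            local.add(ident)


repo = Repo()
local = set()

repo.records["oai:example:1"] = ("archive", 1)  # made live
harvest(repo, local, since=0)                   # harvested

repo.records["oai:example:1"] = ("buffer", 2)   # moved out of the live dataset
harvest(repo, local, since=2)                   # harvester hears nothing about it

stale = "oai:example:1" in local                # the harvester still holds the record
```

The second harvest returns no entry at all for the moved record, so the harvester has no signal telling it to remove its copy.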

I've checked with OAI-PMH gurus, and they think that just flagging the record as deleted will be OK - if the record subsequently reappears, it should get re-harvested OK.
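
For reference, OAI-PMH signals a deletion with a status="deleted" attribute on the record's header element. The sketch below builds such a header with Python's standard library; the function name `oai_header` and the identifier `oai:example.org:123` are made up for illustration, and this is not how EPrints itself generates its responses.

```python
import xml.etree.ElementTree as ET


def oai_header(identifier, datestamp, deleted=False):
    """Build an OAI-PMH <header> element and return it as a string."""
    header = ET.Element("header")
    if deleted:
        # The only status value defined by OAI-PMH 2.0 is "deleted"
        header.set("status", "deleted")
    ET.SubElement(header, "identifier").text = identifier
    # For a deleted record the datestamp is the date of the deletion
    ET.SubElement(header, "datestamp").text = datestamp
    return ET.tostring(header, encoding="unicode")


deleted_header = oai_header("oai:example.org:123", "2015-06-15", deleted=True)
live_header = oai_header("oai:example.org:123", "2015-06-20")
```

Because the datestamp changes again when the record is made live a second time, a later incremental harvest should pick it up as an ordinary (non-deleted) record.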

I think that the solution for this is to add a filter to the OAI-PMH searches that looks for EPrints that have a datestamp (set when the item was first made live) but are not in the 'archive' dataset.
To achieve this, the methods in EPrints::OpenArchives that currently check for 'deletion' status will need to be tweaked, and filters for 'has datestamp' added to either cgi/oai2 or $c->{oai}->{filters}.
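
The decision rule proposed here can be sketched as follows. This is Python for illustration only, not the Perl that would go into EPrints::OpenArchives; the function `oai_status` and its return values are hypothetical, and it assumes the datestamp is empty for items that have never been live.

```python
from typing import Optional


def oai_status(dataset: str, datestamp: Optional[str]) -> Optional[str]:
    """Classify an EPrint for OAI-PMH output under the proposed rule.

    Returns "live", "deleted", or None (not exposed at all).
    """
    if dataset == "archive":
        return "live"
    if dataset == "deletion":
        return "deleted"      # existing behaviour
    if datestamp:
        # Has been live at some point but is now in buffer, dark-archive, etc.
        return "deleted"      # proposed new behaviour
    return None               # never public, so harvesters need not hear of it


examples = {
    "live item": oai_status("archive", "2015-06-01"),
    "moved to buffer": oai_status("buffer", "2015-06-01"),
    "never live": oai_status("buffer", None),
}
```

The 'has datestamp' check is what keeps items that were deposited but never made live out of the OAI-PMH output altogether.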

Has anyone else come across this issue and found an elegant solution - or can see any issues with this proposal?

