EPrints Technical Mailing List Archive

Message: #06716


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Making a static copy of an EPrints repo


I need to make a read-only, static, copy of an old repo (the hardware is dying, the installation was heavily tailored for the environment, and I don't have the time to re-create in a new environment.)

I can grab all the active pages:

wget --local-encoding=UTF-8 --remote-encoding=UTF-8 --no-cache --mirror -nc -k http://my.repo/

This is good, however it doesn't edit all the absolute URLs in the view pages, so we need to modify them:

  find my.repo -type f -exec sed -i 's_http://my.repo/_/_g' {} +

However this leaves me with the problem that the http://my.repo/nnn/ pages haven't been pulled down!

Any suggestions on how to do this?

Cheers

--
Ian Stuart.
Bibliographics and Multimedia Service Delivery team,
EDINA,
The University of Edinburgh.

http://edina.ac.uk/

The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.