[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Re: Traceability of record changes

Hi Martin

- No, there?s no existing script to do this.
- It?s been designed this was as a path of least resistance to meet some preservation issues with not storing the state of the object on change.  An XML dump to the filesystem each time the object is written to the database ticks a lot of boxes and was almost free because there was already an XML exporter.

Creating a ?by field? view of the history instead of a ?by commit? view is a simple matter of parsing the XML and rerendering it.  No rocket science, but XML parsing is computationally expensive, so performance may be slow for items with long and chequered histories.

I?m happy to advise if anyone wants to do the leg-work on this, as long as it?s released as a bazaar package at the end.  It might lead to a training video or two, too.


Adam Field
SHERPA services analyst developer

From: <eprints-tech-bounces at ecs.soton.ac.uk<mailto:eprints-tech-bounces at ecs.soton.ac.uk>> on behalf of "martin.braendle at id.uzh.ch<mailto:martin.braendle at id.uzh.ch>" <martin.braendle at id.uzh.ch<mailto:martin.braendle at id.uzh.ch>>
Reply-To: "eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>" <eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>>
Date: Monday, 11 January 2016 11:26
To: EP tech <eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>>
Subject: [EP-tech] Traceability of record changes


and a happy and successful 2016!

If one wants to find all eprints for which a given field X has changed its value from A to B, this seems to be quite cumbersome, because the history table only stores the name of the fields that have been modified, but not their content, which is saved in the revisions XML files in the documents directory.

- Is there already a method or script that enables one to do so?
- Why has this been designed this way, and not the way that all changes (which field and which content) are tracked in the SQL database? Whether disk space is used up by the database or in the filesystem should not matter. Having the changes only in the database would facilitate searching a lot; in addition, rollbacks to a given point would be possible.

Best regards,


Dr. Martin Br?ndle
Zentrale Informatik
Universit?t Z?rich
Stampfenbachstr. 73
CH-8006 Z?rich

mail: martin.braendle at id.uzh.ch<mailto:martin.braendle at id.uzh.ch>
phone: +41 44 63 56705
fax: +41 44 63 54505

Jisc is a registered charity (number 1149740) and a company limited by guarantee which is registered in England under Company No. 5747339, VAT No. GB 197 0632 86. Jisc?s registered office is: One Castlepark, Tower Hill, Bristol, BS2 0JA. T 0203 697 5800.

Jisc Services Limited is a wholly owned Jisc subsidiary and a company limited by guarantee which is registered in England under company number 2881024, VAT number GB 197 0632 86. The registered office is: One Castle Park, Tower Hill, Bristol BS2 0JA. T 0203 697 5800.  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20160111/e3c7adae/attachment.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 6B9928AE-9C97-4E75-8330-7E24168F02D7[1].png
Type: image/png
Size: 1264 bytes
Desc: 6B9928AE-9C97-4E75-8330-7E24168F02D7[1].png
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20160111/e3c7adae/attachment.png