EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #09572


< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Indexing - cleanup indexed terms after mass deletions


CAUTION: This e-mail originated outside the University of Southampton.

 

Hi All,

 

Our original repo, houses traditional outputs (Articles, conference papers etc.) as well as Theses…

We have split the Theses into a dedicated repo, cloning the original system (metadata and files), and then removed the non-theses (search->batch edit->remove all records).

 

I have noticed that there are entries in the various database index tables, referring to eprints that are no longer in the system…

I have run epadmin reindex over ‘<repo> eprint’ and ‘<repo> document’, but the indexed values persist…

 

e.g. eprint__index contains a fieldword = ‘title:elephant’ with ids = ‘:12345:’  but there is no eprint 12345 in the system any longer.

 

I thought the permanent removal of the non-theses items would have cleaned up the index tables as process occurred?

 

Any thoughts appreciated.

 

Cheers,

Matt

 

 

__________________________________________________________________
This email (including any attached files) is confidential and is 
for the intended recipient(s) only. If you received this email by 
mistake, please, as a courtesy, tell the sender, then delete this 
email.
The views and opinions are the originator's and do not necessarily 
reflect those of the University of Southern Queensland. Although 
all reasonable precautions were taken to ensure that this email 
contained no viruses at the time it was sent we accept no 
liability for any losses arising from its receipt.
The University of Southern Queensland is a registered provider 
of education with the Australian Government.
(CRICOS Institution Code QLD 00244B / NSW 02225M, TEQSA PRV12081)