[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Re: Indexing issue



It will only remove the entries from the events queue, not the items from the repository.  It's the same thing that was being discussed with Edinburgh in relation to their DB slowdown issue at the end of February.
Removing entries  may cause some of your items to not be properly indexed for search, and depending on your plugins there are other things that make use of the queue as a job scheduler - e.g. the (P)IRUS plugin uses the event queue to reschedule sending download notifications to the IRUS servers if there was a problem logging them at the original time.

-----Original Message-----
From: Andrew Beeken [mailto:anbeeken at lincoln.ac.uk] 
Sent: 13 March 2015 09:37
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] Re: Indexing issue

Okay; so there's a bunch of stuff but it's all from October 2013 and October 2014 - nothing that looks like the record that seems to be giving me the problem. Now, if I delete things off this list I presume it won't delete the actual record?

-----Original Message-----
From: eprints-tech-bounces at ecs.soton.ac.uk [mailto:eprints-tech-bounces at ecs.soton.ac.uk] On Behalf Of Alan.Stiles
Sent: 12 March 2015 16:22
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] Re: Indexing issue

Admin page, System Tools Tab, Status button, Background Task Queue should list what the indexer is working on.
There should be an event for plugin 'Event::Indexer', action 'Index', with Parameters of '/id/eprint/xxxxxx' where xxxxx is the id of the offending eprint.  You should be able to delete or change the status on that item to a different status (e.g. Failed) to stop it being processed.

-----Original Message-----
From: Andrew Beeken [mailto:anbeeken at lincoln.ac.uk]
Sent: 12 March 2015 14:40
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] Re: Indexing issue

Nope! :) How would I do that?

On 12/03/2015 14:33, "Alan.Stiles" <alan.stiles at open.ac.uk> wrote:

>Hi Andrew, have you removed the index event from the event queue?
>
>-----Original Message-----
>From: Andrew Beeken [mailto:anbeeken at lincoln.ac.uk]
>Sent: 12 March 2015 14:12
>To: eprints-tech at ecs.soton.ac.uk
>Subject: [EP-tech] Re: Indexing issue
>
>Further to this, we?ve now removed the record in question, however 
>EPrints still seems to be trying to index it - I don?t know if this is 
>a cache of some kind.
>
>On 11/03/2015 16:11, "Andrew Beeken" <anbeeken at lincoln.ac.uk> wrote:
>
>>Hello!
>>
>>We?ve got a bit of an issue with our search indexing which, after some 
>>digging, seems to be related to a specific record. Dumping a log of 
>>the ?verbose index to a file, the offending entry seems to be throwing 
>>the following error:
>>
>>Error: PDF file is damaged - attempting to reconstruct xref table...
>>
>>Error opening /tmp/ep-convertc4Azo/index.txt: No such file or 
>>directory at 
>>/usr/share/eprints3/perl_lib/EPrints/Plugin/Convert/IndexCodes.pm
>>line 57.
>>
>>The issue seems to be that once the indexer hits this record it bails 
>>out and doesn?t index records after this, meaning that some users have 
>>items which do not show up when a search is conducted based on their 
>>user number ? as this is used in conjunction with the JSON output on a 
>>search to populate our local staff directory with academics research 
>>outputs, it?s causing us some issues internally and also concerning us 
>>that the EPrints search index is incomplete.
>>
>>We?ve removed the PDF that appears to be causing the problem but it is 
>>potentially cached as it still comes up with the error; we?ve tried 
>>putting the record as ?retired? and will look tomorrow to see if that 
>>does anything, however if anyone has encountered this before or has a 
>>potentially better suggestion, let me know!
>>
>>Cheers!
>>Andrew
>>
>>The University of Lincoln, located in the heart of the city of 
>>Lincoln, has established an international reputation based on high 
>>student satisfaction, excellent graduate employment and world-class research.
>>
>>The information in this e-mail and any attachments may be confidential.
>>If you have received this email in error please notify the sender 
>>immediately and remove it from your system. Do not disclose the 
>>contents to another person or take copies.
>>
>>Email is not secure and may contain viruses. The University of Lincoln 
>>makes every effort to ensure email is sent without viruses, but cannot 
>>guarantee this and recommends recipients take appropriate precautions.
>>
>>The University may monitor email traffic data and content in 
>>accordance with its policies and English law. Further information can be found at:
>>http://www.lincoln.ac.uk/legal.
>>
>>*** Options:
>>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>>*** Archive: http://www.eprints.org/tech.php/
>>*** EPrints community wiki: http://wiki.eprints.org/
>>*** EPrints developers Forum: http://forum.eprints.org/
>
>
>*** Options:
>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>*** Archive: http://www.eprints.org/tech.php/
>*** EPrints community wiki: http://wiki.eprints.org/
>*** EPrints developers Forum: http://forum.eprints.org/
>-- The Open University is incorporated by Royal Charter (RC 000391), an 
>exempt charity in England & Wales and a charity registered in Scotland 
>(SC 038302). The Open University is authorised and regulated by the 
>Financial Conduct Authority.
>
>*** Options:
>http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>*** Archive: http://www.eprints.org/tech.php/
>*** EPrints community wiki: http://wiki.eprints.org/
>*** EPrints developers Forum: http://forum.eprints.org/


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-- The Open University is incorporated by Royal Charter (RC 000391), an exempt charity in England & Wales and a charity registered in Scotland (SC 038302). The Open University is authorised and regulated by the Financial Conduct Authority.

*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/

*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/