EPrints Technical Mailing List Archive

Message: #04064

< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First

[EP-tech] Re: Indexing issue

Further to this, we¹ve now removed the record in question, however EPrints
still seems to be trying to index it - I don¹t know if this is a cache of
some kind.

On 11/03/2015 16:11, "Andrew Beeken" <anbeeken@lincoln.ac.uk> wrote:

>We¹ve got a bit of an issue with our search indexing which, after some
>digging, seems to be related to a specific record. Dumping a log of the
>‹verbose index to a file, the offending entry seems to be throwing the
>following error:
>Error: PDF file is damaged - attempting to reconstruct xref table...
>Error opening /tmp/ep-convertc4Azo/index.txt: No such file or directory
>at /usr/share/eprints3/perl_lib/EPrints/Plugin/Convert/IndexCodes.pm line
>The issue seems to be that once the indexer hits this record it bails out
>and doesn¹t index records after this, meaning that some users have items
>which do not show up when a search is conducted based on their user
>number ­ as this is used in conjunction with the JSON output on a search
>to populate our local staff directory with academics research outputs,
>it¹s causing us some issues internally and also concerning us that the
>EPrints search index is incomplete.
>We¹ve removed the PDF that appears to be causing the problem but it is
>potentially cached as it still comes up with the error; we¹ve tried
>putting the record as ³retired² and will look tomorrow to see if that
>does anything, however if anyone has encountered this before or has a
>potentially better suggestion, let me know!
>The University of Lincoln, located in the heart of the city of Lincoln,
>has established an international reputation based on high student
>satisfaction, excellent graduate employment and world-class research.
>The information in this e-mail and any attachments may be confidential.
>If you have received this email in error please notify the sender
>immediately and remove it from your system. Do not disclose the contents
>to another person or take copies.
>Email is not secure and may contain viruses. The University of Lincoln
>makes every effort to ensure email is sent without viruses, but cannot
>guarantee this and recommends recipients take appropriate precautions.
>The University may monitor email traffic data and content in accordance
>with its policies and English law. Further information can be found at:
>*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>*** Archive: http://www.eprints.org/tech.php/
>*** EPrints community wiki: http://wiki.eprints.org/
>*** EPrints developers Forum: http://forum.eprints.org/