[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[EP-tech] Re: Search oddities
We're currently noting that sort of behaviour on our repo
(research.library.mun.ca) as well. In our case, text extraction for
single (PDF) items is occasionally failing within complete reindexes
only (such as might be scheduled periodically via cron); single item
reindexes give no issue. You might want to log the verbose output of a
full reindex to text to see if your problematic particular items are
throwing errors within the context of the larger reindex, e.g.:
eprints at yourrepo:~/bin$ script ../indexlog_20140617.log
Script started, file is ../indexlog_20140617.log
eprints at yourrepo:~/bin$ ./epadmin reindex your_repo_name eprint
eprints at yourrepo:~/bin$ exit
Script done, file is ../indexlog_20140617.log
In particular, we've noted trouble a number of pdftotext calls, though
we haven't any resolution yet.
Library Information Technology Services (LITS)
Memorial University of Newfoundland
From: eprints-tech-bounces at ecs.soton.ac.uk
[mailto:eprints-tech-bounces at ecs.soton.ac.uk] On Behalf Of Andrew Beeken
Sent: June-17-14 12:46 PM
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] Search oddities
We have another search related oddity. There are a couple of items that
don't seem to be indexed for a general (search bar) search, but will be
returned in results when you do a specific search (for example, by
author). I've tried reindexing the individual item, but have yet to try
fully reindexing the site.
If it is an indexing problem, this worries me as it's not the first time
this has happened and I'm wondering how items can appear in one search
type, but not the other.
Any thoughts or prior experience with this would be very helpful.
The University of Lincoln, located in the heart of the city of Lincoln,
has established an international reputation based on high student
satisfaction, excellent graduate employment and world-class research.
The information in this e-mail and any attachments may be confidential.
If you have received this email in error please notify the sender
immediately and remove it from your system. Do not disclose the contents
to another person or take copies.
Email is not secure and may contain viruses. The University of Lincoln
makes every effort to ensure email is sent without viruses, but cannot
guarantee this and recommends recipients take appropriate precautions.
The University may monitor email traffic data and content in accordance
with its policies and English law. Further information can be found at:
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/