EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #00992

[EP-tech] Re: Full text indexing document in Xapian search

To: eprints-tech@ecs.soton.ac.uk
Subject: [EP-tech] Re: Full text indexing document in Xapian search
From: Tim Brody <tdb2@ecs.soton.ac.uk>
Date: Thu, 30 Aug 2012 14:02:24 +0100

On Thu, 2012-08-30 at 14:12 +0200, Paolo Tealdi wrote:
> Dear all,
> 
> i'm upgrading from 3.2.4 to 3.3.10 and evaluating the new features of 
> 3.3.10 version. I've installed Xapian search and i think that now simple 
> search is quicker than 3.2.4 one.
> Nevertheless, i think that fulltext index is not present in Xapian 
> search. Am i right ?
> How can i decide the fields list indexed in simple search (Xapian in my 
> case) ?

Xapian should search all fields, including the documents, if EPrints can
convert the document to plain text.

The indexing code is in lib/cfg.d/search_xapian.pl.

There isn't much help for you debugging what has gone wrong with
indexing. Best I can suggest is adding this just above
"replace_document_by_term":

my $i = $doc->termlist_begin;
print "$i, " while ++$i ne $doc->termlist_end;
print "\n";

Then:

./bin/epadmin reindex [archiveid] eprint [eprintid]

For an eprint that isn't matching.

Will show you exactly what's getting indexed for a given eprint.

-- 
All the best,
Tim

Attachment: signature.asc
Description: This is a digitally signed message part

References:
- [EP-tech] Full text indexing document in Xapian search
  - From: Paolo Tealdi <paolo.tealdi@polito.it>

Prev by Date: [EP-tech] Re: How to customise Division listings
Next by Date: [EP-tech] Re: How to customise Division listings
Previous by thread: [EP-tech] Full text indexing document in Xapian search
Next by thread: [EP-tech] Re: Full text indexing document in Xapian search
Index(es):
- Date
- Thread