[EP-tech] Simple search - length of query

Yesterday we had a bit of an issue with our repository when someone pasted a full citation into the simple search box.
This produced an impressive SQL query that locked things up and made users unhappy...

Is there a way to sanitise what a 'simple' search will try to handle? For example, would restricting it to a certain number of words be acceptable?
Would the Xapian search handle a request like the one below any better?
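To make the "certain number of words" idea concrete, here is a minimal standalone sketch of capping a query before it reaches the search. It is not EPrints code and does not use any EPrints API: the function name limit_query and the limit of 8 words are made up for illustration, and where such a check would hook into the simple search is left open.

```python
import re
from typing import Tuple

# Hypothetical cap; a real value would need tuning per repository.
MAX_WORDS = 8


def limit_query(query: str, max_words: int = MAX_WORDS) -> Tuple[str, bool]:
    """Keep only the first max_words words of a search query.

    Returns the shortened query and a flag saying whether anything was
    dropped, so the caller could warn the user instead of silently
    searching on less than they typed.
    """
    # Words are runs of letters/digits, optionally joined by an apostrophe
    # (so "Women's" stays one word). Punctuation and quotes are discarded.
    words = re.findall(r"\w+(?:['’]\w+)*", query)
    truncated = len(words) > max_words
    return " ".join(words[:max_words]), truncated


if __name__ == "__main__":
    pasted = ("‘Families, Domesticity and Intimacy: Changing Relationships "
              "in Changing Times’, in Richardson, D, and Robinson, V. (eds)")
    short, was_cut = limit_query(pasted)
    print(short)
    print("truncated:", was_cut)
```

Returning the flag rather than just the shortened string leaves it up to the caller whether to run the reduced search, show a message, or both.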

Details below/attached if you're interested!

GET /cgi/search/simple?full=%E2%80%98Families%2C+Domesticity+and+Intimacy%3A+Changing+Relationships+in+Changi

searchexp created in cache table:
0|1|-date/creators_name/title|archive|-|full:abstract/creators_name/date/documents/title:ALL:IN:?Families, Domesticity and Intimacy%3A Changing Relationships in Changing Times?, in Richardson, D, and Robinson, V. (eds) Introducing Women's Studies, third edition. Basingstoke%3A Palgrave, 2008 pp. 125-143. |-|eprint_status:eprint_status:ANY:EQ:archive|metadata_visibility:metadata_visibility:ANY:EQ:show

The SQL generated by search is attached (get ready for this - it's a thing of beauty ;o) - you can see why it took a while to run!
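As a rough back-of-envelope for why a pasted citation balloons, the sketch below multiplies the number of words by the number of fields the simple search covers (the five listed in the searchexp above). This assumes every word is matched against every field with ALL semantics; the SQL EPrints really generates may be structured differently, so treat the number as an order-of-magnitude guide only. The function name is hypothetical.

```python
# Fields taken from the "full:" part of the cached searchexp.
SIMPLE_SEARCH_FIELDS = ["abstract", "creators_name", "date", "documents", "title"]


def estimate_conditions(query, fields=SIMPLE_SEARCH_FIELDS):
    """Estimate word/field match conditions for a query.

    Assumption (not confirmed against EPrints internals): one condition
    per whitespace-separated word per searched field.
    """
    words = query.split()
    return len(words) * len(fields)


if __name__ == "__main__":
    # A typical short search versus a longer pasted phrase.
    print(estimate_conditions("domesticity intimacy"))
    print(estimate_conditions("Families, Domesticity and Intimacy: "
                              "Changing Relationships in Changing Times"))
```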

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: monster-sql.txt
Url: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140305/4b85a7a5/attachment-0001.txt