EPrints Technical Mailing List Archive

Message: #05419



[EP-tech] Searching fails when database field contains Å (utf8 %c3%85)


Hello all!

 

I have encountered a strange UTF-8 related problem when querying names in the advanced search.  If an author's name contains Å (A with ring above, UTF-8 %c3%85), as in Ångström, the query fails.  I have not seen the problem with any other character: there is no problem with “å” (a with ring above, %c3%a5) or with any other non A-Z letter such as ä, Ä, ö, or Ö.  The problem occurs when the database entry itself contains an Å, which typically happens when the character is the first letter of the name, as in Ångström, or follows the hyphen in a hyphenated name such as Per-Åke.
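For reference, here is a minimal Python sketch (not EPrints code) that prints the code point and percent-encoded UTF-8 bytes of the characters mentioned above. The helper name utf8_percent is only illustrative.

```python
# Characters mentioned in this message.
chars = ["Å", "å", "Ä", "ä", "Ö", "ö"]

def utf8_percent(ch: str) -> str:
    """Return the UTF-8 encoding of ch in %xx notation."""
    return "".join(f"%{b:02x}" for b in ch.encode("utf-8"))

for ch in chars:
    # Print the character, its Unicode code point, and its UTF-8 bytes.
    print(f"{ch}  U+{ord(ch):04X}  {utf8_percent(ch)}")
```

The only byte that differs between Å and å is the second one (%85 versus %a5).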

 

Furthermore, if the query term itself contains an “Å”, the search also fails.  A few examples:

 

Mårten -- works

mårten -- works

MåRTEN -- works

MÅRTEN -- fails

mÅrten -- fails

 

The query field is (normally) case-insensitive, so it should not matter whether I write “ångström” or “Ångström”.  However, whether the search hits or misses depends on whether the database entry contains an Å and/or the query term contains an Å; it seems that EPrints cannot handle “Å” in the search.  The name is always displayed correctly and is written correctly to the database.  The only problem is the advanced search.
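To make the expected behaviour concrete, here is a small Python sketch of a Unicode-aware case-insensitive comparison. It is only an illustration of what I would expect from the search, not how EPrints implements it; the function name matches and the stored value are assumptions for the example.

```python
import unicodedata

# Stored value and the query variants from the examples above.
stored = "Mårten"
queries = ["Mårten", "mårten", "MåRTEN", "MÅRTEN", "mÅrten"]

def matches(query: str, value: str) -> bool:
    """Case-insensitive comparison using Unicode case folding."""
    # Normalise to NFC first so composed and decomposed forms compare equal.
    q = unicodedata.normalize("NFC", query).casefold()
    v = unicodedata.normalize("NFC", value).casefold()
    return q == v

for q in queries:
    print(q, matches(q, stored))
```

With this kind of comparison all five variants match, including the two that fail in the advanced search.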

 

I should add that querying the database directly with SQL works without any problems (including all upper/lower case combinations).  Any ideas what may be wrong with EPrints and where to start looking?
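One thing that might be worth checking, purely as an untested assumption and not a confirmed diagnosis: if the UTF-8 bytes are read as single-byte (Latin-1) characters somewhere in the search path, the second byte of Å (0x85) becomes U+0085, which Unicode classifies as whitespace, so a tokenizer could split the name there. The Python sketch below only demonstrates that property of the bytes; the helper name second_byte_is_space is illustrative and nothing here is taken from the EPrints code.

```python
def second_byte_is_space(ch: str) -> bool:
    """Decode the UTF-8 bytes of ch as Latin-1 and test the second character."""
    misread = ch.encode("utf-8").decode("latin-1")
    return misread[1].isspace()

for ch in ["Å", "å", "Ä", "ä", "Ö", "ö"]:
    print(ch, second_byte_is_space(ch))

# What a whitespace-based split would do to a misread name.
misread_name = "Ångström".encode("utf-8").decode("latin-1")
print(misread_name.split())
```

Of the characters I tested, only Å has this property, which would at least be consistent with what I am seeing.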

 

Regards,

Christer

 

 

Christer Enkvist, Ph D

System Administrator/System Librarian

Division of Scholarly Communication

Swedish University of Agricultural Sciences

Uppsala, Sweden

 

Telephone: 018-671042