[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Re: Normalize characters for correct sorting



I suspect this is a Perl problem rather than an EPrints problem..... I 
would expect Perl to sort by Unicode Value (so 0386 before 0391)

On 09/06/15 08:40, pgasinos pgs wrote:
> Is there any configuration file(s) in Eprints that someone can normalize
> utf-8 characters so they are sorting correctly in non English languages?
> For example the Unicode entities: Ƃ GREEK CAPITAL LETTER ALPHA
> WITH TONOS and
> Ƈ GREEK CAPITAL LETTER ALPHA are the same and they have to be
> sorted together, not in separate lists.
> The vowels are even more complicated. All below, are the same letter and
> they have to be in the same list:
> ?    υ  GREEK SMALL LETTER UPSILON
> ?    ύ  GREEK SMALL LETTER UPSILON WITH TONOS
> ?    ϋ  GREEK SMALL LETTER UPSILON WITH DIALYTIKA
> ?    ΰ  GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS


-- 

Ian Stuart.
Developer: ORI, RJ-Broker, and OpenDepot.org
Bibliographics and Multimedia Service Delivery team,
EDINA,
The University of Edinburgh.

http://edina.ac.uk/

This email was sent via the University of Edinburgh.

The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.