[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Antwort: Re: About IRStats2



Hi,

found a solution myself. In cfg.d/z_irstats2.pl, the use_ids flag must be set to 0:

$c->{irstats2}->{sets} = [
? ? ? ? {
? ? ? ? ? ? ? ? 'field' => 'divisions',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
? ? ? ? {
? ? ? ? ? ? ? ? 'field' => 'subjects',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
? ? ? ? {
? ? ? ? ? ? ? ? 'name' => 'type',
? ? ? ? ? ? ? ? 'field' => 'type',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
# ? ? ? # EdShare:
# ? ? ? {
# ? ? ? ? ? ? ? 'field' => 'courses',
# ? ? ? }
? ? ? ? # using creators_name and creators_id
? ? ? ? {
? ? ? ? ? ? ? ? 'name' => 'authors',
? ? ? ? ? ? ? ? 'field' => 'creators',
? ? ? ? ? ? ? ? 'groupings' => [ 'type' ],

? ? ? ? ? ? ? ? 'anon' => 1,? ? # don't show user's email address (the 'id' field)
? ? ? ? ? ? ? ? # for compound:
# if use_ids == 0 -> just use _name, same as having field => 'creators_name'
# if use_ids == 1 -> use _id as key for the set and _name for display - value will be ignored if _id is NOT set!

? ? ? ? ? ? ? ? 'use_ids' => 0,
# ? ? ? ? ? ? ? 'id_field' => 'id', ? ? ? ? ? ? # default value, optional. if the subfield is called 'email' then use 'email'
? ? ? ? ? ? ? ? minimum_filter_length => 2,

? ? ? ? },



Now authors statistics are generated correctly.

Best regards,

Martin

-----eprints-tech-bounces at ecs.soton.ac.uk schrieb: -----
An: eprints-tech at ecs.soton.ac.uk
Von: martin.braendle at id.uzh.ch
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk
Datum: 15.08.2014 08:57
Betreff: [EP-tech] Antwort: Re: About IRStats2

Hi,

it looks like irstats2 has a flaw in gathering author data. From our statistics on our test server zoratest (irstats2 is not yet deployed on the production server www.zora.uzh.ch) we get the following picture:

Most Downloaded Items:

count,eprintid,description
="10077",="10147","<a href='http://www.zoratest.uzh.ch/10147/'>Body Modification: psychologische Aspekte von Piercings und anderen K?rperver?nderungen</a>"
="5958",="2532","<a href='http://www.zoratest.uzh.ch/2532/'>Welpenf?tterung in der Schweiz</a>"
="5204",="43064","<a href='http://www.zoratest.uzh.ch/43064/'>Extraartikul?re weichteilrheumatische Erkrankungen (Weichteilrheumatismus) und R?ckenschmerzen</a>"
="4956",="24050","<a href='http://www.zoratest.uzh.ch/24050/'>Traumatic pericarditis in cattle: clinical, radiographic and ultrasonographic findings</a>"
="4539",="19506","<a href='http://www.zoratest.uzh.ch/19506/'>IFRS aktuell: Neues aus wichtigen Gremien rund um die internationale Rechnungslegung</a>"

Top Authors:

count,set_value,description
="9710",90de69aa75e88bae17a48fe111738757,"Zweifel, Peter"
="9170",a8919638b6af7edf5e6201132093647d,"Schwabe, Gerhard"
="8944",8d05f2d4d6fa6596e6b676723b5082c8,"Fehr, Ernst"
="8381",f3f5e1a2127f50a31435b5fb54d2bef9,"Deplazes, P"
="8289",6f95d542bd0c4b9760811f454004c76c,"Linden, A"


You see immediately that this is plain wrong, because the top author, "K?lin, R" who published eprintid 10147 (see http://www.zora.uzh.ch/10147/) isn't on the list of top authors and should there have a count of 10077 downloads.

K?lin, R also doesn't appear in the Filter Items list of irstats2.

Checking the SQL tables as Seb suggested yields:

mysql> select * from eprint_creators_name where eprintid=10147\G
*************************** 1. row ***************************
eprintid: 10147
pos: 0
creators_name_honourific: 
creators_name_given: R
creators_name_family: K?lin
creators_name_lineage: 
1 row in set (0.00 sec)


mysql> select * from eprint_creators_id where eprintid=10147\G
Empty set (0.00 sec)


Another eprint indeed lists entries in the eprint_creators_id?table:

mysql> select * from eprint_creators_id where eprintid=13208;
+----------+-----+------------------------------+
| eprintid | pos | creators_id ? ? ? ? ? ? ? ? ?|
+----------+-----+------------------------------+
| ? ?13208 | ? 0 | mjackson at vetclinics.uzh.ch ? |
| ? ?13208 | ? 1 | ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?|
| ? ?13208 | ? 2 | jkuemmerle at vetclinics.uzh.ch |
| ? ?13208 | ? 3 | afuerst at vetclinics.uzh.ch ? ?|
+----------+-----+------------------------------+


Conclusion: irstats2 seems to gather author statistics only correctly, if there is creators_id entry (at least an e-mail address set) in table eprint_creators_id?. Also it seems to produce a filter list entry only, if there is a corresponding entry in table eprint_creators_id.

Irstats2 authors, please correct this wrong behavior.

Best regards,

Martin

--
Dr. Martin Br?ndle
Informatikdienste
Universit?t Z?rich
Winterthurerstr. 190
CH-8057 Z?rich

mail: martin.braendle at id.uzh.ch
phone: +41 44 63 56705
fax: +41 44 63 54505
http://www.id.uzh.ch

Sebastien Francois ---25/07/2014 15:49:42---And you do have creators data? ?(select * from eprint_creators_name ?---and/or--- select * from epri

Von: Sebastien Francois <sf2 at ecs.soton.ac.uk>
An: eprints-tech at ecs.soton.ac.uk
Datum: 25/07/2014 15:49
Betreff: [EP-tech] Re: About IRStats2
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk



And you do have creators data? ?(select * from eprint_creators_name ---and/or--- select * from eprint_creators_id)

Cos that's where irstats2 tries to process the data from.

Seb.

On 25/07/14 13:07, pgasinos pgs wrote:
Yes ?
My repository is:
http://anaktisis.teiwm.gr?

Kostas Pgasinos

???? ?????????, 25 ??????? 2014, ? ??????? Sebastien Francois <sf2 at ecs.soton.ac.uk> ??????:
Hey,

Do you have a URL I can look at?

It seems like there are some issues with your data (the "countries' does not exist" error indicates some issues with Geo::IP). Do you get any related errors/warnings when you run "bin/epadmin test"?

If that were possible, I'd re-generate all the stats:

bin/stats/process_stats <id> --uninstall

then

bin/stats/process_stats <id> --setup --verbose

As you know, this may take some time (depending on the size of your 'access' dataset).

Kind regards,
Seb
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/

*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140816/02fbc434/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Image.1__=4EBBF7A6DFB0D9F68f9e8a93df9 at lotus.uzh.ch.gif
Type: image/gif
Size: 105 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140816/02fbc434/attachment-0001.gif