[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[EP-tech] Antwort: Re: About IRStats2
Hi,
found a solution myself. In cfg.d/z_irstats2.pl, the use_ids flag must be set to 0:
$c->{irstats2}->{sets} = [
? ? ? ? {
? ? ? ? ? ? ? ? 'field' => 'divisions',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
? ? ? ? {
? ? ? ? ? ? ? ? 'field' => 'subjects',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
? ? ? ? {
? ? ? ? ? ? ? ? 'name' => 'type',
? ? ? ? ? ? ? ? 'field' => 'type',
? ? ? ? ? ? ? ? 'groupings' => [ 'authors' ]
? ? ? ? },
# ? ? ? # EdShare:
# ? ? ? {
# ? ? ? ? ? ? ? 'field' => 'courses',
# ? ? ? }
? ? ? ? # using creators_name and creators_id
? ? ? ? {
? ? ? ? ? ? ? ? 'name' => 'authors',
? ? ? ? ? ? ? ? 'field' => 'creators',
? ? ? ? ? ? ? ? 'groupings' => [ 'type' ],
? ? ? ? ? ? ? ? 'anon' => 1,? ? # don't show user's email address (the 'id' field)
? ? ? ? ? ? ? ? # for compound:
# if use_ids == 0 -> just use _name, same as having field => 'creators_name'
# if use_ids == 1 -> use _id as key for the set and _name for display - value will be ignored if _id is NOT set!
? ? ? ? ? ? ? ? 'use_ids' => 0,
# ? ? ? ? ? ? ? 'id_field' => 'id', ? ? ? ? ? ? # default value, optional. if the subfield is called 'email' then use 'email'
? ? ? ? ? ? ? ? minimum_filter_length => 2,
? ? ? ? },
Now authors statistics are generated correctly.
Best regards,
Martin
-----eprints-tech-bounces at ecs.soton.ac.uk schrieb: -----
An: eprints-tech at ecs.soton.ac.uk
Von: martin.braendle at id.uzh.ch
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk
Datum: 15.08.2014 08:57
Betreff: [EP-tech] Antwort: Re: About IRStats2
Hi,
it looks like irstats2 has a flaw in gathering author data. From our statistics on our test server zoratest (irstats2 is not yet deployed on the production server www.zora.uzh.ch) we get the following picture:
Most Downloaded Items:
count,eprintid,description
="10077",="10147","<a href='http://www.zoratest.uzh.ch/10147/'>Body Modification: psychologische Aspekte von Piercings und anderen K?rperver?nderungen</a>"
="5958",="2532","<a href='http://www.zoratest.uzh.ch/2532/'>Welpenf?tterung in der Schweiz</a>"
="5204",="43064","<a href='http://www.zoratest.uzh.ch/43064/'>Extraartikul?re weichteilrheumatische Erkrankungen (Weichteilrheumatismus) und R?ckenschmerzen</a>"
="4956",="24050","<a href='http://www.zoratest.uzh.ch/24050/'>Traumatic pericarditis in cattle: clinical, radiographic and ultrasonographic findings</a>"
="4539",="19506","<a href='http://www.zoratest.uzh.ch/19506/'>IFRS aktuell: Neues aus wichtigen Gremien rund um die internationale Rechnungslegung</a>"
Top Authors:
count,set_value,description
="9710",90de69aa75e88bae17a48fe111738757,"Zweifel, Peter"
="9170",a8919638b6af7edf5e6201132093647d,"Schwabe, Gerhard"
="8944",8d05f2d4d6fa6596e6b676723b5082c8,"Fehr, Ernst"
="8381",f3f5e1a2127f50a31435b5fb54d2bef9,"Deplazes, P"
="8289",6f95d542bd0c4b9760811f454004c76c,"Linden, A"
You see immediately that this is plain wrong, because the top author, "K?lin, R" who published eprintid 10147 (see http://www.zora.uzh.ch/10147/) isn't on the list of top authors and should there have a count of 10077 downloads.
K?lin, R also doesn't appear in the Filter Items list of irstats2.
Checking the SQL tables as Seb suggested yields:
mysql> select * from eprint_creators_name where eprintid=10147\G
*************************** 1. row ***************************
eprintid: 10147
pos: 0
creators_name_honourific:
creators_name_given: R
creators_name_family: K?lin
creators_name_lineage:
1 row in set (0.00 sec)
mysql> select * from eprint_creators_id where eprintid=10147\G
Empty set (0.00 sec)
Another eprint indeed lists entries in the eprint_creators_id?table:
mysql> select * from eprint_creators_id where eprintid=13208;
+----------+-----+------------------------------+
| eprintid | pos | creators_id ? ? ? ? ? ? ? ? ?|
+----------+-----+------------------------------+
| ? ?13208 | ? 0 | mjackson at vetclinics.uzh.ch ? |
| ? ?13208 | ? 1 | ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?|
| ? ?13208 | ? 2 | jkuemmerle at vetclinics.uzh.ch |
| ? ?13208 | ? 3 | afuerst at vetclinics.uzh.ch ? ?|
+----------+-----+------------------------------+
Conclusion: irstats2 seems to gather author statistics only correctly, if there is creators_id entry (at least an e-mail address set) in table eprint_creators_id?. Also it seems to produce a filter list entry only, if there is a corresponding entry in table eprint_creators_id.
Irstats2 authors, please correct this wrong behavior.
Best regards,
Martin
--
Dr. Martin Br?ndle
Informatikdienste
Universit?t Z?rich
Winterthurerstr. 190
CH-8057 Z?rich
mail: martin.braendle at id.uzh.ch
phone: +41 44 63 56705
fax: +41 44 63 54505
http://www.id.uzh.ch
Sebastien Francois ---25/07/2014 15:49:42---And you do have creators data? ?(select * from eprint_creators_name ?---and/or--- select * from epri
Von: Sebastien Francois <sf2 at ecs.soton.ac.uk>
An: eprints-tech at ecs.soton.ac.uk
Datum: 25/07/2014 15:49
Betreff: [EP-tech] Re: About IRStats2
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk
And you do have creators data? ?(select * from eprint_creators_name ---and/or--- select * from eprint_creators_id)
Cos that's where irstats2 tries to process the data from.
Seb.
On 25/07/14 13:07, pgasinos pgs wrote:
Yes ?
My repository is:
http://anaktisis.teiwm.gr?
Kostas Pgasinos
???? ?????????, 25 ??????? 2014, ? ??????? Sebastien Francois <sf2 at ecs.soton.ac.uk> ??????:
Hey,
Do you have a URL I can look at?
It seems like there are some issues with your data (the "countries' does not exist" error indicates some issues with Geo::IP). Do you get any related errors/warnings when you run "bin/epadmin test"?
If that were possible, I'd re-generate all the stats:
bin/stats/process_stats <id> --uninstall
then
bin/stats/process_stats <id> --setup --verbose
As you know, this may take some time (depending on the size of your 'access' dataset).
Kind regards,
Seb
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140816/02fbc434/attachment-0001.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Image.1__=4EBBF7A6DFB0D9F68f9e8a93df9 at lotus.uzh.ch.gif
Type: image/gif
Size: 105 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140816/02fbc434/attachment-0001.gif