[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Antwort: Re: About IRStats2


it looks like irstats2 has a flaw in gathering author data. From our
statistics on our test server zoratest (irstats2 is not yet deployed on the
production server www.zora.uzh.ch) we get the following picture:

Most Downloaded Items:

="10077",="10147","<a href='http://www.zoratest.uzh.ch/10147/'>Body
Modification: psychologische Aspekte von Piercings und anderen
="5958",="2532","<a href='http://www.zoratest.uzh.ch/2532/'>Welpenf?tterung
in der Schweiz</a>"
weichteilrheumatische Erkrankungen (Weichteilrheumatismus) und
="4956",="24050","<a href='http://www.zoratest.uzh.ch/24050/'>Traumatic
pericarditis in cattle: clinical, radiographic and ultrasonographic
="4539",="19506","<a href='http://www.zoratest.uzh.ch/19506/'>IFRS aktuell:
Neues aus wichtigen Gremien rund um die internationale Rechnungslegung</a>"

Top Authors:

="9710",90de69aa75e88bae17a48fe111738757,"Zweifel, Peter"
="9170",a8919638b6af7edf5e6201132093647d,"Schwabe, Gerhard"
="8944",8d05f2d4d6fa6596e6b676723b5082c8,"Fehr, Ernst"
="8381",f3f5e1a2127f50a31435b5fb54d2bef9,"Deplazes, P"
="8289",6f95d542bd0c4b9760811f454004c76c,"Linden, A"

You see immediately that this is plain wrong, because the top author,
"K?lin, R" who published eprintid 10147 (see http://www.zora.uzh.ch/10147/)
isn't on the list of top authors and should there have a count of 10077

K?lin, R also doesn't appear in the Filter Items list of irstats2.

Checking the SQL tables as Seb suggested yields:

mysql> select * from eprint_creators_name where eprintid=10147\G
*************************** 1. row ***************************
eprintid: 10147
pos: 0
creators_name_given: R
creators_name_family: K?lin
1 row in set (0.00 sec)

mysql> select * from eprint_creators_id where eprintid=10147\G
Empty set (0.00 sec)

Another eprint indeed lists entries in the eprint_creators_id table:

mysql> select * from eprint_creators_id where eprintid=13208;
| eprintid | pos | creators_id                  |
|    13208 |   0 | mjackson at vetclinics.uzh.ch   |
|    13208 |   1 |                              |
|    13208 |   2 | jkuemmerle at vetclinics.uzh.ch |
|    13208 |   3 | afuerst at vetclinics.uzh.ch    |

Conclusion: irstats2 seems to gather author statistics only correctly, if
there is creators_id entry (at least an e-mail address set) in table
eprint_creators_id . Also it seems to produce a filter list entry only, if
there is a corresponding entry in table eprint_creators_id.

Irstats2 authors, please correct this wrong behavior.

Best regards,


Dr. Martin Br?ndle
Universit?t Z?rich
Winterthurerstr. 190
CH-8057 Z?rich

mail: martin.braendle at id.uzh.ch
phone: +41 44 63 56705
fax: +41 44 63 54505

Von:	Sebastien Francois <sf2 at ecs.soton.ac.uk>
An:	eprints-tech at ecs.soton.ac.uk
Datum:	25/07/2014 15:49
Betreff:	[EP-tech] Re: About IRStats2
Gesendet von:	eprints-tech-bounces at ecs.soton.ac.uk

And you do have creators data?  (select * from eprint_creators_name
---and/or--- select * from eprint_creators_id)

Cos that's where irstats2 tries to process the data from.


On 25/07/14 13:07, pgasinos pgs wrote:
      My repository is:

      Kostas Pgasinos

      ???? ?????????, 25 ??????? 2014, ? ??????? Sebastien Francois <
      sf2 at ecs.soton.ac.uk> ??????:

        Do you have a URL I can look at?

        It seems like there are some issues with your data (the "countries'
        does not exist" error indicates some issues with Geo::IP). Do you
        get any related errors/warnings when you run "bin/epadmin test"?

        If that were possible, I'd re-generate all the stats:

        bin/stats/process_stats <id> --uninstall


        bin/stats/process_stats <id> --setup --verbose

        As you know, this may take some time (depending on the size of your
        'access' dataset).

        Kind regards,
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140815/739f824d/attachment.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20140815/739f824d/attachment.gif