EPrints Technical Mailing List Archive

See the EPrints wiki for instructions on how to join this mailing list and related information.

Message: #10324



Re: [EP-tech] Query for IRSTATS2 Download from Access Table


CAUTION: This e-mail originated outside the University of Southampton.
Dear John, where can I see the filtering for bots and for human double-clicks?

The difference looks quite random:

Month   Access table   EPrints IRStats2   Difference (access table minus IRStats2)
Jan          216,293            279,265   (62,972)
Feb          182,542            196,933   (14,391)
Mar          175,017            198,564   (23,547)
Apr          239,973            250,356   (10,383)
May          263,108            227,056     36,052
Jun          251,904            209,344     42,560
Jul          263,250            227,984     35,266
Aug          201,476            212,356   (10,880)
Sep          332,020            168,318    163,702
Oct           44,751             49,454    (4,703)

Izwan
UiTM Digital Library


On Fri, Jan 23, 2026 at 4:54 AM John Salter <J.Salter@leeds.ac.uk> wrote:

Hi Izwan,
The way IRStats2 processes requests is more complicated than you can easily replicate in a normal SQL query.

It tries to count 'human' usage of content, so it filters requests against a list of 'robot' user-agents (https://www.eprints.org/resource/bad_robots/robots_ua.txt), as well as applying other approaches.

If you need to provide some 2025 data for someone, you could select data from the access table, but make clear that it includes all records (robots, humans, double-clicks, etc.), so it will over-count the figures significantly.
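
To illustrate why a raw COUNT(*) on the access table differs from a filtered figure, here is a minimal Python sketch of the two ideas mentioned above: dropping requests whose user-agent matches a robot pattern, and collapsing repeated requests (double-clicks) from the same client for the same item. It is not the IRStats2 implementation. The patterns in ROBOT_PATTERNS, the 24-hour repeat window, and the field names on the Access record are all assumptions made for illustration; the real robot list is the robots_ua.txt file linked above, and IRStats2 applies further processing that is not modelled here.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Hypothetical sample patterns for illustration only; the real list is robots_ua.txt.
ROBOT_PATTERNS = ["bot", "crawler", "spider"]


@dataclass
class Access:
    # Field names are assumed to mirror columns of the access table.
    requester_id: str           # client identifier, e.g. an IP address
    requester_user_agent: str
    referent_id: int            # eprint id
    datestamp: datetime


def is_robot(user_agent, patterns=ROBOT_PATTERNS):
    """Case-insensitive substring match against the robot patterns."""
    ua = user_agent.lower()
    return any(p in ua for p in patterns)


def count_human_downloads(rows, window=timedelta(hours=24)):
    """Count non-robot requests, skipping any request that follows a
    previous request by the same client for the same item within `window`.
    The 24-hour default is an assumed value, not a documented setting."""
    last_seen = {}
    count = 0
    for row in sorted(rows, key=lambda r: r.datestamp):
        if is_robot(row.requester_user_agent):
            continue
        key = (row.requester_id, row.referent_id)
        previous = last_seen.get(key)
        last_seen[key] = row.datestamp
        if previous is not None and row.datestamp - previous < window:
            continue  # treated as a repeat (double-click)
        count += 1
    return count


t0 = datetime(2025, 1, 1, 12, 0, 0)
sample = [
    Access("10.0.0.1", "Mozilla/5.0", 1, t0),
    Access("10.0.0.1", "Mozilla/5.0", 1, t0 + timedelta(seconds=5)),   # double-click
    Access("10.0.0.2", "Googlebot/2.1", 1, t0 + timedelta(seconds=10)),  # robot
    Access("10.0.0.1", "Mozilla/5.0", 2, t0 + timedelta(seconds=20)),
    Access("10.0.0.1", "Mozilla/5.0", 1, t0 + timedelta(days=2)),
]
raw_total = len(sample)
human_total = count_human_downloads(sample)
```

In this made-up sample the raw total is 5 while the filtered total is 3, which is the kind of gap to expect between a direct access-table query and a filtered count.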

 

Cheers,
John

John Salter
https://orcid.org/0000-0002-8611-8266

White Rose Libraries Technical Officer
Library and Research Management team, IT
University of Leeds

From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> On Behalf Of MOHD.IZWAN SALIM
Sent: 22 January 2026 05:05
To: Eprints Tech <eprints-tech@ecs.soton.ac.uk>
Subject: [EP-tech] Query for IRSTATS2 Download from Access Table

 

CAUTION: External Message. Use caution opening links and attachments.

Dear EPrints,

Our repository is currently reprocessing its data, and it is taking a very long time to reach 2025.

I plan to run a quick query directly against the access table, but I can't get the same figure, or even a close one, as the IRStats2 screen shows (for a previous year).

Example for 2013:
IRStats2 screen: 186,757
Query from the access table: 282,322
Query with the condition referring_entity_id != '': 165,815

My query is:

SELECT COUNT(*) AS Jumlah
FROM access a LEFT JOIN eprint b ON a.referent_id=b.eprintid
WHERE a.datestamp_year='2013'
  AND service_type_id='?fulltext=yes'
  AND referring_entity_id !=''
  AND eprint_status='archive'

 

Is there any other condition I should add to get at least close to the figure shown on the EPrints page?

 

Izwan
UiTM Digital Library

 

DISCLAIMER : This e-mail and any files transmitted with it ("Message") is intended only for the use of the recipient(s) named above and may contain information that is non-public,  proprietary,  privileged,  confidential  and  exempt  from  disclosure under applicable law including the Official Secrets Act 1972. READ MORE...

*** Options: https://wiki.eprints.org/w/Eprints-tech_Mailing_List
*** Archive: https://www.eprints.org/tech.php/
*** EPrints community wiki: https://wiki.eprints.org/

