[EP-tech] Partitioning access table (INNODB)

Hello everybody,

Our EPrints repository has an access table with more than 133 million
records, and it is still growing.

We recently updated the IRStats module to version 1.1 in our test repository,
and the reindexing has been running for 9 days and counting.

We have also converted the access table to INNODB and compressed (it took 7

The MySQL slow queries log reports:

-   At first it took about 2 seconds per query: *Query_time: 2.068357*

# Time: 190304  0:13:52

# User@Host: eprintsdbo[eprintsdbo] @  []  Id:    80

# *Query_time: 2.068357*  Lock_time: 0.000141 Rows_sent: 100000
Rows_examined: 700000

SET timestamp=1551654832;

FROM `access` LIMIT 100000 OFFSET 600000;

-   Nine days later, it took about 660 seconds per query: *Query_time:
    661.963604*

# Time: 190312 12:33:07

# User@Host: eprintsdbo[eprintsdbo] @  []  Id:  1077

# *Query_time: 661.963604*  Lock_time: 0.000180 Rows_sent: 99787
Rows_examined: 123899787

SET timestamp=1552390387;

FROM `access` LIMIT 100000 OFFSET 123800000;
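
In both log entries, Rows_examined equals the OFFSET plus the rows returned (600000 + 100000 = 700000, and 123800000 + 99787 = 123899787). This suggests that every batch steps over all rows before its offset, so each batch gets slower as the offset grows. One common alternative is keyset pagination, which remembers the last key seen and seeks past it through the index. The following is only a minimal sketch of that idea, not how IRStats reads the table. It uses Python's sqlite3 as a stand-in for MySQL, and it assumes the table has an increasing integer primary key, here called `accessid`; the `referent_id` column and all function names are hypothetical.

```python
import sqlite3


def make_demo_db(n_rows):
    """Build a small in-memory stand-in for the access table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE access (accessid INTEGER PRIMARY KEY, referent_id INTEGER)"
    )
    conn.executemany(
        "INSERT INTO access (accessid, referent_id) VALUES (?, ?)",
        ((i, i % 7) for i in range(1, n_rows + 1)),
    )
    return conn


def batches_by_offset(conn, batch_size):
    """LIMIT/OFFSET paging: each batch must first step over `offset` rows."""
    offset = 0
    while True:
        rows = conn.execute(
            "SELECT accessid, referent_id FROM access "
            "ORDER BY accessid LIMIT ? OFFSET ?",
            (batch_size, offset),
        ).fetchall()
        if not rows:
            return
        yield rows
        offset += batch_size


def batches_by_key(conn, batch_size):
    """Keyset paging: seek directly past the last primary key already read."""
    last_id = 0  # assumes accessid values are positive
    while True:
        rows = conn.execute(
            "SELECT accessid, referent_id FROM access "
            "WHERE accessid > ? ORDER BY accessid LIMIT ?",
            (last_id, batch_size),
        ).fetchall()
        if not rows:
            return
        yield rows
        last_id = rows[-1][0]
```

Both generators return the same rows in the same order. The difference is that the keyset version does a constant amount of work per batch, provided the key column is indexed.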

I was wondering whether anyone has partitioned the access table (for
example, one partition per 10 million access records) and whether this
would speed up the generation of statistics.
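
To make the question concrete, here is a minimal sketch of what range partitioning in blocks of 10 million could look like. It is a small Python helper that only builds the MySQL statement as a string and does not run it. The column name `accessid` and the helper name are assumptions; MySQL requires the partitioning column to be part of every unique key on the table, including the primary key. Also, as far as I understand, MySQL can skip partitions only when the WHERE clause restricts the partitioning column, so a bare LIMIT ... OFFSET query like the ones logged above may not benefit from partitioning on its own.

```python
def build_range_partition_ddl(table, column, rows_per_partition, max_id):
    """Return a MySQL ALTER TABLE statement that range-partitions `table`.

    One partition is created per `rows_per_partition` key values up to
    `max_id`, plus a catch-all partition for any larger keys.
    """
    # Integer ceiling division, so a partial last block gets its own partition.
    n_parts = -(-max_id // rows_per_partition)
    parts = [
        f"  PARTITION p{i} VALUES LESS THAN ({(i + 1) * rows_per_partition})"
        for i in range(n_parts)
    ]
    # Catch-all so that new rows beyond max_id still have somewhere to go.
    parts.append("  PARTITION pmax VALUES LESS THAN MAXVALUE")
    return (
        f"ALTER TABLE `{table}`\n"
        f"PARTITION BY RANGE (`{column}`) (\n"
        + ",\n".join(parts)
        + "\n);"
    )


if __name__ == "__main__":
    # 133 million keys in blocks of 10 million (figures from the post above).
    print(build_range_partition_ddl("access", "accessid", 10_000_000, 133_000_000))
```

The generated statement would need to be reviewed and tried on a copy of the table first, since repartitioning a table of this size rebuilds it.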



