EPrints Technical Mailing List Archive

Message: #09301


Re: [EP-tech] Cache Files - Always look for new

CAUTION: This e-mail originated outside the University of Southampton.
Hi James,
PDF caching is a PITA.
If someone has already downloaded a copy, the browser will already have cached it - so you might not even see a request hitting the server (and therefore your Apache changes are never 'hit').

Can you confirm that you see entries in the access log from people with a cached copy, i.e. that the browser is actually making a call to the server?
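As a rough way to check this, the sketch below scans access log lines for PDF requests and reports the status code of each. It assumes Apache's "combined" log format; the sample lines, IP addresses and file path are made up for illustration. A 304 means the browser revalidated its cached copy, a 200 means it fetched the file afresh, and no entry at all suggests the copy came straight from the browser cache.

```python
import re

# Assumes the Apache "combined" log format; adjust the pattern if your LogFormat differs.
LOG_LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)

def pdf_requests(lines):
    """Yield (host, path, status) for each logged request whose path ends in .pdf."""
    for line in lines:
        match = LOG_LINE.match(line)
        if not match:
            continue  # skip lines that are not in the expected format
        path = match.group("path")
        # Ignore any query string when checking the file extension.
        if path.split("?", 1)[0].lower().endswith(".pdf"):
            yield match.group("host"), path, int(match.group("status"))

# Hypothetical sample lines; in practice, iterate over the real access log file.
sample = [
    '192.0.2.10 - - [04/May/2023:14:50:01 +0100] "GET /id/eprint/123/1/data.pdf HTTP/1.1" 304 - "-" "Mozilla/5.0"',
    '192.0.2.11 - - [04/May/2023:14:51:12 +0100] "GET /style/auto.css HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
]
hits = list(pdf_requests(sample))
for host, path, status in hits:
    print(host, path, status)
```

To run it against a real log, pass an open file object to `pdf_requests` instead of `sample`.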

My normal advice is to append a random query string to the end of the PDF URL. This will cause a reload, but your user has already said 'no' to this.
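For illustration, a minimal sketch of that query-string trick. The function name, the `v` parameter name and the example URL are all assumptions, not anything EPrints defines; any parameter the server ignores should work.

```python
import secrets
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def add_cache_buster(url, param="v"):
    """Return url with a fresh random value in the given query parameter."""
    parts = urlsplit(url)
    # Keep existing parameters, but drop any earlier cache-busting value.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != param]
    query.append((param, secrets.token_hex(4)))  # 8 random hex characters
    return urlunsplit(parts._replace(query=urlencode(query)))

# Hypothetical repository URL, for illustration only.
original = "https://repository.example.org/id/eprint/123/1/data.pdf"
busted = add_cache_buster(original)
print(busted)
```

Because the browser keys its cache on the full URL, the new link is treated as a different resource and fetched from the server.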

If requests are hitting the server, you could use e.g. an EPrints URL rewrite trigger to append a querystring and respond with a redirect.
If requests aren't hitting the server, you'll have to get them to visit a new URL... manually.
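The redirect idea can be sketched roughly as follows. This is plain Python showing only the decision logic, not EPrints trigger code (which would be Perl); the function name, the `v` parameter and the use of a version token such as the file's modification time are all assumptions.

```python
from urllib.parse import urlsplit, parse_qs

def redirect_for(request_uri, current_version, param="v"):
    """Return a redirect target if a PDF request lacks the current version token, else None."""
    parts = urlsplit(request_uri)
    if not parts.path.lower().endswith(".pdf"):
        return None  # only PDFs are handled in this sketch
    if parse_qs(parts.query).get(param) == [current_version]:
        return None  # already on the versioned URL, so serve the file
    # Note: this simple version discards any other query parameters.
    return f"{parts.path}?{param}={current_version}"

# Hypothetical path and version token, for illustration only.
target = redirect_for("/id/eprint/123/1/data.pdf", "1683207900")
print(target)
```

The server would answer with a temporary redirect to `target` whenever it is not `None`. This only helps when the request actually reaches the server.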


From: eprints-tech-bounces@ecs.soton.ac.uk <eprints-tech-bounces@ecs.soton.ac.uk> on behalf of James Kerwin via Eprints-tech <eprints-tech@ecs.soton.ac.uk>
Sent: 04 May 2023 14:45
To: eprints-tech@ecs.soton.ac.uk <eprints-tech@ecs.soton.ac.uk>
Subject: [EP-tech] Cache Files - Always look for new
Hi All,

A user uploaded a file to our data repository, opened it, noticed a mistake, deleted the file and reuploaded the new file. When they click the link and the pdf opens in the browser it opens the original file because, I assume, it's been cached.

This happens rarely and when it does we ask the user to refresh the page with the document or do a ctrl+f5 refresh to clear the cache for that page. In this instance the user is insisting that they couldn't possibly ask this of the people they had already shared the link with.

I've made changes to my apache ssl conf to include:

<FilesMatch "\.(js|css|jpg|jpeg|png|gif|ico|swf|pdf|html)$">
  <IfModule mod_headers.c>
    Header set Cache-Control "max-age=0, no-cache, no-store, must-revalidate"
    Header set Pragma "no-cache"
    Header unset Last-Modified
  </IfModule>
</FilesMatch>

I am not an apache expert by any stretch of the imagination. After restarting apache this has not resolved the issue. Can anybody advise? Maybe there is a specific EPrints THING I need to be aware of? We're on EPrints 3.4.4.