EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #10134
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
RE: [EP-tech] DDoS of EPrints advanced search
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>
- Subject: RE: [EP-tech] DDoS of EPrints advanced search
- From: Matthew Kerwin <matthew.kerwin@qut.edu.au>
- Date: Fri, 6 Jun 2025 00:00:54 +0000
CAUTION: This e-mail originated outside the University of Southampton.
For what it’s worth, I’ve managed to pare our border reverse-proxy filtering back to the point that I can reject requests with search and statistics URI paths that don’t have a valid
Referer header field. This is a very local fix, but it’s blocked 99% of the current flood of traffic we’re seeing. And I’ve ensured that the server itself instructs browsers to send a Referer, so organic human navigation shouldn’t be affected. I’m also profoundly aware of the volume of traffic from what appear to be LLM training robots. I was saying in our internal chat the other day that the Internet feels very hostile
lately. Oh well… Cheers --
From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk>
On Behalf Of Tomasz Neugebauer CAUTION: This
e-mail originated outside the University of Southampton. CAUTION: This
e-mail originated outside the University of Southampton. Hi everyone, I've seen some of the same repeat requests on our advanced search reported on this thread, although more moderate volume. Our IT security team made some tweaks to firewall that have really helped to dramatically
slow down the rate at which these repeat requests get through. For those that do get through they get a 403 right away from Apache, as I added a regex to the apache config along the lines of what David suggested, based on what I saw in the logs. For the
time being, that has been sufficient. My intuition is that this sort of issue of repeat requests / DDoS attacks, should be dealt with at that firewall level. In terms of pages that are vulnerable, although not targeted, I worry about the IRStats2, considering
putting that behind a login, but at the same time, it's great to have that data available openly so I hesitate. As I was monitoring the situation, it struck me just how much Gen AI crawling is happening on our repository, from Open AI, Bing/copilot, etc. The IR is a central/important data infrastructure for these services
that are themselves in the midst of litigation regarding copyright and fair use. Tomasz ________________________________________________
Tomasz Neugebauer Tel. / Tél. 514-848-2424 ext. / poste 7738
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8
library.concordia.ca From:
eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Martin Brändle <martin.braendle@uzh.ch> Attention This email originates from outside the concordia.ca domain. // Ce courriel provient de l'extérieur du domaine de concordia.ca CAUTION: This
e-mail originated outside the University of Southampton. CAUTION: This
e-mail originated outside the University of Southampton. Hi Matthew, not sure if separating search would really help. One just shifts load to another system. Even with our Elasticsearch implementation (https://github.com/eprintsug/EPrintsElasticsearch)
we see hackers trying to spoof the URL and use excessive crawling, on other implementations they also try to post random queries. Kind regards, Martin -- Dr. Martin Brändle |
- Follow-Ups:
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Florian Heß <hess@ub.uni-heidelberg.de>
- Re: [EP-tech] DDoS of EPrints advanced search
- References:
- [EP-tech] DDoS of EPrints advanced search
- From: David R Newman <drn@ecs.soton.ac.uk>
- AW: [EP-tech] DDoS of EPrints advanced search
- From: Jens Witzel <jens.witzel@uzh.ch>
- RE: [EP-tech] DDoS of EPrints advanced search
- From: John Salter <J.Salter@leeds.ac.uk>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Florian Heß <hess@ub.uni-heidelberg.de>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: David R Newman <drn@ecs.soton.ac.uk>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Florian Heß <hess@ub.uni-heidelberg.de>
- RE: [EP-tech] DDoS of EPrints advanced search
- From: Matthew Kerwin <matthew.kerwin@qut.edu.au>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Martin Brändle <martin.braendle@uzh.ch>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>
- [EP-tech] DDoS of EPrints advanced search
- Prev by Date: RE: [EP-tech] DDoS of EPrints advanced search
- Next by Date: Re: [EP-tech] DDoS of EPrints advanced search
- Previous by thread: Re: [EP-tech] DDoS of EPrints advanced search
- Next by thread: Re: [EP-tech] DDoS of EPrints advanced search
- Index(es):