EPrints Technical Mailing List Archive
See the EPrints wiki for instructions on how to join this mailing list and related information.
Message: #10132
< Previous (by date) | Next (by date) > | < Previous (in thread) | Next (in thread) > | Messages - Most Recent First | Threads - Most Recent First
Re: [EP-tech] DDoS of EPrints advanced search
- To: "eprints-tech@ecs.soton.ac.uk" <eprints-tech@ecs.soton.ac.uk>, "David R Newman" <drn@ecs.soton.ac.uk>
- Subject: Re: [EP-tech] DDoS of EPrints advanced search
- From: Tomasz Neugebauer <Tomasz.Neugebauer@concordia.ca>
- Date: Thu, 5 Jun 2025 15:43:22 +0000
CAUTION: This e-mail originated outside the University of Southampton.
Hi everyone,
I've seen some of the same repeat requests on our advanced search reported on this thread, although more moderate volume. Our IT security team made some tweaks to firewall that have really helped to dramatically slow down the rate at which these repeat requests
get through. For those that do get through they get a 403 right away from Apache, as I added a regex to the apache config along the lines of what David suggested, based on what I saw in the logs. For the time being, that has been sufficient. My intuition
is that this sort of issue of repeat requests / DDoS attacks, should be dealt with at that firewall level. In terms of pages that are vulnerable, although not targeted, I worry about the IRStats2, considering putting that behind a login, but at the same time,
it's great to have that data available openly so I hesitate.
As I was monitoring the situation, it struck me just how much Gen AI crawling is happening on our repository, from Open AI, Bing/copilot, etc. The IR is a central/important data infrastructure for these services that are themselves in the midst of litigation
regarding copyright and fair use.
Tomasz
________________________________________________
Tomasz Neugebauer
Tel. / Tél. 514-848-2424 ext. / poste 7738
Mailing address / adresse postale: 1455 De Maisonneuve Blvd. W., LB-540-03, Montreal, Quebec H3G 1M8 library.concordia.ca From: eprints-tech-request@ecs.soton.ac.uk <eprints-tech-request@ecs.soton.ac.uk> on behalf of Martin Brändle <martin.braendle@uzh.ch>
Sent: Thursday, June 5, 2025 9:52 AM To: eprints-tech@ecs.soton.ac.uk <eprints-tech@ecs.soton.ac.uk>; David R Newman <drn@ecs.soton.ac.uk> Subject: Re: [EP-tech] DDoS of EPrints advanced search Attention This email originates from outside the concordia.ca domain. // Ce courriel provient de l'extérieur du domaine de concordia.ca
CAUTION: This e-mail originated outside the University of Southampton.
CAUTION: This e-mail originated outside the University of Southampton.
Hi Matthew,
not sure if separating search would really help. One just shifts load to another system. Even with our Elasticsearch implementation (https://github.com/eprintsug/EPrintsElasticsearch) we see hackers trying to spoof the URL and use excessive crawling, on other implementations they also try to post random queries.
Kind regards,
Martin
-- Dr. Martin Brändle
|
- Follow-Ups:
- RE: [EP-tech] DDoS of EPrints advanced search
- From: Matthew Kerwin <matthew.kerwin@qut.edu.au>
- RE: [EP-tech] DDoS of EPrints advanced search
- References:
- [EP-tech] DDoS of EPrints advanced search
- From: David R Newman <drn@ecs.soton.ac.uk>
- AW: [EP-tech] DDoS of EPrints advanced search
- From: Jens Witzel <jens.witzel@uzh.ch>
- RE: [EP-tech] DDoS of EPrints advanced search
- From: John Salter <J.Salter@leeds.ac.uk>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Florian Heß <hess@ub.uni-heidelberg.de>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: David R Newman <drn@ecs.soton.ac.uk>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Florian Heß <hess@ub.uni-heidelberg.de>
- RE: [EP-tech] DDoS of EPrints advanced search
- From: Matthew Kerwin <matthew.kerwin@qut.edu.au>
- Re: [EP-tech] DDoS of EPrints advanced search
- From: Martin Brändle <martin.braendle@uzh.ch>
- [EP-tech] DDoS of EPrints advanced search
- Prev by Date: Re: [EP-tech] DDoS of EPrints advanced search
- Next by Date: RE: [EP-tech] DDoS of EPrints advanced search
- Previous by thread: Re: [EP-tech] DDoS of EPrints advanced search
- Next by thread: RE: [EP-tech] DDoS of EPrints advanced search
- Index(es):