[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[EP-tech] Antwort: Re: Spam to submitter via "Copy request" form



CAUTION: This e-mail originated outside the University of Southampton.
Yes - the privacy statement is a field that was added to the request dataset.
It is stored as the version of the privacy statement that was agreed e.g. 'request_v1'.
We only have one version at the moment, but if we revised this to make a v2 statement, we would store 'request_v2' in the database.

Cheers,
John


From: jens.witzel at uzh.ch [mailto:jens.witzel at uzh.ch]
Sent: 13 September 2021 14:47
To: John Salter <J.Salter at leeds.ac.uk>
Cc: eprints-tech at ecs.soton.ac.uk; jens.witzel at uzh.ch
Subject: Antwort: Re: [EP-tech] Spam to submitter via "Copy request" form


Hi John

thanks a lot for your quick answer. I'll keep an eye on it. Q: Do you store the "Privacy Agreement" click?

Of cause we analyse apaches logfiles and feed our badbot list, but unfortunately at the moment of sending the form it's to late ;-)

Anybody else doing the same or something different?

Cheers
Jens

--
Jens Witzel
Zentrale Informatik
Universit?t Z?rich
Stampfenbachstrasse 73
CH-8006 Z?rich

mail:  jens.witzel at uzh.ch<mailto:jens.witzel at uzh.ch>
phone: +41 44 63 56777
https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.zi.uzh.ch%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=f8UWL16yrOhiRu%2B3CK36KUYpoOWTMpNfQTQvd6pU%2B6A%3D&amp;reserved=0

[Inactive hide details for "John Salter" ---13.09.2021 15:30:46---Hi Jens, We use the recaptcha stuff e.g. https://eprints.white]"John Salter" ---13.09.2021 15:30:46---Hi Jens, We use the recaptcha stuff e.g. https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Feprints.whiterose.ac.uk%2Fcgi%2Frequest_doc%3Fdocid%3D23483&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=cRgxGHXEWlZu7OdbZoNWB3jWOb0%2Fa3LUtRdRfK9aiq8%3D&amp;reserved=0

Von: "John Salter" <J.Salter at leeds.ac.uk<mailto:J.Salter at leeds.ac.uk>>
An: "eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>" <eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>>, "jens.witzel at uzh.ch<mailto:jens.witzel at uzh.ch>" <jens.witzel at uzh.ch<mailto:jens.witzel at uzh.ch>>
Datum: 13.09.2021 15:30
Betreff: Re: [EP-tech] Spam to submitter via "Copy request" form

________________________________



Hi Jens,
We use the recaptcha stuff e.g. https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Feprints.whiterose.ac.uk%2Fcgi%2Frequest_doc%3Fdocid%3D2348396&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=pqJjsE7sUBD4h8w%2FC7rdQPyDq3Qh2PAjM5Wal9a6hGI%3D&amp;reserved=0 .
The google.com version and recaptcha.net are essentially the same thing - but recaptcha.net isn't blocked in e.g. China, so we use that.

This does work well for us, and we also use recaptcha.net on our account creation and 'contact us' pages on our eTheses repository.

As the request details are stored in the EPrints database, you could do some analysis of these spam requests, and see if there are common themes - e.g. links in the request reason, or email addresses supplied?
You could also look at historic Apache logs and see if they all originate from the same place?

Cheers,
John


________________________________

From: eprints-tech-bounces at ecs.soton.ac.uk<mailto:eprints-tech-bounces at ecs.soton.ac.uk> <eprints-tech-bounces at ecs.soton.ac.uk<mailto:eprints-tech-bounces at ecs.soton.ac.uk>> on behalf of jens.witzel--- via Eprints-tech <eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>>
Sent: 13 September 2021 13:34
To: eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk> <eprints-tech at ecs.soton.ac.uk<mailto:eprints-tech at ecs.soton.ac.uk>>
Subject: [EP-tech] Spam to submitter via "Copy request" form

CAUTION: This e-mail originated outside the University of Southampton.
Hi out there

we have received some feedback regarding spam via the "Copy Request". Lots of emails gone to one submitter. Does anybody use any capture or something else in this direction?

First I found something in /usr/local/eprints/lib/workflows/request/default.xml (line 22ff.) - using googles capture https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.google.com%2Frecaptcha%2Fabout%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=5r1Sawzx13X42lyqB0e3yAw1Y3Z75iH62VGvDZ62NxA%3D&amp;reserved=0<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.google.com%2Frecaptcha%2Fabout%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=5r1Sawzx13X42lyqB0e3yAw1Y3Z75iH62VGvDZ62NxA%3D&amp;reserved=0> but for sure we will have problems with data privacy.

Second i found some hints in the Eprints wiki: A captcha pseudo-field based on https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Frecaptcha.net%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=lSGpZyrEM5QLDQZEicEZivfn5FeFF%2Bli0vuzSqSuA2c%3D&amp;reserved=0<https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Frecaptcha.net%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=lSGpZyrEM5QLDQZEicEZivfn5FeFF%2Bli0vuzSqSuA2c%3D&amp;reserved=0>
https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwiki.eprints.org%2Fw%2FNew_Features_in_EPrints_3.2&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=XDVVSy8YqovfD%2BNWwojuUKD4yFOhbhWkLmZGaU7Q4hg%3D&amp;reserved=0<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwiki.eprints.org%2Fw%2FNew_Features_in_EPrints_3.2&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=XDVVSy8YqovfD%2BNWwojuUKD4yFOhbhWkLmZGaU7Q4hg%3D&amp;reserved=0>

Anything else? Cookies, Perl driven stuff? What do you guys use?

Every hint is welcome

Jens

--
Jens Witzel
Zentrale Informatik
Universit?t Z?rich
Stampfenbachstrasse 73
CH-8006 Z?rich

mail:  jens.witzel at uzh.ch<mailto:jens.witzel at uzh.ch>
phone: +41 44 63 56777
https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.zi.uzh.ch%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=f8UWL16yrOhiRu%2B3CK36KUYpoOWTMpNfQTQvd6pU%2B6A%3D&amp;reserved=0<https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.zi.uzh.ch%2F&amp;data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7C4cdca1b5fc984fa5526e08d976bec5c4%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637671383997556619%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=f8UWL16yrOhiRu%2B3CK36KUYpoOWTMpNfQTQvd6pU%2B6A%3D&amp;reserved=0>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20210913/7758b7bf/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.gif
Type: image/gif
Size: 105 bytes
Desc: image001.gif
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20210913/7758b7bf/attachment-0001.gif