I like how you think. These are all great ideas! Reminds me of a time some real ...

smaudet · on Dec 12, 2022

Is the primary motivator to do this?

I'm curious if they are stealing anything else, e.g. are they selling ads/tracking, do they replace order forms with their own...

mkoryak · on Dec 12, 2022

because I asked them to stop doing it, and they didn't. Technically they were stealing my bandwidth.

Also to teach them an important lesson about the internet.

Firmwarrior · on Dec 12, 2022

haha, they're just lucky you didn't introduce them to Goatse

mkoryak · on Dec 12, 2022

well actually...

there was another time a site hotlinked to a js file. After asking them to stop, i found that they had a contact form with a homebrew captcha which created the letters image like http://evilsite.com/cgi-bin/captcha.jpg?q=ansr

A little while later, their captcha form had a hidden input appended with the correct answer value, and the word to solve was changed to a new 4 letter word from a dictionary of interesting 4 letter words. The form still worked because of the hidden input. I might have changed the name on the "real" input also.

spmurrayzzz · on Dec 12, 2022

Signal boosting suggestion #1 here. Great idea.

Additionally if they decide to blackhole the fake/honeypot url, since you mentioned they pass along the user agent, you could mixin some token in a randomized user agent string that your scraper uses so that you could duck-type the request on your end to signal when to capture the egress ip.

pwdisswordfish9 · on Dec 12, 2022

#5 and #6 are key. Don't try to block them directly, just get them delisted. When you've worked out a way to identify which requests belong to the scammer, feed them content that the search engines and their ad partners will penalize them for.

davidrupp · on Dec 12, 2022

Bummed that I can upvote this only once. Excellent work.

graderjs · on Dec 12, 2022

LOL! Thank you for the laugh. This is great.

egberts1 · on Dec 12, 2022

What a sure-fire way to toast them! Kudos!