
How do I know AI steals every new word I write and publish? My evidence isn’t circumstantial or abstract. It’s sitting right in front of me in my website’s server logs.
Every day, I see thousands of hits from AI crawlers, such as Googlebot, BingBot, ChatGPT-User, Meta-ExternalAgent, ClaudeBot, PerplexityBot, and many others you’ve probably never heard of.
But the real problem isn’t the names that we recognize. It’s the hidden, unidentified bots quietly scraping everything I publish. Hundreds, sometimes thousands, of them vacuuming up my words without a trace of credit or consent.
And for what? Well, the next time you try out a brilliant new AI tool (for a modest subscription fee, naturally), you can thank the countless writers like me whose work has become the training fodder that makes it all possible.
Copying from the Internet is nothing new
Writers have always known that publishing online comes with risks. But the scale today is unprecedented as AI steals writing from everywhere.
When I first started publishing online many years ago, people copied and republished my writing by simply copying and pasting my text.
A few years later, it became a little more sophisticated as bots began scraping content from HTML or feeds to republish it.
Even though it was outright theft of material covered by copyright, it was relatively easy to trace. I occasionally sent email warnings, but to little effect.
However, there was a positive side to this copying. Search engines could easily see from the published dates that mine was the original, so it didn’t affect my search engine rankings.
As a bonus, many of the copies of my content still contained my internal links, which helped me gain some extra traffic.
Another form of copying began when Google started including featured snippets in search results.
While it seemed like a positive at first to be featured at the top of Google Search, the downside was that Google copied text from my content and very often answered the search query. That meant that users had little reason to click through to my article.
But today, this effect has been multiplied 1,000-fold with Google’s AI Overviews. The big problem is that while Google does include links to some sites, users have little reason to click, because they already have the answer to their question.
So yes, copying is nothing new, but AI stealing writing has taken it to a new level that leaves writers defenseless.
It’s not only my words, it’s my server!


It costs money to host a website on the Internet, and every hit on a website counts.
That’s fine if visitors are coming to a site and consuming content. It’s also fine that search engine crawlers hit a site regularly.
Until the advent of AI, it was an unwritten agreement that search engines had open access to sites in exchange for delivering traffic.
But that agreement is now out the window.
Before AI arrived, search engines crawled my site around 1,000 to 1,500 times per day. Now, however, with AI crawlers, my site is hit over 75,000 times daily. That’s the additional downside that people rarely mention.
Yes, writers are justified in worrying or complaining about AI stealing writers’ work. But what about the cost of massive increases in server load?
I’m fortunate that my server host hasn’t (as yet) increased my costs.
However, many authors and writers use websites that have bandwidth limits to keep costs down.
That’s fine if you expect 100 visits per day. But what happens when hundreds of AI crawlers hit these sites tens of thousands of times per day?
It’s a bit like being robbed, but having to pay for the privilege.
Yes, we all know that writers are unprotected from AI because there are no laws governing AI theft and training on stolen words.
While there are some measures under consideration, none of them take into account the cost of AI to website owners.
So, if AI is scraping our content at an unprecedented scale and even driving up costs, what can writers do to protect their work?
How to fight back against AI crawlers
Luckily, there are technologies available to help minimize or at least reduce the frequency of AI bots and crawlers.
For writers using free or hosted website platforms, you may have an option to block crawlers in your robots.txt file. For example, Wix has a feature “To set the robots.txt file to block AI crawlers.”
You can use this method on almost any platform, and while it’s not a perfect solution, because robots.txt is only a request that well-behaved crawlers choose to honor, it will help reduce the frequency of AI hits.
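To make the robots.txt approach concrete, here is a minimal Python sketch that builds the text of such a file. The list of user-agent tokens is an assumption for illustration: it uses crawler names mentioned in this article plus GPTBot, and the function name build_robots_txt is hypothetical. Check each AI vendor's own documentation for the tokens it currently uses.

```python
# Illustrative list only: crawler names from this article, plus GPTBot.
# Vendors add and rename crawlers, so treat this as a starting point.
AI_CRAWLERS = [
    "GPTBot",
    "ChatGPT-User",
    "ClaudeBot",
    "PerplexityBot",
    "Meta-ExternalAgent",
]


def build_robots_txt(blocked_agents):
    """Return robots.txt text that disallows the whole site for each agent."""
    lines = []
    for agent in blocked_agents:
        lines.append(f"User-agent: {agent}")
        lines.append("Disallow: /")
        lines.append("")  # a blank line separates each group of rules
    # Every other crawler, including search engines, remains allowed.
    lines.append("User-agent: *")
    lines.append("Allow: /")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Print the result so it can be pasted into a robots.txt editor.
    print(build_robots_txt(AI_CRAWLERS))
```

The output can be pasted into whatever robots.txt editor your platform provides. Only crawlers that identify themselves honestly and choose to obey the file will be affected.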
If you have a WordPress site connected to Cloudflare, you have many more options.
Cloudflare has a feature where you can simply block or allow AI crawlers. It means that you can differentiate and perhaps allow search engine crawlers, but block AI assistants and tools.
But these methods only relate to the well-known AI services. So what can you do about the proliferation of smaller and usually unidentified crawlers?
Well, you need to take a Sherlock Holmes approach and look for clues.
You need to check your website server for overly active bots, which today are more than likely to be AI bots.
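If you can download your raw access logs, a short script can do some of this detective work. The sketch below is one possible approach, not a definitive tool: it assumes the common "combined" log format used by many Apache and Nginx setups, and the function names and the sample log lines are made up for illustration. Your host's log format may differ.

```python
import re
from collections import Counter

# Assumed layout (combined log format): IP, identity, user, [timestamp],
# "request", status, size, "referrer", "user agent".
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def count_hits(lines):
    """Count requests per IP address and per user-agent string."""
    by_ip = Counter()
    by_agent = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        by_ip[match["ip"]] += 1
        by_agent[match["agent"]] += 1
    return by_ip, by_agent


def busiest(counter, threshold):
    """Return (key, hits) pairs with at least `threshold` hits, busiest first."""
    return [(key, hits) for key, hits in counter.most_common() if hits >= threshold]


if __name__ == "__main__":
    # Made-up sample lines using reserved documentation IP addresses.
    sample = [
        '203.0.113.7 - - [10/Oct/2025:13:55:36 +0000] "GET /post-1 HTTP/1.1" 200 5120 "-" "ExampleBot/1.0"',
        '203.0.113.7 - - [10/Oct/2025:13:55:37 +0000] "GET /post-2 HTTP/1.1" 200 4096 "-" "ExampleBot/1.0"',
        '198.51.100.23 - - [10/Oct/2025:13:56:01 +0000] "GET / HTTP/1.1" 200 2048 "-" "Mozilla/5.0"',
    ]
    by_ip, by_agent = count_hits(sample)
    print(busiest(by_ip, threshold=2))
    print(busiest(by_agent, threshold=2))
```

To run it on a real log, pass an open file to count_hits instead of the sample list and raise the threshold to suit your normal traffic. An address that makes far more requests than any human reader would is a candidate for blocking.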
One of the easiest tools to use is StatCounter. The free version has limits, but it gives you enough to identify bot traffic and the IP addresses.
Wordfence’s live traffic feature can also alert you to unusual activity and identify the IP address.
Once you have an IP address, you can block it with either Cloudflare or Wordfence. I use both, and they are the best forms of defense.
No matter how you try to mitigate AI bots, you’ll never get them all. There are simply too many, and the number is increasing daily.
But by taking some of the measures I’ve noted, you can fight back and save your server.
Summary
Right now, writers and publishers are on the losing end of the AI revolution.
It’s not fair, but that’s the way it is, leaving writers unprotected from AI. The only way to completely stop AI stealing your writing is to stop writing and publishing.
But that’s not the answer.
The threat of copying has been around for years, so what’s new? The only difference is that it used to be possible to find copycat publishers from the text they copied.
But with AI, your text is sucked up and used for training, which is impossible to trace.
So, the only defense is to stop them from accessing your server and your writing.
It’ll be a whack-a-mole process for some time yet, but it’s the best defense on offer right now.
Update: Within one minute of publishing this article, my server logs recorded 77 bot hits from AI crawlers. How ironic that they came so quickly to “read” my article about AI stealing content.