Anti-crawler tarpits and related concepts have existed for decades already; LLM training data is only the latest and most popular of web-scraping goals.
Claude is happy and able to provide a laundry list of ways to mitigate the impact of tarpits on your crawler, and politeness / respecting robots.txt is only one of them.
They're all usage based plans. You probably wouldn't hit 10k/head for most users, but thousands is not unheard of. But it's kind of anticipating that's what they would want to charge.
There is no ceiling to how much waste you can create.
Much like stacks and stacks of badly written web frameworks made things like collapsing comments on new reddit 200 ms of JavaScript execution ( https://bvisness.me/high-level/burnitwithfire.png ) I can easily imagine people layer stuff together till token burn is beyond insane.
I mean just look at the Gastown repository. Its like literally hundreds of thousands of lines of go and md files.
The thing is, all these “better than a medieval king” tech niceties still don’t cover the bottom of Maslow’s hierarchy for all, and “poverty” is the state of suffering those gaps for lack of money.
Even in very cheap local housing you usually still have heating, a fridge and more then enough food (to much more often then to little even for the poorest people).
"More reach" seems a valid enough goal/desire in and of itself (even if you deride it as a shallow form of communication, shallow attention is what provides the opportunity for deeper connections); this sets the goal-activity of creative pursuits apart from "lounging alone at the beach" (which is itself a flawed representation of retirement, but that's another story).
The reason OP gave for trying to achieve more reach was this:
> it's been pointed out to me in harsh ways I could be easily growing if I tried a little harder, so I've invested more resources into the channel, equipment, actually trying growth, etc.
…which made me think of the tourist in the story.
Is it really more reach that they desire, transforming their content into whatever sates the algorithm, chasing metrics, investing time and money? Or is their current level of reach perhaps already enough as it is, a work of love and dedication, without someone—something—else deciding what’s best?
And the only thing that stopped in Xinjiang is the news coverage and press access.
I find it deeply ironic that for some, the vibes have shifted towards "hey maybe the CCP isn't all that bad" just because...what, the solar buildouts make them look more competent and long-sighted compared to your local upstart authoritarian party? Such is the nature of vibes, I suppose.
On voting day here, you can be in and out in 5 minutes. The amount of time it takes to vote is the amount of time it takes for you to fill in the bubbles.
Tell me how that's worse than me waiting three hours in line in the Phoenix sun to vote, only to be given a provisional ballot because I'm the wrong demographic. And I know my ballot is going to be thrown in the trash and not counted.
You have never been to Japan. Visit. It will blow your mind.
edit: lol, "You're posting too fast" an hour later. Okay @dang, you win. I'll just take the "ur stoopid" comment like a champ. What kind of fucking Nazi won't even let you speak?
Lack of care is malice, though. Hanlon's razor only applies if they would aspire to do better but lack the awareness/capability, not if they'd happily accept the tradeoff.
Claude is happy and able to provide a laundry list of ways to mitigate the impact of tarpits on your crawler, and politeness / respecting robots.txt is only one of them.
reply