50% of my traffic is scrapers now. I really want to block them but I also want my content to be indexed and used for LLMs. At the moment there isn’t really an in-between way of doing that. :(
(I say this knowing they fuck up the power grids and the memory chip supply; I’m just hoping that gets better soon.)
Why do you want your stuff in the lie machines? 🤔
So that they do not become lie machines. Propaganda, lies, and fake news from many different sources get spammed all across the internet. If AI picks that up, it will just spread misinformation, especially if all the trustworthy or useful sources block the scrapers.
This will just make them sound more believable when they hallucinate. LLMs fundamentally cannot be made to stop lying, even if all the info they are trained on is 100% accurate.
That’s a very reasonable point I had not considered.
And very valid. Most of the data they use comes from Reddit and Twitter. Garbage in, garbage out.