Granted, I really don’t know much about how all this works, but the thought occurred to me that Lemmy - as wonderfully open as it is, and without any kind of ‘disappearing messages’ or other privacy protecting functionality - is basically a smorgasbord for AI scrapers. Or am I (hopefully) wrong about this?
Have you seen the quality of the comments and posts? It’s mostly pointless garbage spewing—yes, myself included. I’m convinced that part of the reason LLMs can be so bad at times is that they are fed on random peoples’ boredom and doom posting.
Sure, there’s some quality posts occasionally. Sometimes people have interesting, worthwhile discussions. But, like Reddit before it, most of the posting is memes, snark and venting. It’s not good content on average. If LLMs are training on barely-moderated forums, they are not getting a good education.