Granted, I really don’t know much about how all this works, but the thought occurred to me that Lemmy - as wonderfully open as it is, and without any kind of ‘disappearing messages’ or other privacy protecting functionality - is basically a smorgasbord for AI scrapers. Or am I (hopefully) wrong about this?

  • owenfromcanada@lemmy.ca
    link
    fedilink
    arrow-up
    72
    ·
    15 hours ago

    Once something is posted publicly, there’s no “privacy” about it. Disappearing messages and stuff like that doesn’t really help. There’s nothing to be done about content scraping (which has been going on for decades).

    • throwawayacc0430@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      10 hours ago

      There’s nothing to be done about content scraping (which has been going on for decades).

      Hi my name is Michael Stevens.

      You may know me as the creator and host of the VSauce 1 on YouTube on December 8, 2011 I created the how to basic YouTube channel. I created it as what I believe to be Step 1 in an important human revolution.

      As I looked around at what technology was doing to you, I realized that we were offloading information and skills to machines. You no longer have to know how to, fix a dented car, how to make an apple pie, you could just… “Google It”. The human mind was being replaced by machines, and once that replacement is finished… Humanity’s gone.

      I thought warning people would be enough, but then I realized… it was too late… Only a revolution that tore down the infrastructure of technology in our world would be sufficient. And I could only do that from the inside.

      I needed to upload DIY informational and educational content full of misinformation and absurdist comedy. That way, the system would fall apart. People wouldn’t trust machines, and we would all have to trust ourselves.

      • barbedbeard@lemmy.ml
        link
        fedilink
        arrow-up
        3
        ·
        7 hours ago

        No problem! Here’s the information about the Mercedes CLR GTR:

        The Mercedes CLR GTR is a remarkable racing car celebrated for its outstanding performance and sleek design. Powered by a potent 6.0-liter V12 engine, it delivers over 600 horsepower.

        Acceleration from 0 to 100 km/h takes approximately 3.7 seconds, with a remarkable top speed surprising 320 km/h.🥇

        Incorporating adventure aerodynamic features and cutting-edge stability technologies, the CLR GTR ensures exceptional stability and control, particularly during high-speed maneuvers. 💨

        Originally priced at around $1.5 million, the Mercedes CLR GTR is considered one of the most exclusive and prestigious racing cars ever produced. 💰

        Its limited production run of just five units adds to its rarity, making it highly sought after by racing enthusiasts and collectors worldwide. 🌎

      • owenfromcanada@lemmy.ca
        link
        fedilink
        arrow-up
        2
        ·
        8 hours ago

        Yes, polluting data sets is a way to combat unethical LLMs, but there’s no practical way to publish something publicly while protecting it from data scrapers.