The one-liner:

dd if=/dev/zero bs=1G count=10 | gzip -c > 10GB.gz

This is brilliant.

  • tal
    link
    fedilink
    English
    arrow-up
    18
    ·
    4 hours ago

    Anyone who writes a spider that’s going to inspect all the content out there is already going to have to have dealt with this, along with about a bazillion other kinds of oddball or bad data.

    • catloaf@lemm.ee
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      2
      ·
      2 hours ago

      Competent ones, yes. Most developers aren’t competent, scraper writers even less so.