AI companies are violating a basic social contract of the web and ignoring robots.txt (www.theverge.com)
Posted by Andy Reid@lemmy.world to Technology@lemmy.world · English · 1 year ago · 200 comments
Cross-posted to: technology@midwest.social, technology@beehaw.org, technology@lemmy.zip
ShitpostCentral@lemmy.world · 1 year ago
Your second point is a good one, but you absolutely can log the IP that requested robots.txt. That's just a standard part of any HTTP server ever, no JavaScript needed.
GenderNeutralBro@lemmy.sdf.org · 1 year ago
You’d probably have to go out of your way to avoid logging this. I’ve always seen such logs enabled by default when setting up web servers.
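To make the point above concrete, here is a rough sketch of pulling the IPs that fetched robots.txt out of an ordinary access log. It assumes the log uses the Common/Combined Log Format that many web servers write by default. The function name `robots_txt_requesters` and the sample log lines are made up for illustration, so adjust the pattern if your server logs in a different format.

```python
import re
from collections import Counter

# Assumption: Common/Combined Log Format, i.e.
#   IP ident user [timestamp] "METHOD path PROTOCOL" status size ...
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "(?P<method>[A-Z]+) (?P<path>\S+)[^"]*"'
)


def robots_txt_requesters(lines):
    """Count requests for /robots.txt per client IP (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that do not look like access-log entries
        # Ignore any query string when comparing the requested path.
        path = match.group("path").split("?", 1)[0]
        if path == "/robots.txt":
            counts[match.group("ip")] += 1
    return counts


# Made-up sample entries using documentation-reserved IP ranges.
SAMPLE_LOG = [
    '203.0.113.7 - - [10/Feb/2024:13:55:36 +0000] "GET /robots.txt HTTP/1.1" 200 68 "-" "ExampleBot/1.0"',
    '198.51.100.4 - - [10/Feb/2024:13:55:40 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
    '203.0.113.7 - - [10/Feb/2024:13:56:01 +0000] "GET /robots.txt?x=1 HTTP/1.1" 200 68 "-" "ExampleBot/1.0"',
    "not a log line",
]

if __name__ == "__main__":
    for ip, hits in robots_txt_requesters(SAMPLE_LOG).most_common():
        print(ip, hits)
```

In practice you would pass an open log file instead of `SAMPLE_LOG`. The path to that file depends on your server and distribution, so it is left out here.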