• 18 Posts
  • 1.85K Comments
Joined 9 months ago
cake
Cake day: November 27th, 2024

help-circle





  • Documentation is easier. Tools for PDF, Markdown have increased in efficacy. Coding alone has lowered the barrier to bringing building blocks and some understanding to the masses.

    I have seen none of these, in practice.

    The documentation generated is no better than what a level 1 support rep creates, and needs to be heavily fixed before being relied on.

    Pandoc still produces PDFs, Markdown, etc just as quickly as it always has.

    The code produced still has the same issues as documentation: it’s shite, and not easily bug fixed due to a lack of understanding by anyone with what its actually doing. And, if you need someone who understand the code already to bugfix it, guess what? You didn’t save anyone anything.

    And, all of this, only using terrawatts more electricity than before, with equivalent or worse outcomes.



  • LLM might be worse than those but Perplexity is certainly a lesser player in the field.

    Its a good thing I don’t just block Perplexity, but all of the LLMs.

    And I wont comment on the rest of this, but lets consider another form of property: Real estate.

    You own a plot of land. Should others be able to use it, however they feel, whenever they feel like? Or should you have a say in how it gets used?

    If you feel like you should have exclusive say in how real estate you own is used and when and by whom, why is intellectual property any different? There must be value in using it, so what’s wrong with revenues generated by that use being shared (At least) with the creator?

    Last I checked, I’m not seeing rev shares from any of these LLMs that have certainly used my code and other content to train?


  • Information should not be gated or owned in a way that would make it illegal for anyone to access it under proper conditions.

    Then you don’t believe content creators should have any control over their own works?

    The “proper conditions” are deemed by the content creator, not the consumers.

    Doing a GET request doesn’t do that.

    Not at all. It consumes at most, a watt.

    What kind of problems that would be?

    Increasing my hosting bill, to accommodate the senseless traffic being sent my way?

    Outages for my site, making my content unavailable for legitimate users?

    You have to agree that at one point “be used by LLM” would not be different from “be used by a user”.

    Not at all. LLMs are not users.

    It’s self-hosted and free.

    If you want, or they charge for the hosted version. If they want to use a paid for version, then they can divert some of that revenue to me, the creator, because without creators, they would have no product.

    How does that prohibit usage and processing of your info? That sounds like “I won’t be providing any comments on Lemmy website, if you want my opinion you can mail me at a@b.com

    That’s a apples and oranges comparison, and you know it.

    That will never block all of them. Your info will be used without your consent and you will not feel troubled from it. So you might not feel troubled if more things do the same.

    Perplexity seems to be troubled by it.

    What if I use my local hosted LLM? Anyway, the point is, selling text can’t work well, and you’re going to spend much more resources on collecting and summarizing data about how your text was used and how others benefited from it, in order to get compensation, than it worths.

    If selling text can’t work well, then why do LLM products insist on using my text, to sell it?

    Also, it might be the case that some information is actually worthless when compared to a service provided by things like LLM, even though they use that worthless information in the process.

    LLMs are a net negative, as far as costs go. They consume far more in resources than they provide in benefit. If my information was worthless without an LLM, it’s worthless with an LLM, therefore, LLMs don’t need to access it. Periodt.

    The bottom line? Content creators get the first say in how their content is used, and consumed. You are not entitled to their labor, for free, and without condition.


  • Ironically, you completely fell into proving my point.

    You mean, by supplying actual examples of when it was tried, and when it worked?

    Somehow, I don’t think you’ll figure that out… Not today, maybe, but in the future, hopefully.

    Someday, you’ll figure out that capitalism is basically just “Might equals right”. Not today, by in the future, maybe. I don’t hold out much hope, though.






  • Some of the truth is revealed.

    I didn’t think this was a hard to believe concept, that adults condition children, and in the US, and many western societies, we condition people to think anything biological or sexual is “dirty”, and associating with “dirty” leads to (often, physical) punishment.

    And, all animals will avoid physical punishment, because it’s painful. Humans, and many other non-human sentient beings, also experience pain from social and emotional punishments.

    Self-fulfilling prophecies are a thing. People struggle to grasp the alternative possibilities.

    I’m not sure how that’s a self-fullfilling prophecy. It’s more a description of how society works.


  • That all sounds very vague to me, and I don’t expect it to be captured properly by law any time soon.

    It already has been captured, properly in law, in most places. We can use the US as an example: Both intellectual property and real property have laws already that cover these very items.

    What does it mean for you and how is it different from being accessed by a user?

    Well, does a user burn up gigawatts of power, to access my site every time? That’s a huge different.

    Imagine you host a weather forecast. If that information is public, what kind of compensation do you expect from anyone or anything who accesses that data?

    Depends on the terms of service I set for that service.

    Is it okay for a person to access your site?

    Sure!

    Is it okay for a script written by that person to fetch data every day automatically?

    Sure! As long as it doesn’t cause problems for me, the creator and hoster of said content.

    Would it be okay for a user to dump a page of your site with a headless browser?

    See above. Both power usage and causing problems for me.

    Would it be okay to let an LLM take a look at it to extract info required by a user?

    No. I said, I do not want my content and services to be used by and for LLMs.

    Have you heard about changedetection.io project?

    I have now. And should a user want to use that service, that service, which charges 8.99/month for it needs to pay me a portion of that, or risk having their service blocked.

    There no need to use it, as I already provide RSS feeds for my content. Use the RSS feed, if you want updates.

    If some of these sound unfair to you, you might want to put a DRM on your data or something.

    Or, I can just block them, via a service like Cloud Flare. Which I do.

    Would you expect a compensation from me after reading your comment?

    None. Unless you’re wanting to access if via an LLM. Then I want compensation for the profit driven access to my content.