Chinese artificial intelligence startup DeepSeek’s latest AI model sparked a $1 trillion rout in US and European technology stocks, as investors questioned bloated valuations for some of America’s biggest companies.

  • MudMan@fedia.io
    link
    fedilink
    arrow-up
    6
    ·
    3 days ago

    OK, hold on, so I went over to huggingface and took a look at this.

    Deepseek is huge. Like Llama 3.3 huge. I haven’t done any benchmarking, which I’m guessing is out there, but it surely would take as much Nvidia muscle to run this at scale as ChatGPT, even if it was much, much cheaper to train, right?

    So is the rout based on the idea that the need for training hardware is much smaller than suspected even if the operation cost is the same… or is the stock market just clueless and dumb and they’re all running on vibes at all times anyway?

    • OmegaLemmy@discuss.online
      link
      fedilink
      arrow-up
      8
      ·
      3 days ago

      I thought everyone knew stocks were all vibes by now, private market might improve with competition but a public stock will always pick the most flashy option even if it’s shit just for appeal or they quite literally lose everything if it goes slightly wrong

    • jacksilver@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      3 days ago

      Everything I’ve seen from looking into it seems to imply it’s on par for training and performance as other (LLM only) models.

      I feel like I’m missing something here or that the market is “correcting” for other reasons.

    • Umbrias@beehaw.org
      link
      fedilink
      arrow-up
      2
      arrow-down
      1
      ·
      3 days ago

      it does not take an entire nvidia datacenter to serve one customer. the largest model appears to run on a high end rig.

    • sunzu2@thebrainbin.org
      link
      fedilink
      arrow-up
      1
      arrow-down
      1
      ·
      3 days ago

      Deepseek is the based on either llama or qwen, but can be put on top of any model?

      I tested qwen which sucked dick IMHO

      Now deepseek qwen is best thing I tried locally

  • sunzu2@thebrainbin.org
    link
    fedilink
    arrow-up
    9
    arrow-down
    9
    ·
    3 days ago

    Tech enjoyers spent the weekend fucking with these model to prove that chinaman sucks, only to realize chinaman did a great job…

    Gonna be hard justifying them salaries 🐸 when a bunch scrappy chinese wage slaves can do this eyyy?

    And before you come in

    chinaman is lying how much they spent on training, rheee

    Sure, maybe, but they also trained in prior gen hardware and they apparently outsoftwared the American grifters and their 500k a pop wage slaves

    500b pledged by US private capital, but hey daddy sam can we pretty please have some more cash, can you please build us electricity for this🤡

    Pathetic…

    • Fizz@lemmy.nz
      link
      fedilink
      arrow-up
      5
      arrow-down
      1
      ·
      3 days ago

      Exactly. Anyone who still thinks China is not on the cutting edge is not paying attention. Even with all the restrictions they’ve been pushing crazy technical advancements the last 5 years.