Lemmy Today
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
gay_king_prince_charles [she/her, he/him]@hexbear.net to technology@hexbear.netEnglish · 12 days ago

Alibaba Cloud releases Qwen3, an open weight LLM set that outperforms ChatGPT-o1 with only 32B parameters

qwenlm.github.io

external-link
message-square
6
link
fedilink
  • cross-posted to:
  • localllama@sh.itjust.works
  • hackernews@lemmy.bestiver.se
26
external-link

Alibaba Cloud releases Qwen3, an open weight LLM set that outperforms ChatGPT-o1 with only 32B parameters

qwenlm.github.io

gay_king_prince_charles [she/her, he/him]@hexbear.net to technology@hexbear.netEnglish · 12 days ago
message-square
6
link
fedilink
  • cross-posted to:
  • localllama@sh.itjust.works
  • hackernews@lemmy.bestiver.se
Qwen3: Think Deeper, Act Faster
qwenlm.github.io
external-link
QWEN CHAT GitHub Hugging Face ModelScope Kaggle DEMO DISCORD Introduction Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ-32B with 10 times of activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.
alert-triangle
You must log in or register to comment.
  • JoeByeThen [he/him, they/them]@hexbear.net
    link
    fedilink
    English
    arrow-up
    13
    ·
    12 days ago

    nicholson-yes

    • JoeByeThen [he/him, they/them]@hexbear.net
      link
      fedilink
      English
      arrow-up
      9
      ·
      12 days ago

      Already on ollama.

      https://ollama.com/library/qwen3

      • gay_king_prince_charles [she/her, he/him]@hexbear.netOP
        link
        fedilink
        English
        arrow-up
        6
        ·
        12 days ago

        I’ve found Qwen preferable to DeepSeek for coding so I can’t wait to try this out

        • JoeByeThen [he/him, they/them]@hexbear.net
          link
          fedilink
          English
          arrow-up
          5
          ·
          12 days ago

          I’ve not used Qwen yet, but I have noticed deepseek, specifically r1, is kind of a lazy coder. Lot of ‘step 5 draw the rest of the owl’ type responses.

          Unrelated, but does anyone else’s internet speed come to a screeching halt when trying to download models from ollama? I swear I’m being throttled by xfinity.

          • gay_king_prince_charles [she/her, he/him]@hexbear.netOP
            link
            fedilink
            English
            arrow-up
            5
            ·
            12 days ago

            That might just be LLMs in general. ChatGPT does the same. Copilot is a little more well-tuned, but I really only ever have it do boilerplate.

            • JoeByeThen [he/him, they/them]@hexbear.net
              link
              fedilink
              English
              arrow-up
              2
              ·
              12 days ago

              I’ve had really good luck with chatgpt 4o, and, to be fair, I have teased some decent responses out of deepseek 3 (iirc). Different ways of expanding on the basic principles of asking it to ‘step back and visualize different options before moving forward and fully implementing them with all necessary code, following best practices, etc.’ tends to get pretty good results.

technology@hexbear.net

technology@hexbear.net

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@hexbear.net

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

  • Ways to run Microsoft/Adobe and more on Linux
  • The Ultimate FOSS Guide For Android
  • Great libre software on Windows
  • Hey you, the lib still using Chrome. Read this post!

Rules:

  • 1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
  • 2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
  • 3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
  • 4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
  • 5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
  • 6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
  • 7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 25 users / day
  • 605 users / week
  • 1.34K users / month
  • 2.9K users / 6 months
  • 32 local subscribers
  • 23.7K subscribers
  • 1.69K Posts
  • 19.6K Comments
  • Modlog
  • mods:
  • context [fae/faer, fae/faer]@hexbear.net
  • EmmaGoldman [she/her, comrade/them]@hexbear.net
  • SexUnderSocialism [she/her]@hexbear.net
  • gaycomputeruser [she/her]@hexbear.net
  • ZoomeristLeninist [they/them, she/her]@hexbear.net
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org