[HELP] In GPT4All settings, selecting AMD graphics card yields no performance improvement over CPU

yo_scottie_oh@lemmy.ml · 2 months ago

Menschlicher_Fehler@feddit.org · 2 months ago

I am somewhat new to Linux and hosting local LLMs, but I think I had to install AMD ROCm for LLMs to work with my GPU.

https://rocm.docs.amd.com/en/latest/about/release-notes.html

pebbles@sh.itjust.works · 2 months ago

Can gpt4all use ROCm?

yo_scottie_oh@lemmy.ml · 2 months ago

Rats—according to their System Requirements (Linux) page, they don’t support Fedora. Even if I were to switch to a supported distro, it looks like only a small set of graphics cards are supported, and unfortunately, mine is not one of them. 😢

Supported graphics cards:

Supported operating systems:

Thanks anyway for the tip!

pebbles@sh.itjust.works · 2 months ago

Actually, I run Fedora and have ROCm working. In fact, it’s in the default package manager.

You can see 'em all by running: sudo dnf list | grep rocm

To get your GPU working, you can look up HSA_OVERRIDE_GFX_VERSION and maybe HIP_VISIBLE_DEVICES.
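For reference, a minimal sketch of how these two variables are usually set before launching a ROCm-backed app. The values are assumptions, not something confirmed in this thread: 10.3.0 is the value commonly used to make an RDNA2 card report as gfx1030, and 0 assumes the discrete GPU is the first HIP device.

```shell
# Assumption: RDNA2 card that should be treated as gfx1030 (10.3.0).
export HSA_OVERRIDE_GFX_VERSION=10.3.0
# Assumption: the discrete GPU is HIP device 0.
export HIP_VISIBLE_DEVICES=0

# Optional check: list the gfx targets ROCm sees, if rocminfo is installed.
if command -v rocminfo >/dev/null 2>&1; then
    rocminfo | grep -i 'gfx' || true
fi
```

The exports only affect programs started from the same shell session, so the app has to be launched from that terminal afterwards.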

Though this is the techy route. If you get LM Studio running, or even better llama.cpp, you’ll have access to much better quantization formats than q4_1.

So, you’ll be at the same speed or even faster than Vulkan, and with high quality outputs.

yo_scottie_oh@lemmy.ml · 2 months ago

Thanks for the info—maybe I’ll give this another whirl when I have some more time.

Which card are you running on?

pebbles@sh.itjust.works · 2 months ago

I use the 24GB 7900 XTX.

I wonder why ROCm 6.4 doesn’t support you, but ROCm 6.3 does. Maybe there is a way to downgrade. Also, the HSA_OVERRIDE_GFX_VERSION environment variable may be enough to get 6.4 working for you. Not sure though.

I’d say an easy route (if it works lol) would be using dnf to install ROCm, and then using LM Studio’s installer to get the rest.

yo_scottie_oh@lemmy.ml · 2 months ago

“I wonder why ROCm 6.4 doesn’t support you, but ROCm 6.3 does.”

Wait, where are you reading that 6.3 supports the 6950 XT? I dug up the System Requirements (Linux) page for 6.3 and it lists the same cards as the 6.4 page. Is there another document out there that covers this topic?

pebbles@sh.itjust.works · 2 months ago

Okay I rechecked and it looks like 6.4 and 6.3 have similar compatibility/incompatibility with certain cards.

Here are the gfx versions of different AMD cards:

https://rocm.docs.amd.com/en/develop/reference/gpu-arch-specs.html

Here is the compatibility matrix for 6.4:

https://rocm.docs.amd.com/en/docs-6.4.0/compatibility/compatibility-matrix.html

So, given this extra bit of research, it looks like you may be able to run ROCm on a 6950 XT, but I’m not sure about a 6750 XT.

From my experience, ROCm supports more cards than they say it does. They list the cards they’ve tested, but others may still work. I was running ROCm on my 7900 XTX before they officially supported it.

Menschlicher_Fehler@feddit.org · 2 months ago

I don’t have a clue, I only tried LM Studio and Automatic1111.

yo_scottie_oh@lemmy.ml · 2 months ago

What card are you running on?

Menschlicher_Fehler@feddit.org · 2 months ago

7900 XTX. Sorry, forgot that ROCm only supports some cards.

pebbles@sh.itjust.works · 2 months ago

My best guess would be that you installed the Flatpak version of GPT4All, and somehow that messed with its ability to use the GPU.

Vulkan should work with your GPU. The model you chose should fit in your GPU’s memory if it is q4_0 or q4_1, which I think is the default in GPT4All.
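As a rough sanity check on the "should fit" claim, here is a back-of-the-envelope sketch of weight-only size for the two formats mentioned. It assumes q4_0 stores about 4.5 bits per weight and q4_1 about 5 bits per weight (4-bit values plus per-block scale data). The 8000-million parameter count is a hypothetical input, since the thread does not name the model, and the function name is made up for illustration.

```shell
# Rough weight-only size estimate for a quantized model, in MB (10^6 bytes).
# $1 = parameter count in millions (hypothetical input)
# $2 = bits per weight times 10 (45 for q4_0, 50 for q4_1)
estimate_weights_mb() {
    echo $(( $1 * $2 / 80 ))
}

estimate_weights_mb 8000 45   # q4_0, hypothetical 8B model -> 4500
estimate_weights_mb 8000 50   # q4_1, same model            -> 5000
```

This ignores the context cache and runtime overhead, and real model files often keep a few tensors at higher precision, so treat the result as a lower bound when comparing against the card's VRAM.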

yo_scottie_oh@lemmy.ml · 2 months ago

I’ll try installing non-flatpak GPT4All in a distrobox and see if I get a different result. Thanks for the idea.

pebbles@sh.itjust.works · 2 months ago

For sure!

yo_scottie_oh@lemmy.ml · 2 months ago

Update: I thought I’d report back on my progress. I tried installing GPT4All in distrobox containers using several different images (Ubuntu 24.04 and 22.04, and Fedora 41), but in every case the installer fails because of missing dependencies, so I can’t get to the installer GUI. Upon further investigation, it appears that GPT4All does not support Wayland. There is an open feature request from last year, but I’m not holding my breath. I did some cursory searches for workarounds, but couldn’t figure it out in the time I had available today.

[me@UbuntuTestingGpt4All ~]$ ./gpt4all-installer-linux.run 
./gpt4all-installer-linux.run: error while loading shared libraries: libxkbcommon-x11.so.0: cannot open shared object file: No such file or directory
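The error above names a specific missing shared library, so one possible next step is to list everything the installer fails to resolve inside the container. A minimal sketch, assuming the installer is a dynamically linked binary that `ldd` can inspect; `missing_libs` is a hypothetical helper name.

```shell
# Print the names of shared libraries that ldd reports as missing.
# Reads ldd-style output on stdin.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}

# Usage (installer path taken from the output above):
# ldd ./gpt4all-installer-linux.run | missing_libs
```

Each name printed can then be matched to a package in the container's distro. For the library in this error, that is most likely `libxkbcommon-x11-0` on Ubuntu or `libxkbcommon-x11` on Fedora, though this has not been verified against these exact images.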

I wonder if I would have the same issue if I tried this while running an X session on the host machine. I’ll post another update if I test this scenario.

Anyway, thanks again for the tip.

pebbles@sh.itjust.works · 2 months ago

For sure!