Fediverse is worse than Reddit. Mod abuse, admin abuse, disinformation, and people simping for literal terrorists.

  • 0 Posts
  • 41 Comments
Joined 11 months ago
Cake day: January 3rd, 2024





  • ggml_cuda_compute_forward: ADD failed
    CUDA error: shared object initialization failed
      current device: 0, in function ggml_cuda_compute_forward at ggml/src/ggml-cuda.cu:2365
      err
    ggml/src/ggml-cuda.cu:107: CUDA error
    

    I didn’t do anything past using yay to install the AUR koboldcpp-hipblas package, and customtkinter, since the UI wouldn’t work otherwise. The koboldcpp-rocm page very specifically does not mention any other steps in the Arch section and the AUR page only mentions the UI issue.
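For reference, a minimal sketch of one commonly reported workaround for RDNA2 cards that ROCm does not officially list. It assumes the card is a gfx1032 part (such as the 6650 XT mentioned elsewhere in this thread) and that making it report as gfx1030 through the `HSA_OVERRIDE_GFX_VERSION` environment variable is acceptable. Whether this resolves the "shared object initialization failed" error above is not confirmed by anything in this post. The `koboldcpp` executable name, the model path, and the layer count are placeholders, and the flags should be checked against the installed version's `--help`.

```python
import os
import shlex

# Assumption: gfx1032 cards are often run by reporting themselves as gfx1030.
GFX_OVERRIDE = "10.3.0"


def build_rocm_env(base_env=None):
    """Return a copy of the environment with the ROCm GFX override set."""
    env = dict(os.environ if base_env is None else base_env)
    env["HSA_OVERRIDE_GFX_VERSION"] = GFX_OVERRIDE
    return env


def build_command(model_path, gpu_layers=99):
    """Build a hypothetical koboldcpp command line (names are placeholders)."""
    return [
        "koboldcpp",                     # assumed executable name from the AUR package
        "--model", model_path,
        "--usecublas",                   # the ROCm fork reuses this flag for hipBLAS
        "--gpulayers", str(gpu_layers),
    ]


if __name__ == "__main__":
    env = build_rocm_env()
    cmd = build_command("model.gguf")
    # Print what would be run instead of launching anything.
    print(f"HSA_OVERRIDE_GFX_VERSION={env['HSA_OVERRIDE_GFX_VERSION']} {shlex.join(cmd)}")
    # To actually launch: subprocess.run(cmd, env=env, check=True)
```

Inside a distrobox, the same variable can simply be exported in the shell before starting the program; the script only makes the two pieces (environment and command) explicit.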



  • I distrohopped so much because each previous distro eventually broke and I clearly wasn’t smart enough to recover it. I’m honestly kinda sick of it, even if the immutable nature also annoys the shit out of me.

    My GPU is a 6650 XT, which should in principle work with ROCm.

    Which model specifically are you recommending? Llama-3.1-8B-Lexi-Uncensored-V2-GGUF? Because the original meta-llama ones are censored to all hell and Hugging Face is not particularly easy to navigate, and on top of that, figuring out the right model size & quantization is extremely confusing.


  • I just can’t get ROCm / GPU generation to work on Bazzite, like at all. It seems completely cursed. I tried koboldcpp through a Fedora distrobox and it didn’t even show any hardware options. Then I tried an Arch AUR package in a distrobox, and the ROCm option is there but ends with a CUDA error. lol The Vulkan option works but still seems to use the CPU more than the GPU, so it’s still kinda slow, and I struggle to find a good model for my 8GB card. Fimbulvetr-10.7B-v1-Q5_K_M, for example, was still too slow to be practical.

    Tried LM Studio directly in Bazzite and it also just uses the CPU. It’s also very unclear how to connect SillyTavern to it, since it asks for an API key? I managed it once in the past, but I can’t remember how, and it ended up not generating anything after a few replies anyway.

    Krita’s diffusion also only runs on the CPU, which is abysmally slow, but I’m not sure if they expect Krita to be built directly on the system for ROCm support to work.

    I’m not even trying to get SDXL or something to run at this point, since that seems to be still complicated enough even on a regular distro.
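On the model size and quantization question, a rough back-of-the-envelope check can help: the weights of a GGUF file take roughly parameter count times bits per weight, divided by 8. The sketch below assumes approximate bits-per-weight figures for a few common quantization types and an arbitrary 1.5 GB allowance for context and other buffers; both are illustrative assumptions, not exact values, and real file sizes vary by model.

```python
# Approximate bits per weight for some llama.cpp quantization types.
# These are rough, assumed figures; actual files differ per model.
APPROX_BPW = {
    "Q4_K_M": 4.85,
    "Q5_K_M": 5.69,
    "Q8_0": 8.5,
}


def estimate_gguf_size_gb(params_billion, bits_per_weight):
    """Estimate weight size in GB: parameters (billions) * bits / 8."""
    return params_billion * bits_per_weight / 8


def fits_in_vram(params_billion, bits_per_weight, vram_gb, overhead_gb=1.5):
    """Check whether weights plus an assumed overhead fit in the given VRAM."""
    needed = estimate_gguf_size_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= vram_gb


if __name__ == "__main__":
    candidates = [
        ("10.7B", 10.7, "Q5_K_M"),
        ("10.7B", 10.7, "Q4_K_M"),
        ("8B", 8.0, "Q4_K_M"),
    ]
    for label, params, quant in candidates:
        size = estimate_gguf_size_gb(params, APPROX_BPW[quant])
        ok = fits_in_vram(params, APPROX_BPW[quant], vram_gb=8.0)
        print(f"{label} {quant}: ~{size:.2f} GB weights, fits in 8 GB: {ok}")
```

Under these assumptions, a 10.7B model at Q5_K_M comes out at roughly 7.6 GB of weights alone, which leaves almost nothing on an 8GB card and would force part of the model onto the CPU. That would be consistent with the slowness described above, though it does not rule out a backend problem as well.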









  • No, I’m actually with them on that one. The he / they issue in and of itself is tiny, I agree, and if they’d just changed it from gendered to gender-neutral language then nobody would’ve even cared. Most of us tend to write in a gendered way out of habit or because we think about our own gender, and in a casual conversation that isn’t that important. But this is about a piece of software that, surely, is not just meant for male audiences. It’s just unprofessional to address someone as male by default. Most importantly though, being this stubborn about having the user be specifically male is just a weird hill to die on, and even weirder when that particular action is the one actually causing the drama - which they supposedly want to prevent by dismissing “politics”. And I’m sorry, but changing a “he” to “they” is not politics, it’s just including non-male users. Nothing more, nothing less. So why is it such an issue to stop addressing only male users? The only real explanation would be that those people hold some very questionable views, which, in my opinion, clash heavily with the whole concept of free and open source software, which is supposedly for everyone. So if your actions and views are this flawed, how can you be trusted with such an important project?

    Also, with regard to this news… “no code from rivals” is also just a stupid thing to say and do. There’s plenty of good open source code that they could and probably even SHOULD use. But whatever. I’m not gonna support this project, and I predict it will fail anyway.