Selfhost an LLM

Shimitar@downonthestreet.eu · 27 days ago

Selfhost an LLM

ragingHungryPanda@piefed.keyboardvagabond.com · edit-2 26 days ago

not for LLMs. I have a 16GB and even what I can fit in there just isn’t really enough to be useful. It can still do things and quickly enough, but I can’t fit models that large enough to be useful.

I also don’t know if your GPU is compatible with ROCM or not.

eleitl@lemmy.zip · edit-2 25 days ago

The GPU used to but they dropped ROCm support for Radeon V and VII some time ago. Have to look at that Strix Halo/AI Max thing I guess.