What to Watch
- Follow r/LocalLLaMA threads for updates on working non-CUDA inference setups.
r/LocalLLaMA
What Changed
- What's the status of non-CUDA inference?
r/LocalLLaMA