Ollama is still the easiest way to start local LLMs, but it's the worst way to keep running them
… 04 / 8 Hardware When running an LLM with Ollama, what hardware component has the biggest impact on inference speed? A Hard disk drive HDD read speed B CPU clock speed in GHz C Available VRAM on the GPU D Internet bandwidth Spot on! VRAM is the key bottleneck for local LLM inference. …