Speculative decoding made my local LLM actually usable
… Running a local LLM is easy until you actually try to use it every day

Five minutes to set up, five hours to realize you don't actually want to use it

Getting a local LLM running is the easy part. …