I tested 3 tiny local LLMs for everyday work, and only one of them impressed me
["google-gemma","qwen3","google","llms"]
Tracked topic
Qwen3 is an AI model family developed by Alibaba, released as a set of large language models for natural-language tasks.
["google-gemma","qwen3","google","llms"]
…It's more complicated to switch between multiple local LLMs, but honestly, the only one right now that's capable enough for coding is Qwen3-Coder-Next, and until that changes, I…
["google-gemma","google","qwen3"]
…Related I replaced ChatGPT and Claude with this powerful local LLM and saved over $20 a month while gaining full control Qwen3.6 runs on my old GPU and does what ChatGPT…
…On my main PC, I’ve got an RTX 3080 Ti running Qwen3.6-35B-A3B , and I use it for the majority of my VS Code tasks. But for the rest…
…After entering the LLM model name and the context window, I've now got Qwen3-Coder-Next as my coding companion and am using my workstation GPU to its full potential. Visual…
…ollama pull qwen3.5:9b Next, install Claude Code. Then point Claude Code at your local Ollama server instead of Anthropic's API by setting the environment variables. npm install -g @anthropic…
…Take Qwen3.6-27B , for example: it can run on an AMD APU or an Apple Mac with 32GB of unified memory or a 24GB graphics card, yet boasts "flagship coding power…
…Related I replaced ChatGPT and Claude with this powerful local LLM and saved over $20 a month while gaining full control Qwen3.6 runs on my old GPU and does what ChatGPT…
…With 32GB, you can run heavily quantized 70B parameter models, or fit something like Qwen3.5-27B at a decent quantization. 32GB of VRAM is a lot, and it's a step…