Showing top 134 results for "Qwen3"

Qwen3

Qwen3 is a family of large language models developed by Alibaba for natural-language tasks.

136 articles indexed · Last updated 10h ago

Paper page - Qwen-Image-Flash: Beyond Objective Design

Tags: qwen3

Jun 4, 2026

I switched my local AI setup to AMD's Lemonade after Nvidia support landed, and solved my local AI portability problem

…Model | Backend | TTFT (ms) | TPS | VRAM peak (GB)
Llama 3.2 3B | Vulkan | 188 | 78.3 | 4.0
Qwen3 8B (Q4_1) | ROCm | 95 | 41.2 | 6.6
Qwen3 8B (Q4_1…

Jul 7, 2026 · Joe Rice-Jones

I wrote a script to run Claude Code with my local LLM, and skipping the cloud has never been easier

…The way it handles tool integration, file edits, and permissions just feels more polished to me, and I've had better success with local models like Qwen3 Coder Next with Claude Code…

Mar 20, 2026 · Adam Conway

AMD's GAIA Defaults To Better Model, Continued Improvements For Local AI

Tags: amd, qwen3, google-gemma

May 2, 2026

Discussions and forums

r/LocalLLaMA · u/LLMFan46 · May 26, 2026

Qwen3.5 35B A3B uncensored heretic Native MTP Preserved is Out Now With the Full 785 MTPs Preserved and Retained, Available in Safetensors, GGUFs, NVFP4, NVFP4 GGUFs and GPTQ-Int4 Formats

Safetensors, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved: https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved GGUFs, llmfan46/Qwen3.5-35B-A3B-uncensored-here…

Hacker News · u/thc1006 · Apr 21, 2026

Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090

5 2

Hacker News · u/GreenGames · Apr 20, 2026

We got 207 tok/s with Qwen3.5-27B on an RTX 3090

165 52

r/LocalLLaMA · u/Beamsters · May 20, 2026

Qwen3.7 Max scored by Artificial Analysis, 27B/35B waiting room

https://preview.redd.it/42ak5qmus82h1.png?width=1133&format=png&auto=webp&s=744ea3dfc06c83d0c4d8aa128c39b3238b17d7be Qwen 3.7 Max sitting at 5th, pretty much on par with GPT 5.4 (xhigh) and a notch above the just release…

Hacker News · u/freakynit · Apr 17, 2026

Show HN: Open Access Qwen3.6-35B-A3B-UD-Q5_K_M with TurboQuant

https://w418ufqpha7gzj-80.proxy.runpod.net Started for myself, but since I'm not using it continuously, sharing it: Open Access Qwen3.6-35B-A3B-UD-Q5_K_M with TurboQuant (TheTom/llama-cpp-turboquant) on RTX 3090 (Runpod spo…

4 2

Paper page - WriteSAE: Sparse Autoencoders for Recurrent State

…Atom substitution beats matched-norm ablation on 92.4% of n=4,851 firings at Qwen3.5-0.8B L9 H4, the 87-atom population test holds at 89.8%, the closed…

May 14, 2026

I gave a local LLM full control over my Proxmox node, and it worked better than I expected

…Since my RTX 3080 Ti can drive the bulky Qwen3.6-35B-A3B with relative ease thanks to MoE offloading, I ran it with llama-server and paired it with the Pi…

Jul 2, 2026 · Ayush Pande

NVIDIA Shows DLSS 4, Path Tracing, and RTX Mega Geometry with New Downloadable Bonsai Diorama Demo

…In other NVIDIA news, ACE (Avatar Cloud Engine) can now use the open-source Qwen3-8B AI model as an In-Game Inferencing (IGI) SDK plug-in. This simplifies integration within a…

Oct 22, 2025 · Alessio Palumbo

AMD Ryzen™ AI Halo for AI Developers

Tags: amd, ryzen, qwen3, windows-11

I replaced my ChatGPT subscription with a local AI coding tool and haven't looked back

…Related I finally found a local LLM I actually want to use for coding Qwen3-Coder-Next is a great model, and it's even better with Claude Code as a harness…

Apr 14, 2026 · Korbin Brown

I stopped running the biggest local LLM that could fit, and a 2B model handles 90% of what I need

…Related Qwen3.5-9B tops every AI benchmark right now, but that's not how you should pick a model There's a lot more to a model than just benchmarks. How…

Jul 2, 2026 · Nolen Jonker
