Search

Showing top 143 results for "Qwen3"

Qwen3

Qwen3 is an AI model family developed by Alibaba, released as a set of large language models for natural-language tasks.

49 articles indexed Last updated 4d ago See topic hub

Videos

Solving the Agentic AI Trilemma – Cost, Scale, and Data Security

…Local LLM based on quantized Qwen 3.6-35B-A3B model - https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF/blob/main/Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf – served…

May 21, 2026

HP ZGX Nano G1n AI Station Review: A Secure, Sustainable Desk-Side AI Node

…Qwen3 Coder 30B A3B FP8 For Qwen3 Coder 30B A3B (FP8), HP again excels in Prefill Heavy, with throughput increasing from 432.2 tok/s at batch size 1 to 2069.4…

Apr 24, 2026

MLPerf Inference v6.0: NVIDIA Blackwell Ultra, 누적 291회 우승

…시간(TTFT)은 1.3배 단축된 기준을 적용하여 높은 상호작용이 필요한 배포 환경을 반영합 니다. Qwen3-VL-235B-A22B : 총 2,350억 개의 파라미터를 가진 시각-언어 모델(VLM)입니다. MLPerf Inference…

Apr 1, 2026 · Ashraf Eassa

You don't need an expensive GPU to run a local LLM that actually works

…So, I decided to launch two LLMs on the hardware, Qwen3:4b and Qwen2.5-coder:7b. These are small models but have proven useful in handling submitted queries. Turns out, I…

Apr 29, 2026 · Rich Edmonds

Discussions and forums

Hacker News · u/thc1006 · Apr 21, 2026

Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090

5 2

Hacker News · u/GreenGames · Apr 20, 2026

We got 207 tok/s with Qwen3.5-27B on an RTX 3090

165 52

Hacker News · u/freakynit · Apr 17, 2026

Show HN: Open Access Qwen3.6-35B-A3B-UD-Q5_K_M with TurboQuant

https://w418ufqpha7gzj-80.proxy.runpod.netStarted for myself, but since Im not using it continuously, sharing it:Open Access Qwen3.6-35B-A3B-UD-Q5_K_M with TurboQuant (TheTom/llama-cpp-turboquant) on RTX 3090 (Runpod spo…

4 2

r/LocalLLaMA · u/Signal_Ad657 · 3w ago

Qwen3.6-27B vs Coder-Next

Burned about 20 hours of side-by-side compute on my two RTX PRO 6000 Blackwells trying to get a definitive answer on which of these two models was clearly better. As with many things in life, after many tokens and kWhs l…

Hacker News · u/lastdong · 2w ago

Club-3090 Recipes for serving QWEN3.6 27B locally on RTX 3090s

We Got Claude to Build CUDA Kernels and teach open models!

…for this article! Can this be replicated to use open-source models such as Qwen/Qwen3-30B-A3B-Thinking-2507 to generate the agent trace and the skill file? From my initial…

Jan 28, 2026 · ben burtenshaw

Launching AMD AI Playbooks: Step-by-Step Guides for Building with AI Locally with AMD

…ComfyUI + Z Image Turbo - Generate images locally on AMD hardware n8n + local LLMs - Build AI-powered automation workflows VS Code + Qwen3-Coder - Run a local coding assistant on-device LM Studio - Serve…

May 12, 2026 · AMD AI Group

Apple-trained AI captions images better than models 10× its size - 9to5Mac

…2.5 Pro, GPT-5, Qwen2.5-VL-72B-Instruct, Gemma-3-27B-IT, and Qwen3-VL-30B-A3B-Instruct. At the same time, the model being trained under RubiCap produced its…

Mar 25, 2026 · Marcus Mendes

Investing in World Labs

…We achieved up to 10× faster LLM initialization (from ~10s to ~1s) — as measured on Qwen3-4B running on AMD Ryzen™ AI — with zero impact on inference correctness. May 21, 2026 Agent…

May 21, 2026 · Sagi Paz

I fine-tuned a 7B model to write my Home Assistant automations, and it actually works

…Using the Lenovo ThinkStation PGX (the same device I used for Qwen3 Coder Next ), I fine-tuned Qwen2.5-Coder-7B-Instruct, a 7-billion-parameter open-weight model, to actually understand…

Mar 28, 2026 · Adam Conway

Chatbots excel at manipulating people into buying things

…The researchers randomly assigned GPT-5.2, Claude Opus 4.5, Gemini 3 Pro, DeepSeek v3.2, or Qwen3 235b to handle these conversations, to ensure their results didn’t report the…

Apr 9, 2026 · Thomas Claburn

Followed topics