Showing top 109 results for "LLMs for chip design"

Xiaomi bets big on homegrown silicon – Fudzilla.com

…For the past five years, Xiaomi has invested 105.5bn yuan, about €13.4bn, in automobiles, custom silicon, foundational LLMs, major home appliances and other pet projects. That near €13.4bn spend…

May 22, 2026 · Nick Farrell

How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car | NVIDIA Technical Blog

…TensorRT Edge-LLM is the NVIDIA inference framework for autoregressive models including LLMs, VLMs, and VLAs on embedded platforms. It is designed specifically for the needs of an embedded context: low latency…

May 5, 2026 · Felix Friedmann

ASUS ProArt P16, P14 & Mini PC Powered by NVIDIA RTX Spark at Computex 2026

…The new lineup — including the ProArt P16 (H7607) and P14 (H7407) laptops, alongside the ProArt Mini PC — is designed for AI creators, developers, and creative professionals who demand powerful local AI capabilities…

Jun 1, 2026

Unweight: how we compressed an LLM 22% without sacrificing quality

…Several recent systems (Huff-LLM, ZipNN, and ZipServ) have shown that LLM weights can be compressed significantly, but these approaches target different problems than ours. ZipNN compresses weights for distribution and storage…

Apr 17, 2026 · Mari Galicer

Building for the Rising Complexity of Agentic Systems with Extreme Co-Design | NVIDIA Technical Blog

…employs extreme co-design across multiple specialized chips (NVL72, Vera CPU, Groq 3 LPX, NVLink 6, ConnectX-9, BlueField-4, Spectrum-X) and software optimizations (Dynamo, NVFP4, TRT-LLM WideEP, Speculative Decoding…

May 5, 2026 · Eduardo Alvarez

Mac Studio: Should You Buy? Or Wait?

…The M3 Ultra chip is overkill for most users because it is expensive and designed with professional applications in mind. It's for visual effects artists, animators, those working with LLMs, and…

Apr 27, 2026 · Juli Clover

AMD Ryzen AI MAX+ 495 "Gorgon Halo" Leak Smokes Strix Halo by 10%, Packs 192GB Memory and Radeon 8065S

…The chip was spotted on a platform with 192 GB of memory, much higher than the 128 GB memory that Strix Halo currently supports. This will be fantastic for running large LLMs…

May 3, 2026 · Hassan Mujtaba

Discussions and forums

r/LocalLLaMA · u/Porespellar · May 8, 2026

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will.

There is a lot of disdain for DGX Sparks here on the sub. And I get it. A lot of people say “It could have been great if it had been better memory bandwidth”, “SM-121 is a fake/second-class Blackwell chip” yadda, yadda.…

Hacker News · u/stealthtsdb · Apr 25, 2026

Show HN: Agent MCP Studio – build multi-agent MCP systems in a browser tab

I built a browser-only studio for designing and orchestrating MCP agent systems for development and experimental purposes. The whole stack — tool authoring, multi-agent orchestration, RAG, code execution — runs from a si…

Alibaba Targets NVIDIA's Hopper With Zhenwu M890 AI Chip, Claiming 3x The H20 Performance, 144GB HBM3 & A Roadmap Through 2028

Alibaba has unveiled its latest AI chip, "Zhenwu M890" and AI LLM "Qwen3.7-Max", designed for Agentic AI workloads. As Agentic AI Rages On, Alibaba Rolls Out Its Own AI Chip…

May 20, 2026 · Hassan Mujtaba

Turn Your Vacant M.2 Slot Into a 20B LLM Cruncher With This Dedicated AI Module: Packing 32 GB Memory & 60 TOPs

…accelerator from EdgeCortix, which was initially designed for low-power AI platforms, bringing these capabilities to Raspberry Pi5 & other ARM-based products. The accelerator chip features an NPU with 60 TOPS of…

Apr 15, 2026 · Hassan Mujtaba
