Search

Showing top 20 results for "Local voice assistant"

People also ask

How are NVIDIA and the OSS community accelerating inference for local agentic AI?

With agents running 24 hours a day, seven days a week on increasingly complex tasks, efficient local compute matters even more. NVIDIA has collaborated with the open source community to enhance the top inference backends for agents, llama.cpp and vLLM. llama.cpp now delivers 2x performance on Qwen 3.5 and 3.6 27B dense models, and 1.6x performance on Qwen 3.5 and 3.6 35B mixture-of-experts (MoE) models. The following two techniques make this possible: Multi-Token Prediction (MTP): An advanced speculative decoding technique, where a smaller draft model proposes several tokens ahead that the targ…

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical Blog

How to Build a Voice Agent with RAG and Safety Guardrails | NVIDIA Technical Blog

… Prerequisites: Before you begin this tutorial, you’ll need: NVIDIA API Key for cloud-hosted reasoning models (get one free). Local deployment requires: ~20GB of disk space; NVIDIA GPU with at least 24GB of VRAM; operating system with Bash (Ubuntu, macOS, or Windows Subsystem for Linux); Python 3.10+ enviro… …

Jan 5, 2026 · Chris Alexiuk

Build Next-Gen Physical AI with Edge‑First LLMs for Autonomous Vehicles and Robotics | NVIDIA Technical Blog

… Native multimodal interaction is achieved through optimized Qwen3-TTS and Qwen3-ASR models, allowing end-to-end, low-latency voice dialogue with a Thinker-Talker framework, and Cosmos Reason 2 enables advanced spatio-temporal reasoning, 3D localization, and long-context processing for humanoid robo… …

Mar 12, 2026 · Lin Chai

How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car | NVIDIA Technical Blog

… From AI factory to in-vehicle deployment: Development of an agentic in-vehicle assistant requires a different workflow than a traditional voice command system. …

May 5, 2026 · Felix Friedmann

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical Blog

… How are NVIDIA and the OSS community accelerating inference for local agentic AI? With agents running 24 hours a day, seven days a week on increasingly complex tasks, efficient local compute matters even more. …

Jun 2, 2026 · Annamalai Chockalingam

NVIDIA RTX Innovations Are Powering the Next Era of Game Development | NVIDIA Technical Blog

… By optimizing for on-device inference alongside game graphics, developers can now deploy more voice agents without the overhead of cloud inference. “High-quality emotional voices within a small memory footprint will scale the number of interactive characters in games,” said Zohaib Ahmed, CEO of Res… …

Mar 10, 2026 · Ike Nnoli

NVIDIA Nemotron AI Models

… How to Build a Voice-Powered RAG Agent Using New Nemotron Models: Get a step-by-step guide on how to build a voice-powered RAG agent by integrating Nemotron models for speech, RAG, safety, and long-context reasoning. …

NVIDIA Technical Blog

… Apr 17, 2026: Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw. Agents are evolving from question-and-answer systems into long-running autonomous assistants that read files, call APIs, and drive multi-step workflows.... …

May 12, 2026
