Search: AI memory strategy

Accelerating GPT-OSS-20B on AMD Ryzen™ AI NPUs: Efficient MoE Inference on Strix and Halo

… Memory Allocation Strategy: GPT-OSS-20B has a large memory footprint due to its 20B parameters and QMoE layers, even with INT4 quantization. To run the model on a variety of memory-constrained setups, a dynamic memory allocation scheme is used. …

May 12, 2026 · Client AI Solutions - AI Group

ZenDNN 5.2: Accelerating vLLM V1 Engine and Recommender Systems Inference on AMD EPYC™ CPUs

… Interleave Memory: Access the physical cores in a non-sequential manner to ensure memory bandwidth is distributed across all available DRAM channels, preventing bottlenecks during the decode phase. …

Mar 13, 2026 · Shailen Sobhee

AMD PACE - High-Performance Platform Aware Compute Engine

… MLP: MLP layers use kernel blocking aligned to L1 and L2 cache sizes so that matmuls remain cache-resident and memory bandwidth is used efficiently. …

Apr 8, 2026 · Arjun Muraleedharan
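
The cache-blocking idea mentioned in the snippet above can be illustrated with a minimal sketch. The function name `blocked_matmul` and the `tile` parameter are assumptions made for illustration and are not taken from AMD PACE; a real kernel would choose tile sizes from the actual L1 and L2 capacities and run as optimized native code rather than pure Python.

```python
def blocked_matmul(a, b, tile=2):
    """Multiply a (n x k) by b (k x m), one tile at a time.

    `tile` is a hypothetical block size. In a real kernel it would be
    chosen so the tiles of a, b, and c being worked on fit in cache.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    # Outer loops walk over tiles; inner loops stay inside one tile,
    # so the same small blocks of a, b, and c are reused before moving on.
    for i0 in range(0, n, tile):
        for j0 in range(0, m, tile):
            for p0 in range(0, k, tile):
                for i in range(i0, min(i0 + tile, n)):
                    for p in range(p0, min(p0 + tile, k)):
                        a_ip = a[i][p]
                        for j in range(j0, min(j0 + tile, m)):
                            c[i][j] += a_ip * b[p][j]
    return c
```

The result does not depend on `tile`; only the order in which memory is touched changes, which is what lets a tuned kernel keep its working set cache-resident.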

A progressive approach to accelerating system-level verification for AMD Versal™ adaptive SoC designs

… As a result, significant effort is required to configure and maintain synchronization between simulators. To preserve correct system behavior, clocks, events, memory transactions, and interrupts must be continuously coordinated across QEMU, XSIM, and the AIE simulator. …

May 11, 2026 · Adam Taylor

AI at Scale Starts Here: The AMD Vision Comes Alive at Advancing AI 2025

… Jun 27, 2025. At the Advancing AI 2025 event, AMD unveiled new technology solutions as well as our unparalleled AI portfolio and strategy to help all businesses succeed in the AI era. …

May 19, 2026 · AMD Data Center Insights

Beyond the Hype: Turning AI PC Potential Into Economic Reality

May 11, 2026 · Gaston Sandoval

Day-0 Support for Baidu ERNIE-Image on AMD GPUs

… Environment Setup: MI355X Hardware and Software. Hardware: GPU: AMD Instinct MI355X × 8 (single card used); Architecture: CDNA 4 (gfx950); VRAM per card: 288 GB HBM3e; Host ROCm: 7.2.1. Software: Docker image: rocm/pytorch:rocm7.2.1 ubuntu24.04 py3.12 pytorch release 2.9.1; PyTorch: 2.9… …

May 12, 2026 · AMD AI Group

Compute Defines Scale: How MediaKind and AMD Are Rethinking Video Infrastructure from On‑Prem to Cloud

… Each AMD EPYC 9004 and 9005 server CPU socket supports twelve channels of DDR5 memory and up to 128 lanes of PCIe® Gen 5 I/O; in dual‑socket system configurations, total available I/O can scale beyond a single socket, up to 160 lanes on supported platforms, and depends on system design while still … …

Apr 17, 2026 · Christopher Bellaci

Harvesting Today: AMD™ Cross Functional Operations Dashboards

May 11, 2026 · AMD IT

From Silicon to Cloud: AMD on AWS Essentials for IT Leaders

… VNNI optimizes AI inference for use cases such as image recognition and natural language processing. bfloat16 is a 16-bit floating-point format for deep learning that halves memory use relative to 32-bit floats while keeping the same exponent range. It provides faster, more efficient ML training and inference. …

May 11, 2026 · Jeremy Girven
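
The bfloat16 trade-off described in the snippet above can be made concrete with a short sketch. It relies on the fact that bfloat16 is the upper 16 bits of an IEEE 754 float32 (1 sign bit, 8 exponent bits, 7 mantissa bits). The helper names are assumptions made for illustration, and the conversion here simply truncates, whereas hardware typically rounds to nearest.

```python
import struct

def float32_to_bfloat16_bits(x):
    """Return the 16-bit bfloat16 pattern for x by truncating float32.

    Simplification: drops the low 16 mantissa bits without rounding.
    """
    (bits32,) = struct.unpack("<I", struct.pack("<f", x))
    return bits32 >> 16

def bfloat16_bits_to_float(bits16):
    """Expand a bfloat16 bit pattern back to a Python float."""
    (value,) = struct.unpack("<f", struct.pack("<I", bits16 << 16))
    return value

# 1.0 is exactly representable: sign 0, exponent 0x7F, mantissa 0.
assert float32_to_bfloat16_bits(1.0) == 0x3F80

# A value near the top of the float32 range survives the round trip
# with reduced precision, because the 8 exponent bits are kept.
big = bfloat16_bits_to_float(float32_to_bfloat16_bits(3.0e38))
```

Each value takes 2 bytes instead of 4, which is where the halved memory footprint comes from; the cost is a mantissa of 7 bits instead of 23.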
