Search: AI cost and memory

We Need A Proper AI Inference Benchmark Test

… Disk arrays are very costly. You can get the same I/O from a couple of flash cards today, but they ain’t cheap, either. And getting more expensive by the minute, in fact, just like main memory. …

Mar 9, 2026 · Timothy Prickett Morgan

Nvidia Finally Admits Why It Shelled Out $20 Billion For Groq

… There will have to be variants of Google TPUs and Amazon Trainiums aimed specifically at agentic AI inference and presenting a better balance between memory bandwidth and compute while also not sacrificing memory capacity. …

Mar 17, 2026 · Timothy Prickett Morgan

Eridu Cuts To The AI Networking Chase With High Radix Switch System

… If you want to do a full accounting of the network costs for AI hardware, companies also have to beef up their front end networks linking application to AI clusters, too, because these are generally painfully slow compared to the scale out networks that comprise the outer rings or branches dependin… …

Mar 11, 2026 · Timothy Prickett Morgan

Riding The Memory Boom And Trying To Avoid The Bust

… If you take this high-end and low-cost server memory out of the mix along with HBM, then the remaining DRAM business accounted for $8.99 billion in my model, up 128.2 percent. Still not bad, and illustrative of the across-the-board memory boom we are in. …

Mar 24, 2026 · Timothy Prickett Morgan

Taalas Etches AI Models Onto Transistors To Rocket Boost Inference

… By doing so, you can also radically simplify the architecture of the AI device and, the way that Taalas has done it, you can eliminate the wall between compute and memory that plagues all serial and parallel compute engines – and especially GPUs and AI XPUs that have had to resort to HBM stacked DR… …

Feb 19, 2026 · Timothy Prickett Morgan

The $2 Billion Nvidia Deal With Marvell Is About A Lot More Than NVLink Fusion

… But with the current costs of a rack of Grace-Blackwell compute and the expected – and much higher – cost of a rack of future Vera-Rubin compute, the cash coming in is adding up to the cash being invested in the chippery ecosystem by AI industry juggernaut Nvidia. …

Mar 31, 2026 · Timothy Prickett Morgan

Oracle First In Line For AMD “Altair” MI450 GPUs, “Helios” Racks

… This MI450 chip – really a bunch of chiplets that look like a single unit, as has been the case for AMD datacenter GPUs for many generations now – has its compute streaming processors etched using 2 nanometer processes from Taiwan Semiconductor Manufacturing Co and is expected to be able to process… …

Oct 14, 2025 · Timothy Prickett Morgan

SambaNova Pits Its Engineering Against Nvidia For Agentic AI

… That last bit is going to have to change, particularly as GenAI goes mainstream and the bargain power that enterprises, governments, academic institutions, and sovereigns is a lot lower than that of the hyperscalers, cloud builders, and model builders. ai sambanova ai inference sn50 nvidia

Feb 25, 2026 · Timothy Prickett Morgan

Dell Bulks Up Hardware As AI Infrastructure Shifts To On-Premises

… Samsung is using Dell AI technology in its semiconductor design, manufacturing, and automation work, while Honeywell began partnering with Dell and Nvidia last year for some of its AI operations. “AI is fueling a renaissance in enterprise hardware, a shift from bits back to atoms,” Dell said. “The … …

May 19, 2026 · Jeff Burt

With TPU 8, Google Makes GenAI Systems Much Better, Not Just Bigger

… But not only do GenAI training and inference and the other workloads each respective TPU 8 chip is handling have different needs when it comes to processing, they have very different needs for SRAM memory capacity and HBM memory and memory bandwidth. …

Apr 24, 2026 · Timothy Prickett Morgan

Followed topics