With Its IPO Done, Cerebras Can Get Back To Pushing The AI Envelope
… What is clear is that the compute to SRAM ratio on the WSE-3 waferscale processors is wrong for low latency inference. There are two ways to fix this. Shrink the process, cut back on the compute, and jack up the SRAM. …