Unpacking the deceptively simple science of tokenomics
… "If your accuracy loss is too severe, the speed up becomes irrelevant," Salvator said. However, the FP4 data types supported by AMD and Nvidia's latest accelerators use some clever math to vastly expand the number of values that can represent model weights from 16 to more than 4,000. …