iPhone 17 Pro Successfully Demonstrated Running A 400B-Parameter Large Language Model, A Feat That Requires A Minimum Of 200GB Of Memory Even When Compressed
… As for how this was accomplished, loading the whole LLM into memory would be impossible, since the iPhone 17 Pro ships with only 12GB of LPDDR5X RAM. Instead, Flash-MoE uses the device’s SSD to stream the model’s data directly to the GPU. …
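
To put the 200GB figure in perspective, here is a rough back-of-the-envelope sketch of the arithmetic. It assumes that "compressed" means roughly 4 bits per parameter, which the article does not state, and it counts only the weights, ignoring activations and any caches. The function name `weights_size_gb` is made up for illustration.

```python
def weights_size_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


PARAMS = 400e9  # 400B parameters, from the headline
RAM_GB = 12     # iPhone 17 Pro RAM, from the article

# The bit widths are assumptions; the article only says "compressed".
for bits in (16, 8, 4):
    size = weights_size_gb(PARAMS, bits)
    ratio = size / RAM_GB
    print(f"{bits:>2}-bit weights: {size:,.0f} GB, about {ratio:.0f}x the available RAM")
```

Under the 4-bit assumption the weights alone come to 200GB, many times more than the phone's 12GB of RAM, which is why the model cannot simply be loaded into memory and has to be read from storage as it runs.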