Google's Gemma 4 AI models get 3x speed boost by predicting future tokens
…
Faster local inference right now
Google has released new versions of the Gemma 4 models with a multi-token prediction (MTP) drafter that you can try today. Google says the MTP drafter can make Gemma models up to three times faster, though the actual gain depends on the hardware you run them on. …
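The excerpt does not describe how the drafter works internally. As a rough illustration of why a drafter can cut the number of expensive model passes, and why the gain varies, here is a minimal sketch of generic greedy draft-and-verify decoding. It is not Gemma's implementation or API: `speculative_generate`, `target`, `good_drafter`, and `poor_drafter` are hypothetical names, and the "models" are toy functions that map a token list to the next token.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]  # toy stand-in for a model


def speculative_generate(
    target: NextToken,
    drafter: NextToken,
    prompt: List[Token],
    max_new_tokens: int,
    draft_len: int = 4,
) -> Tuple[List[Token], int]:
    """Greedy draft-and-verify loop (illustrative assumption, not Gemma's code).

    Returns the generated tokens and the number of verification rounds.
    A real system would score all drafted positions in one batched pass of
    the large model; here each round is simply counted as one pass.
    """
    tokens = list(prompt)
    target_passes = 0
    while len(tokens) - len(prompt) < max_new_tokens:
        # 1. The cheap drafter guesses several future tokens.
        draft: List[Token] = []
        for _ in range(draft_len):
            draft.append(drafter(tokens + draft))

        # 2. The target checks the guesses in a single round.
        target_passes += 1
        accepted: List[Token] = []
        for guess in draft:
            correct = target(tokens + accepted)
            accepted.append(correct)  # always keep the target's own token
            if guess != correct:
                break  # first wrong guess ends the round
        else:
            # Every guess matched, so the round also yields one extra token.
            accepted.append(target(tokens + accepted))
        tokens.extend(accepted)
    return tokens[: len(prompt) + max_new_tokens], target_passes


# Toy models: the target counts upward modulo 10.
def target(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10


def good_drafter(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10  # always agrees with the target


def poor_drafter(ctx: List[Token]) -> Token:
    return 0  # rarely agrees with the target


out_good, passes_good = speculative_generate(target, good_drafter, [0], 12)
out_poor, passes_poor = speculative_generate(target, poor_drafter, [0], 12)
print(passes_good, passes_poor)
```

In this toy setup both runs produce the same tokens as the target would on its own, but the run with the accurate drafter needs far fewer verification rounds. Real speedups also depend on how cheap the drafter is and on the hardware, which fits the caveat that actual gains vary.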
