Hacker News
· u/Hey1-Arthur
· Mar 22, 2026
I'm 11 and trained a custom MoE LLM for $1
# I'm 11 years old and I trained my own LLM from scratch. 50 people downloaded it in 24 hours. Hey r/LocalLLaMA, I'm Arthur, I'm 11 years old, and I just released *Wind Arc 1.6*, a custom architecture LLM I built and trai…
r/LocalLLaMA
· u/No-Selection2972
· 1w ago
Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server
Just saw Xiaomi MiMo announce MiMo-V2.5-Pro UltraSpeed, claiming they broke the 1,000 tokens/sec output barrier on a 1 trillion parameter MoE model. According to them, they’re doing it on a single standard 8-GPU node, no…