Search

Showing top 34 results for "local LLM performance" · filtered from 38 indexed

Filtered by topic: LLMs Clear ✕

Top stories

Discussions and forums

r/LocalLLaMA · u/The_Paradoxy · 2w ago

The Qwen 3.6 35B A3B hype is real!!!

My personal test for small local LLM intelligence is to check whether a model has any ability to understand the code that I write for my own academic research. My research is on some pretty niche topics and I doubt that …

r/LocalLLaMA · u/MikeNonect · 2w ago

Getting a feel for how fast X tokens/second really is.

I love following all your adventures with local LLM setups. Quality and size of the models are important, but so is performance. Numbers don't really convey the experienced speed well, however. If someone claims they run…

r/LocalLLaMA · u/gladkos · 3w ago

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

r/LocalLLaMA · u/Signal_Ad657 · 1w ago

M5 vs DGX Spark vs Strix Halo vs RTX 6000

Hey guys, super simple. There have been a lot of online debates about the new M5 Macs vs DGX Sparks vs Strix Halo vs dedicated GPUs etc. So I put them all in a room with good power and cooling and ran everything in paral…

r/LocalLLaMA · u/Porespellar · 2w ago

Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will.

There is a lot of disdain for DGX Sparks here on the sub. And I get it. A lot of people say “It could have been great if it had been better memory bandwidth”, “SM-121 is a fake /second-class Blackwell chip” yadda, yadda.…