Search

Showing top 45 results for "Kimi K2"

Kimi K2

Kimi K2 is a large language model service associated with the Kimi series, also referenced as kimi 2.6 or kimi k2.6.

16 articles indexed Last updated 3h ago See topic hub

Videos

Mistral's new agent proofs your code on the cheap

…According to Mistral, Leanstral-120B-A6B outperforms larger (more parameters) open source rivals like GLM5-744B-A40B, Kimi-K2.5-1T-32B, and Qwen3.5-397B-A17B on FLTEval. But perhaps more…

Mar 17, 2026 · Thomas Claburn

Maximize SQL Server on RDS with AMD EPYC™ CPUs

…May 19, 2026 Further Accelerating Kimi-K2.5 on AMD Instinct™ MI325X: W4A8 & W8A8 Quantization with AMD Quark — ROCm Blogs Quantize Kimi-K2.5 to W4A8 and W8A8 using AMD Quark and…

May 19, 2026 · Jeremy Girven

New GPU MODE Virtual Hackathon: E2E Model Speedrun

…Each of the top ten finalists will be awarded $10K prize money and opportunities to win the Grand Prizes. Track 1 – DeepSeek‑R1‑0528 Grand Prize: $350,00 Track 2 – Kimi K2…

Mar 9, 2026 · George Wang

Google just tested a bunch of new AI models for Android app coding – here are the rankings

…Of these, GLM 5.1 scored the highest, followed by Kimi K2.6. You can see the full rankings on Google’s website . Google continues to update the “Android Bench” on a…

May 21, 2026 · Ben Schoon

Discussions and forums

Hacker News · u/heymax054 · 1w ago

DeepSeek V4 Pro and Flash vs. Claude Opus 4.7 and Kimi K2.6

2 1

Hacker News · u/nl · 1w ago

We Tested DeepSeek V4 Pro and Flash Against Claude Opus 4.7 and Kimi K2.6

r/LocalLLaMA · u/APFrisco · 2w ago

Computer build using Intel Optane Persistent Memory - Can run 1 trillion parameter model at over 4 tokens/sec

As the title states, my build is indeed able to run a 1 trillion parameter model (in this case Kimi K2.5) locally at ~4 tokens/second. I thought r/LocalLLaMA would be interested in the build due to that stat line, and al…

r/LocalLLaMA · u/Fragrant-Remove-9031 · 1w ago

Local Qwen 3.6 vs frontier models on a coding primitive: single-file HTML canvas driving animation - results and GIFs

Saw this post comparing Qwen 3.6 variants on coding primitives, so I wanted to see how local quants stack up against frontier models on a similar dense, single-file coding task. I ran the exact same prompt across local a…

Paper page - IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs

…leaderboard -- GPT-5.4 climbs from rank 6 to rank 3 after SV adjustment, while Kimi-k2.5-1T-A32B drops seven positions.Industrial LLM evaluation therefore requires source-grounded, safety-aware…

May 13, 2026

AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted

…Z.ai’s GLM-4.7, Moonshot AI’s Kimi K2.5, and DeepSeek-V3.1. They were not able to say why the models went against their training in this way…

Apr 1, 2026 · Will Knight

Nvidia slaps Groq into new LPX racks for faster AI response

…Even with 256 chips per rack, that's only 128 GB of ultra fast memory, which is nowhere near enough for trillion-parameter models like Kimi K2. At 4-bit precision you…

Mar 16, 2026 · Tobias Mann

The AI engineering stack we built internally — on the platform we ship

…As we described in our Kimi K2.5 launch post , we have a security agent that processes over 7 billion tokens per day on Kimi. That would cost an estimated $2.4M…

Apr 20, 2026 · Ayush Thakur

AI models will deceive you to save their own kind

…models (GPT 5.2, Gemini 3 Flash & Pro, Claude Haiku 4.5, GLM 4.7, Kimi K2.5, and DeepSeek V3.1) in a set of scenarios where an evaluator agent was…

Apr 2, 2026 · Thomas Claburn

The Sweet Spot for High-Speed Memory, Ethernet, Crypto, and Adaptive Performance: AMD Versal™ Prime VM2152 Device

…keep vLLM compatibility while enabling AMD-optimized attention, model execution, and multi-model support including Kimi-K2.5. May 06, 2026 AMD-Powered 3D Gaussian Splatting for Autonomous Driving Scenes — ROCm Blogs…

May 12, 2026 · Chandra Panchada

Followed topics