Search

Showing top 87 results for "Kimi K2"

Kimi K2

Kimi K2 is a large language model service associated with the Kimi series, also referenced as kimi 2.6 or kimi k2.6.

31 articles indexed Last updated just now See topic hub

Videos

AI Search: the search primitive for your agents

…We're using Kimi K2.5 as the LLM via Workers AI . The model decides when to call the tools based on the conversation: import { AIChatAgent, type OnChatMessageOptions } from "@cloudflare/ai-chat…

Apr 16, 2026 · Gabriel Massadas

NVIDIA AI Robots Learn To Install Graphics Cards Without Human Help

…Codex on GPT-5.5, Claude Code on Opus 4.7, and Kimi Code on Kimi K2.6. Each pitched ideas, tested them on hardware, and kept only what improved. The result…

Jun 18, 2026 · Tim Sweezy

KI-Forschung: LLMs glauben Lügen trotz expliziter Warnung - Golem.de

…Für ihr Experiment haben die Wissenschaftler die KI-Modelle Qwen 3.5-35B-A3B, Kimi K2.5 und GPT-4.1 mit hanebüchenen Falschinformationen gefüttert – unter anderem, dass der Sänger Ed Sheeran…

May 29, 2026 · Tobias Költzsch

Mistral's new agent proofs your code on the cheap

…According to Mistral, Leanstral-120B-A6B outperforms larger (more parameters) open source rivals like GLM5-744B-A40B, Kimi-K2.5-1T-32B, and Qwen3.5-397B-A17B on FLTEval. But perhaps more…

Mar 17, 2026 · Thomas Claburn

Discussions and forums

Hacker News · u/heymax054 · May 15, 2026

DeepSeek V4 Pro and Flash vs. Claude Opus 4.7 and Kimi K2.6

2 1

Hacker News · u/nl · May 15, 2026

We Tested DeepSeek V4 Pro and Flash Against Claude Opus 4.7 and Kimi K2.6

r/LocalLLaMA · u/APFrisco · May 11, 2026

Computer build using Intel Optane Persistent Memory - Can run 1 trillion parameter model at over 4 tokens/sec

As the title states, my build is indeed able to run a 1 trillion parameter model (in this case Kimi K2.5) locally at ~4 tokens/second. I thought r/LocalLLaMA would be interested in the build due to that stat line, and al…

Hacker News · u/ramonga · 4w ago

Show HN: Free open source coding models in Slack

Hey HN,We believe we have the easiest onboarding from signup to being able to spin up coding agents in slack like Stripe, Ramp & Coinbase.Demo of the onboarding: https://www.tella.tv/video/connecting-cord-to-slack-1-19ep…

r/LocalLLaMA · u/Fragrant-Remove-9031 · May 16, 2026

Local Qwen 3.6 vs frontier models on a coding primitive: single-file HTML canvas driving animation - results and GIFs

Saw this post comparing Qwen 3.6 variants on coding primitives, so I wanted to see how local quants stack up against frontier models on a similar dense, single-file coding task. I ran the exact same prompt across local a…

KI-Inferencing: US-deutsches Start-up will Nvidia ausstechen

…Er eignet sich laut Tensordyne für gängige KI-Modelle wie Kimi K2.6, DeepSeek-R1/V4 Pro, Llama3.1 405B, Mixtral 8x22B, GPT-OSS-120B und Qwen 80B. Zum Vergleich: Nvidia will…

Jun 15, 2026 · Christof Windeck

Retail Edge AMD EPYC 4000 Supermicro AS-1116R-FN4 AS-E300-14GR

…May 15, 2026 Further Accelerating Kimi-K2.5 on AMD Instinct™ MI325X: W4A8 & W8A8 Quantization with AMD Quark — ROCm Blogs Quantize Kimi-K2.5 to W4A8 and W8A8 using AMD Quark and…

May 27, 2026 · Jerry Baldock

ROCm Certification Associate: Architecture, Programming, and Optimization

…The workshop will also showcase practical optimization techniques for improving end-to-end serving performance of the Kimi K2.5 model using optimized FlyDSL Mixture-of-Experts (MoE) kernels.;This advanced hands…

Jul 22, 2026

Maximize SQL Server on RDS with AMD EPYC™ CPUs

…May 19, 2026 Further Accelerating Kimi-K2.5 on AMD Instinct™ MI325X: W4A8 & W8A8 Quantization with AMD Quark — ROCm Blogs Quantize Kimi-K2.5 to W4A8 and W8A8 using AMD Quark and…

May 19, 2026 · Jeremy Girven

AI coding agents taught robots how to install GPUs and cut zip ties

…5, Anthropic’s Claude Code with Opus 4.7, and Moonshot AI’s Kimi Code with Kimi K2.6. Teams of the coding agents independently developed different algorithmic approaches to robot training…

Jun 17, 2026 · Jeremy Hsu

New GPU MODE Virtual Hackathon: E2E Model Speedrun

…Each of the top ten finalists will be awarded $10K prize money and opportunities to win the Grand Prizes. Track 1 – DeepSeek‑R1‑0528 Grand Prize: $350,00 Track 2 – Kimi K2…

Mar 9, 2026 · George Wang

Followed topics