Search

Showing top 17 results for "DeepSeek" · filtered from 25 indexed

All sources huggingface.co 3 techcrunch.com 2 tweakers.net 2 tomsguide.com 1 theverge.com 1 engadget.com 1 developer.nvidia.com 1 tweaktown.com 1 cnet.com 1 amd.com 1 wired.com 1 anthropic.com 1

Tracked topic

DeepSeek

28 articles indexed Last updated 3d ago See topic hub

Videos

More videos

DeepSeek V4 AI Beats Billion Dollar Systems…For Free Two Minute Papers 2w ago DeepSeek Just Fixed One Of The Biggest Problems With AI Two Minute Papers Mar 24, 2026 My Honest Thoughts about Deepseek Matthew Berman 4w ago DeepSeek’s New AI Is A Game Changer Two Minute Papers 3d ago GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies AI Explained Apr 24, 2026 Deepseek is a Problem Matthew Berman 3w ago [GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Yannic Kilcher Jan 26, 2025 You Are Being Told Contradictory Things About AI AI Explained Dec 5, 2025

How to deploy and fine-tune DeepSeek models on AWS

…I have been trying to deploy deepseek-ai/DeepSeek-R1-Distill-Qwen-32B on inferentia with a context window higher than 4096 (let's say MAX_TOTAL_TOKENS=8192 ), but it seems…

Mar 27, 2025 · Simon Pagezy

Open-R1: a fully open reproduction of DeepSeek-R1

…https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/modeling_deepseek.py Is it possible to contribute to this project? · Yes, you can look at https://huggingface.co/open-r1 and https…

Mar 27, 2025 · Elie Bakouch

5 AI Models Tried to Scam Me. Some of Them Were Scary Good

…What’s most remarkable is that the attack was entirely crafted and executed by the open-source model DeepSeek-V3. The model crafted the opening gambit then responded to replies in ways…

Apr 22, 2026 · Will Knight

A “diff” tool for AI: Finding behavioral differences in new models

…GPT-OSS-20B vs DeepSeek-R1-0528-Qwen3-8B We also compared a more powerful open-source model, OpenAI's GPT-OSS-20B , to DeepSeek's model DeepSeek-R1-0528-Qwen3-8B…

Mar 13, 2026