Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo | NVIDIA Technical Blog
… For some models, dropping prior thinking on turns without tool calls is an established behavior and part of the model’s fine-tuning DeepSeek-R1 is the clearest example . But that same behavior is wrong for interleaved agentic turns, where the prior reasoning explains the tool sequence. …