Paper page - Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation
…specialized agents construct visual-aware plans, collect claim-grounded evidence, maintain source-aligned images in a Visual Working Memory , and compose reports through declarative multimodal tool use . A verifier agent serves as…