Paper page - InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search
…below 50% overall accuracy, highlighting challenges in visual evidence seeking , search control , and multimodal evidence integration. We release the benchmark data and evaluation code at https://github.com/hbhalpha/InterLV-Search-Bench…