Paper page - Advancing Creative Physical Intelligence in Large Multimodal Models
…Each instance presents a scenario image with structured views of candidate entities and their parts, enabling fine-grained, interactive evaluation of how models iteratively inspect the scene, identify relevant affordances, and compose…