Pretrained to Imagine, Fine-Tuned to Act: The Rise of World-Action Models | NVIDIA Technical Blog
… After finishing that task, the robot should pick up the orange on the left side of the toaster and stop after it has picked it up.” Context frame and ground-truth rollout: Veo 3.1 generated rollouts zero-shot, no robotics fine-tuning : The generated rollout is surprisingly good for a model that was… …