Tenny Yin (@tennyyin)

2025-06-11 | ❤️ 200 | 🔁 52


🔎Can robots search for objects like humans? Humans explore unseen environments intelligently—using prior knowledge to actively seek information and guide search. But can robots do the same? 👀

🚀Introducing WoMAP (World Models for Active Perception): a novel framework for embodied open-vocabulary object localization that combines the reasoning power of VLMs 🧠with the physical grounding capabilities of world models 🌎. 🌐 https://robot-womap.github.io/ 🧵(1/N)


See similar notes in domain-robotics, domain-vlm, domain-dev-tools

Tags

type-tutorial domain-robotics, domain-vlm, domain-dev-tools