rsasaki0109 (@rsasaki0109)

2025-12-15 | โค๏ธ 180 | ๐Ÿ” 30


OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer https://github.com/Livioni/OmniVGGT-official OmniVGGT is a spatial foundation model that can effectively benefit from an arbitrary number of auxiliary geometric modalities (depth, camera intrinsics and pose) to obtain high-quality 3D geometric results. Experimental results show that OmniVGGT achieves state-of-the-art performance across various downstream tasks and further improves performance on robot manipulation tasks.

๐Ÿ”— ์›๋ณธ ๋งํฌ

๋ฏธ๋””์–ด

image


Tags

3D Robotics AI-ML