rsasaki0109 (@rsasaki0109)
2025-12-15 | โค๏ธ 180 | ๐ 30
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer https://github.com/Livioni/OmniVGGT-official OmniVGGT is a spatial foundation model that can effectively benefit from an arbitrary number of auxiliary geometric modalities (depth, camera intrinsics and pose) to obtain high-quality 3D geometric results. Experimental results show that OmniVGGT achieves state-of-the-art performance across various downstream tasks and further improves performance on robot manipulation tasks.
๐ ์๋ณธ ๋งํฌ
๋ฏธ๋์ด
