Jianyuan (@jianyuan_wang)
2025-03-17 | โค๏ธ 1355 | ๐ 195
Introducing VGGT (CVPRโ25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds! No expensive optimization needed, yet delivers SOTA results for:
โ Camera Pose Estimation โ Multi-view Depth Estimation โ Dense Point Cloud Reconstruction โ Point Tracking
Project Page: https://vgg-t.github.io/
Code & Weights: https://github.com/facebookresearch/vggt/
๐ ์๋ณธ ๋งํฌ
๋ฏธ๋์ด
![]()
๐ Related
Auto-generated - needs manual review
Tags
domain-vision-3d domain-llm domain-dev-tools domain-visionos