Zeqi Xiao (@zeqi_xiao)

2025-12-02 | โค๏ธ 38 | ๐Ÿ” 3


How far can video generative models go in visuospatial intelligence? ๐Ÿค”

We propose Video4Spatial, showing that with video-only context, models can:

๐Ÿ—บ๏ธ Plan in 3D and ground objects ๐ŸŽฅ Follow camera-pose instructions ๐Ÿงฑ Maintain strong spatial consistency https://x.com/zeqi_xiao/status/1995992142142161040/video/1

๐Ÿ”— ์›๋ณธ ๋งํฌ

๋ฏธ๋””์–ด

image


Tags

3D AI-ML GenAI