๐Ÿ“š ์„ธํ˜„'s Vault

๐ŸŒ ๋„๋ฉ”์ธ

  • ๐Ÿ”ฎ3D-Vision
  • ๐ŸŽจRendering
  • ๐Ÿค–Robotics
  • ๐Ÿง LLM
  • ๐Ÿ‘๏ธVLM
  • ๐ŸŽฌGenAI
  • ๐ŸฅฝXR
  • ๐ŸŽฎSimulation
  • ๐Ÿ› ๏ธDev-Tools
  • ๐Ÿ’ฐCrypto
  • ๐Ÿ“ˆFinance
  • ๐Ÿ“‹Productivity
  • ๐Ÿ“ฆ๊ธฐํƒ€

๐Ÿ“„ Papers

  • ๐Ÿ“š์ „์ฒด ๋…ผ๋ฌธ172
Home

โฏ

bookmarks

โฏ

the context size of video world models is only a few frames like a human with se

the-context-size-of-video-world-models-is-only-a-few-frames-like-a-human-with-se

2025๋…„ 6์›” 06์ผ1 min read

  • GenAI
  • video-gen
  • text-to-X

Gordon Wetzstein (@GordonWetzstein)

2025-06-06 | โค๏ธ 452 | ๐Ÿ” 53


The context size of video world models is only a few frames. Like a human with severe memory loss! We design a long-term memory for world models based on explicit 3D representations inspired by the human mind. This enables long-term consistency. https://spmem.github.io/ 1/3 https://x.com/GordonWetzstein/status/1930984909755359476/video/1

๋ฏธ๋””์–ด

video


Tags

domain-genai domain-ai-ml


๊ทธ๋ž˜ํ”„ ๋ทฐ

  • Gordon Wetzstein (@GordonWetzstein)
  • ๋ฏธ๋””์–ด
  • Tags

๋ฐฑ๋งํฌ

  • domain-GenAI

Created with Quartz v4.5.2 ยฉ 2026

  • GitHub
  • Sehyeon Park