📚 세현's Vault

🌍 도메인

🔮3D-Vision
🎨Rendering
🤖Robotics
🧠LLM
👁️VLM
🎬GenAI
🥽XR
🎮Simulation
🛠️Dev-Tools
💰Crypto
📈Finance
📋Productivity
📦기타

📄 Papers

📚전체 논문172

❯

❯

Releasing ViTok v2: open source ViT auto encoder codebase + pretrained weights ...

Releasing ViTok-v2: open-source ViT auto-encoder codebase + pretrained weights ...

2026년 1월 20일1 min read

3D-Vision
3DGS
avatar

Philippe Hansen-Estruch (@tokenpilled65B)

2026-01-20 | ❤️ 413 | 🔁 57 | 💬 15

Releasing ViTok-v2: open-source ViT auto-encoder codebase + pretrained weights

Train your own ViT auto-encoder on any streamed (hf://) or local webdataset. NaFlex pipeline handles any resolution and aspect ratio

Includes reproduced 350M and 4.5B models weights competitive at 256p, SOTA at high-res (512p+)

미디어

🔗 Related

what-if-we-could-model-vision-like-a-wave-moving-through — 주제: AI-ML, Dev-Tools
video-models-serve-as-a-good-pretrained-backbone-for-robot — 주제: AI-ML, Dev-Tools
introducing-shaper-a-method-for-robust-conditional-3d-shape — 주제: AI-ML, Dev-Tools
what-if-we-could-train-ai-robots-in-a-perfect-physics — 주제: AI-ML, Dev-Tools
if-youve-ever-tried-to-create-3dgs-scenes-from-photos-taken — 주제: AI-ML, Web/Graphics

Tags

AI-ML Dev-Tools Web-Graphics

그래프 뷰

Philippe Hansen-Estruch (@tokenpilled65B)
미디어
🔗 Related
Tags

백링크

What if we could model vision like a wave moving through space? Researchers fr...
domain-3D-Vision

Created with Quartz v4.5.2 © 2026

GitHub
Sehyeon Park