📚 세현's Vault

🌍 도메인

🔮3D-Vision
🎨Rendering
🤖Robotics
🧠LLM
👁️VLM
🎬GenAI
🥽XR
🎮Simulation
🛠️Dev-Tools
💰Crypto
📈Finance
📋Productivity
📦기타

📄 Papers

📚전체 논문172

❯

❯

thrilled to announce our icml25 paper why is spatial

thrilled-to-announce-our-icml25-paper-why-is-spatial

2025년 5월 02일1 min read

VLM
grounding
VQA

Shiqi Chen (@shiqi_chen17)

2025-05-02 | ❤️ 295 | 🔁 41

🚀🔥 Thrilled to announce our ICML25 paper: “Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas”!

We dive into the core reasons behind spatial reasoning difficulties for Vision-Language Models from an attention mechanism view. 🌍🔍

Paper: https://arxiv.org/pdf/2503.01773 Code: https://github.com/shiqichen17/AdaptVis Website: https://shiqichen17.github.io/AdaptVis/

🔗 원본 링크

https://arxiv.org/pdf/2503.01773
https://github.com/shiqichen17/AdaptVis
https://shiqichen17.github.io/AdaptVis/

미디어

🔗 Related

Auto-generated - needs manual review

Tags

domain-ai-ml domain-vlm domain-dev-tools domain-visionos

그래프 뷰

Shiqi Chen (@shiqi_chen17)
🔗 원본 링크
미디어
🔗 Related
Tags

백링크

domain-VLM

Created with Quartz v4.5.2 © 2026

GitHub
Sehyeon Park