Chun-Hsiao (Daniel) Yeh (@danielyehhh)

2025-05-07 | โค๏ธ 79 | ๐Ÿ” 27


โ—๏ธโ—๏ธ Can MLLMs understand scenes from multiple camera viewpoints โ€” like humans?

🧭 We introduce All-Angles Bench: 2,100+ QA pairs on multi-view scenes.

📊 We evaluate 27 top MLLMs, including Gemini-2.0-Flash, Claude-3.7-Sonnet, and GPT-4o.

๐ŸŒ Project: https://danielchyeh.github.io/All-Angles-Bench/ https://x.com/danielyehhh/status/1919926183136838078/photo/1

🔗 Original link

Media

image


Auto-generated - needs manual review

Tags

domain-ai-ml domain-vlm