Tiancheng Zhao (Tony) (@tianchezhao)

2025-04-15 | ❤️ 146 | 🔁 38


Finally, our report on incentivizing reasoning in VLMs is out!

  1. Open-source VLM-R1 framework 🔥
  2. Reward engineering tricks that unlock emergent “aha!” moments 💡
  3. Analysis of OOD generalization: RL vs SFT tradeoffs and many more 📊

https://huggingface.co/papers/2504.07615

🔗 Original link


Auto-generated - needs manual review

Tags

domain-vlm domain-dev-tools domain-visionos