📚 Sehyeon's Vault

Tag: captioning

8 items

  • November 16, 2025

    this-is-a-phenomenal-video-by-jbhuang0604-explaining-seminal

    • VLM
    • grounding
    • captioning
  • October 21, 2025

    oh-boy-a-2b-vision-model-seriously-damn

    • VLM
    • VQA
    • captioning
  • October 14, 2025

    excited-to-share-our-new-work-streamingvlm-we-tackle-a

    • VLM
    • VQA
    • captioning
  • October 13, 2025

    streamingvlm-real-time-understanding-for-infinite-video

    • VLM
    • VQA
    • captioning
  • April 15, 2025

    finally-our-report-of-incentivizing-reasoning-in-vlms-is

    • VLM
    • VQA
    • captioning
  • April 14, 2025

    really-great-use-of-multimodal-llms-to-analyze-a-massive

    • VLM
    • VQA
    • captioning
  • June 6, 2024

    fastembed-030-is-here-now-featuring-image-embeddings-resnet5

    • VLM
    • grounding
    • captioning
  • April 5, 2024

    𝐌𝐢𝐧𝐢𝐆𝐏𝐓𝟒-𝐕𝐢𝐝𝐞𝐨-gradio-demo-is-now-available-on-the-spaces-a

    • VLM
    • VQA
    • captioning

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Sehyeon Park