Lior Alexander (@LiorOnAI)
2025-05-01 | ❤️ 696 | 🔁 125
Must read: How to build an LLM inference engine using C++ and CUDA from scratch without libraries.
Andrew Chan shows how to match and beat llama.cpp with a custom Mistral-v0.2 backend.
It runs at 63.8 tok/s (short prompts) and 58.8 tok/s (long prompts) on a single RTX 4090. https://x.com/LiorOnAI/status/1917958210016727248/photo/1
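
The post builds the kernels by hand instead of calling cuBLAS or other libraries. At batch size 1, decode time is dominated by matrix-vector products against the weight matrices, so below is a minimal sketch of that style of kernel: one warp per output row, fp32 accumulation, and a warp-shuffle reduction. All names and sizes are illustrative assumptions, not code from Chan's post.

```cuda
// Sketch of a decode-time matvec kernel, the hot loop of a single-batch
// LLM inference engine: y = W x, one warp per output row.
// Dimensions and names are illustrative, not taken from the post.
#include <cuda_fp16.h>
#include <cstdio>

__global__ void matvec_fp16(const half* __restrict__ W,  // [rows, cols], row-major
                            const half* __restrict__ x,  // [cols]
                            float* __restrict__ y,       // [rows]
                            int rows, int cols) {
  int row  = blockIdx.x * (blockDim.x / 32) + threadIdx.x / 32;  // warp -> row
  int lane = threadIdx.x % 32;
  if (row >= rows) return;

  // Each lane accumulates a strided slice of the dot product in fp32.
  float acc = 0.0f;
  for (int c = lane; c < cols; c += 32)
    acc += __half2float(W[(size_t)row * cols + c]) * __half2float(x[c]);

  // Warp-level tree reduction via shuffles; lane 0 holds the row sum.
  for (int off = 16; off > 0; off >>= 1)
    acc += __shfl_down_sync(0xffffffff, acc, off);
  if (lane == 0) y[row] = acc;
}

int main() {
  const int rows = 4096, cols = 4096;  // e.g. one transformer weight matrix
  half *W, *x; float *y;
  cudaMallocManaged(&W, (size_t)rows * cols * sizeof(half));
  cudaMallocManaged(&x, cols * sizeof(half));
  cudaMallocManaged(&y, rows * sizeof(float));
  for (size_t i = 0; i < (size_t)rows * cols; ++i) W[i] = __float2half(0.001f);
  for (int i = 0; i < cols; ++i) x[i] = __float2half(1.0f);

  const int warps_per_block = 4;  // 128 threads per block
  int blocks = (rows + warps_per_block - 1) / warps_per_block;
  matvec_fp16<<<blocks, warps_per_block * 32>>>(W, x, y, rows, cols);
  cudaDeviceSynchronize();
  printf("y[0] = %f (expect ~%f)\n", y[0], 0.001f * cols);
  return 0;
}
```

A real engine fuses dequantization, bias, and activation into kernels like this and streams the weights through them layer by layer; the sketch only shows the reduction pattern.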
🔗 Original link
Media
