Sebastian Raschka (@rasbt)

2024-10-05 | ❤️ 3299 | 🔁 586


The Llama 3.2 1B and 3B models are my favorite LLMs: small but very capable. If you want to understand what the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn): https://github.com/rasbt/LLMs-from-scratch/blob/main/ch05/07_gpt_to_llama/converting-llama2-to-llama3.ipynb https://x.com/rasbt/status/1842548690256384278/photo/1

Media

photo


Tags

domain-llm