This repository documents my personal journey in learning about large language models (LLMs) from scratch. Please note that I have only a basic understanding of machine learning (ML) and deep learning (DL).
The ultimate goal is to stay updated with current research on LLM Systems, with a focus on the system aspects rather than the LLM itself.
However, you should first understand how LLMs work before delving into the system of LLMs, right?
- Milestone #1: Understanding How LLM Works.
My personal notes.
- Introduction to LLM.
- Advanced NLP Fall 2024. Home page, Youtube
- Large Language Models: Methods and Applications / Fall 2024. Home page
- Large Language Model Systems, Spring 2024. Home page
- Introduction to Generative AI 2024 Spring. Home page , Youtube
- Speech and Language Processing (3rd ed. draft). Link
- Build a Large Language Model (from Scratch).
- 深度学习论文精读 by 李沐。 Github
- The spelled-out intro to neural networks and backpropagation: building micrograd, by Andrej Karpathy. Youtube
- Let's build GPT: from scratch, in code, spelled out, by Andrej Karpathy. Youtube