[A tutorial on LLM - Haifeng Li](https://medium.com/@haifengl/a-tutorial-to-llm-f78dd4e82efc) - Overview covering dataset collection, fine-tuning, transfer learning, types of applications where LLMs can be used, etc.
[Why multi-head self attention works: math, intuitions and 10+1 hidden insights - AI Summer](https://theaisummer.com/self-attention/) - Really good article on the math and intuitions behind multi-head self-attention.
## Systems
[Llama.cpp 30B runs with only 6GB of RAM now (CPU)](https://github.com/ggerganov/llama.cpp/discussions/638#discussioncomment-5492916)