- Implementing the Simplest Neural Network
- Implementing a Multilayer Neural Network
- Challenges of LLM training at scale
- Computational requirements of LLM training
- Data processing for LLM training
- Scaling laws for LLM training
- Blog: A closer look at "training" a trillion-parameter model on Frontier
- Blog: LLM training without a parallel file system