Distributed Training Basics
Reading Note: Megatron-LM v1
Quantization for NN Inference
Reading Note: TVM
Reading Note: Triton
Moore’s Law, and the future of computing beyond Moore’s Law
Deep Learning Performance Background
Reading Notes: MI300X vs H100 vs H200 Benchmark Part 1: Training – CUDA Moat Still Alive
An Architecture Overview of ML Systems
PMPP Reading Notes
Two ways of adapting LLMs for Recommender Systems
CSE221 - lec18: File System Cont. : GFS
CSE221 - lec17: Networking: IX & Snap
CSE221 - lec16: Networking: RPC & Receive Livelock
CSE221 - lec15: Scalability: RCU & Analysis of Linux Scalability
CSE221 - lec14: Scheduling:Scheduler Activation & Decades of Wasted Cores
CSE221 - lec13: File System Consistency: Soft Updates & Split FS
Reading Notes on NLP Papers
CSE221 - lec12: File System: FFS & LFS
File System Refresher