2025-10
2025-05
- 2025-05-08 Reading Notes: Qwen Technical Report
- 2025-05-05 Reading Notes: MiniCPM Technical Report
2025-04
- 2025-04-29 Reading Notes: LLaMA Technical Report
- 2025-04-23 Word Embedding Techniques
- 2025-04-22 Understanding Tokenization Methods
2025-03
- 2025-03-20 Rotary Positional Encoding
2025-02
- 2025-02-28 Reading Notes: GPT Series
- 2025-02-16 Distributed Training Basics
- 2025-02-16 Reading Note: Megatron-LM v1
- 2025-02-07 Quantization for NN Inference
- 2025-02-02 Reading Note: TVM
- 2025-02-02 Reading Note: Triton
2025-01
- 2025-01-27 Deep Learning Performance Background
- 2025-01-22 An Architecture Overview of ML Systems
2024-12
- 2024-12-07 CSE221 - lec18: File System Cont.: GFS
- 2024-12-03 CSE221 - lec17: Networking: IX & Snap
2024-11
- 2024-11-13 CSE221 - lec12: File System: FFS & LFS
- 2024-11-12 File System Refresher
- 2024-11-06 CSE221 - lec10: Virtualization: VM370 & Xen
- 2024-11-05 CSE221 - lec09: Extending OS: L4 & Exokernel
2024-10
- 2024-10-04 CSE221 - lec00: Introduction
2024-08
- 2024-08-21 Resource Management in Modern C++
- 2024-08-16 Type Deduction in Modern C++
2024-06
- 2024-06-23 Some Ideas on Research
- 2024-06-02 Brief Notes on Instruction Tuning
2024-03
- 2024-03-25 Unit Test Generation: What to do next?
2024-01
- 2024-01-12 Class Loading in Java
