Lifan Sun's blog

Lifan Sun's blog, Welcome to my blog.

  • Blog
  • About
  • RSS
  • Search
  • NLP (16)
  • MLSys (14)
  • RecSys (1)
  • System (22)
  • C++ (2)
  • daily-life (1)
  • SE-Paper Reading (6)
  • LLVM (1)
  • Java (1)

Reading Notes: Qwen Technical Report

May 7, 2025

Reading Notes Collections: Context Length Extrapolation

May 6, 2025

Reading Notes: MiniCPM Technical Report

May 4, 2025

Reading Notes: LLaMA Technical Report

Apr 28, 2025

Word Embedding Techniques

Apr 22, 2025

Understanding Tokenization Methods

Apr 21, 2025

Rotatry Positional Encoding

Mar 19, 2025

Reading Notes: “FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness”

Mar 8, 2025

Reading Notes: “Efficient Memory Management for Large Language Model Serving with PagedAttention”

Mar 7, 2025

Reading Notes: “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”

Mar 2, 2025

Reading Notes: “Training Compute-Optimal Large Language Models”

Mar 1, 2025

Reading Notes: “GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints”

Feb 28, 2025

Reading Notes: GPT Series

Feb 27, 2025

Reading Notes: “Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning”

Feb 23, 2025

Reading Notes: “GPipe: Easy Scaling with Micro-Batch Pipeline”

Feb 22, 2025

Distributed Training Basics

Feb 15, 2025

Reading Note: Megatron-LM v1

Feb 15, 2025

Quantization for NN Inference

Feb 6, 2025

Reading Note: TVM

Feb 1, 2025

Reading Note: Triton

Feb 1, 2025


© Lifan Sun 2023 - 2025