Lifan Sun's blog

Lifan Sun's blog, Welcome to my blog.

  • Blog
  • About
  • RSS
  • Search
  • NLP (16)
  • MLSys (14)
  • RecSys (1)
  • System (22)
  • C++ (2)
  • daily-life (1)
  • SE-Paper Reading (6)
  • LLVM (1)
  • Java (1)

Reading Notes: Qwen Technical Report

May 7, 2025

Reading Notes Collections: Context Length Extrapolation

May 6, 2025

Reading Notes: MiniCPM Technical Report

May 4, 2025

Reading Notes: LLaMA Technical Report

Apr 28, 2025

Word Embedding Techniques

Apr 22, 2025

Understanding Tokenization Methods

Apr 21, 2025

Rotatry Positional Encoding

Mar 19, 2025

Reading Notes: “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”

Mar 2, 2025

Reading Notes: “Training Compute-Optimal Large Language Models”

Mar 1, 2025

Reading Notes: “GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints”

Feb 28, 2025

Reading Notes: GPT Series

Feb 27, 2025

Two ways of adapting LLMs for Recommender Systems

Jan 15, 2025

Reading Notes: “Annotated Transformer”

Jul 26, 2024

Introduction to GenAI 2024 Spring, Course Notes

Jul 9, 2024

Reading Note: “WizardLM: Empowering Large Language Models to Follow Complex Instructions”

Jun 7, 2024

Brief Notes on Instruction Tuning

Jun 1, 2024


© Lifan Sun 2023 - 2025