Lifan Sun's blog

Lifan Sun's blog, Welcome to my blog.

Reading Notes: Qwen Technical Report

Reading Notes Collections: Context Length Extrapolation

Reading Notes: MiniCPM Technical Report

Reading Notes: LLaMA Technical Report

Word Embedding Techniques

Understanding Tokenization Methods

Rotatry Positional Encoding

Reading Notes: “Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer”

Reading Notes: “Training Compute-Optimal Large Language Models”

Reading Notes: “GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints”

Reading Notes: GPT Series

Two ways of adapting LLMs for Recommender Systems

Reading Notes: “Annotated Transformer”

Introduction to GenAI 2024 Spring, Course Notes

Reading Note: “WizardLM: Empowering Large Language Models to Follow Complex Instructions”

Brief Notes on Instruction Tuning

© Lifan Sun 2023 - 2025