change a bit

This commit is contained in:
Junda Chen 2026-02-01 20:00:45 -08:00
parent ce03157597
commit 2a20ab23ff
2 changed files with 2 additions and 2 deletions

View file

@ -36,7 +36,7 @@ The course can be more accurately divided into three parts (with several additio
- LLM fundamentals: Transformers, Attention, and MoE - LLM fundamentals: Transformers, Attention, and MoE
- LLM training optimizations (e.g., FlashAttention-style techniques) - LLM training optimizations (e.g., FlashAttention-style techniques)
- LLM inference: continuous batching, paged attention, disaggregated prefill/decoding - LLM inference: continuous batching, paged attention, disaggregated prefill/decoding
- Scaling laws, test-time compute and reasoning, and “LLM + X” applications (RAG, search, multimodality, tool use, agents, etc.) - Scaling laws
(Guest lectures cover topics such as ML compilers, LLM pretraining and open science, fast inference, and tool use and agents, serving as complementary extensions.) (Guest lectures cover topics such as ML compilers, LLM pretraining and open science, fast inference, and tool use and agents, serving as complementary extensions.)

View file

@ -38,7 +38,7 @@
- LLM 基础Transformer、Attention、MoE - LLM 基础Transformer、Attention、MoE
- LLM 训练优化FlashAttention 等 - LLM 训练优化FlashAttention 等
- LLM 推理continuous batching、paged attention、disaggregated prefill/decoding - LLM 推理continuous batching、paged attention、disaggregated prefill/decoding
- Scaling law、test-time compute / reasoning以及 “LLM + X”RAG / search / multimodality / tool-use / agents 等) - Scaling law
Guest lecturesML compiler、LLM pretraining/open science、fast inference、tool use & agents 等,作为补充与扩展。) Guest lecturesML compiler、LLM pretraining/open science、fast inference、tool use & agents 等,作为补充与扩展。)