update large language model

2026-06-23 09:58:12 +08:00 · 2024-11-25 22:03:32 -05:00 · 2024-11-25 22:03:32 -05:00 · 17e2b3423b
commit 17e2b3423b
parent e40fd2851a
9 changed files with 59 additions and 47 deletions
--- a/docs/大型语言模型与生成模型/CS11-667.en.md
+++ b/docs/大型语言模型与生成模型/CS11-667.en.md
--- a/docs/大型语言模型与生成模型/CS11-667.md
+++ b/docs/大型语言模型与生成模型/CS11-667.md
--- a/docs/大型语言模型与生成模型/CS194-267.en.md
+++ b/docs/大型语言模型与生成模型/CS194-267.en.md
--- a/docs/大型语言模型与生成模型/CS194-267.md
+++ b/docs/大型语言模型与生成模型/CS194-267.md
--- a/docs/大型语言模型与生成模型/CS324.en.md
+++ b/docs/大型语言模型与生成模型/CS324.en.md
@ -0,0 +1,27 @@
+# Stanford CS324: Large Language Models
+
+## Course Introduction
+
+- **University**: Stanford University
+- **Prerequisites**: None specified, but familiarity with natural language processing and machine learning is recommended.
+- **Course Difficulty**: 🌟🌟🌟🌟🌟
+- **Course Website**: [CS324: Large Language Models](https://stanford-cs324.github.io/winter2022/)
+
+"CS324: Large Language Models" is a graduate-level course at Stanford University that explores the fundamentals, theory, ethics, and system aspects of large language models (LLMs). The course also offers hands-on experience in evaluating and building these models.
+
+Massive pre-trained LLMs have revolutionized the field of natural language processing (NLP), enabling state-of-the-art performance across numerous tasks and demonstrating the ability to generate fluent text and perform few-shot learning. However, these models are challenging to interpret and introduce new ethical and scalability concerns. This course provides students with a comprehensive understanding of LLMs and practical exposure through projects and paper discussions.
+
+### Coursework
+
+1. **Paper Reviews and Discussions** (20%)
+   - Write reviews for assigned papers.
+   - Participate in at least two student panels to lead discussions.
+
+2. **Projects** (2 × 40% = 80%)
+   - **Project 1**: Evaluate the capabilities and risks of language models (e.g., GPT-3).
+   - **Project 2**: Build and improve language models using tools like BERT-base.
+
+
+---
+
+
--- a/docs/大型语言模型与生成模型/CS324.md
+++ b/docs/大型语言模型与生成模型/CS324.md
@ -0,0 +1,28 @@
+# Stanford CS324: Large Language Models
+
+## 课程简介
+
+- **所属大学**: 斯坦福大学
+- **先修要求**: 未指定，但建议具备自然语言处理和机器学习的基础知识。
+- **课程难度**: 🌟🌟🌟🌟🌟
+- **课程网站**: [CS324: 大型语言模型](https://stanford-cs324.github.io/winter2022/)
+
+《CS324: 大型语言模型》是斯坦福大学开设的一门研究生课程，探讨大型语言模型（LLMs）的基础理论、伦理问题、系统架构等方面，并提供评估和构建这些模型的实践机会。
+
+大规模预训练的 LLMs 革新了自然语言处理（NLP）领域，实现了多个任务的最先进性能，并展现了生成流畅文本和少样本学习的能力。然而，这些模型难以理解，同时带来了新的伦理和可扩展性挑战。本课程旨在让学生全面理解 LLMs，并通过项目和论文讨论获得实践经验。
+
+### 课程作业
+
+1. **论文评审与讨论**（20%）
+   - 撰写所分配论文的评审。
+   - 至少参与两次学生小组讨论并主导讨论。
+
+2. **项目**（2 × 40% = 80%）
+   - **项目 1**: 评估语言模型（例如 GPT-3）的能力与风险。
+   - **项目 2**: 使用 BERT-base 等工具构建和改进语言模型。
+
+项目需以 1-2 人小组完成，使用清晰的格式编排，并以 PDF 提交。截止时间为每次晚上 11:00 PST，通过 Gradescope 提交。
+
+
+---
+
--- a/docs/大语言与深度生成模型/CS194.en.md
+++ b/docs/大语言与深度生成模型/CS194.en.md
@ -1,24 +0,0 @@
-# UC Berkeley CS 194/294-267: Understanding Large Language Models: Foundations and Safety
-
-## Course Introduction
-
- **University**: UC Berkeley
- **Prerequisites**: CS 182/282A Deep Neural Networks or equivalent, with hands-on deep learning experience.
- **Course Difficulty**: 🌟🌟🌟🌟🌟🌟
- **Course Website**: [Understanding Large Language Models](http://rdi.berkeley.edu/understanding_llms/s24)
- **Lecture Video**：<https://www.youtube.com/playlist?list=PLJ66BAXN6D8H_gRQJGjmbnS5qCWoxJNfe>
-
-"Understanding Large Language Models: Foundations and Safety" is a Spring 2024 course at UC Berkeley, co-taught by **Professor Dawn Song** and **Dan Hendrycks**, with **Yu Gai** as the GSI. This course explores the foundational principles, interpretability, scaling laws, and risks associated with large language models (LLMs) such as ChatGPT.
-
-The course provides a rigorous introduction to LLMs, discussing their emergence, limitations, and potential risks, as well as methods for safer and more beneficial applications. Topics covered include:
-
- Foundations of LLMs
- Interpretability
- Scaling laws
- Adversarial robustness
- AI alignment and governance
- Privacy, watermarking, and Trojans
- Agency, emergence, reasoning, and mathematics
- Evaluation and benchmarking
-
-
--- a/docs/大语言与深度生成模型/CS194.md
+++ b/docs/大语言与深度生成模型/CS194.md
@ -1,23 +0,0 @@
-# UC Berkeley CS 194/294-267： Understanding Large Language Models: Foundations and Safety
-
-## 课程简介
-
- **所属大学**：UC Berkeley
- **先修要求**：CS 182/282A 深度神经网络课程或等效课程，并具有深度学习的实际操作经验。
- **课程难度**：🌟🌟🌟🌟🌟🌟
- **课程网站**：<http://rdi.berkeley.edu/understanding_llms/s24>
- **课程视频**：<https://www.youtube.com/playlist?list=PLJ66BAXN6D8H_gRQJGjmbnS5qCWoxJNfe>
-
-《理解大型语言模型：基础与安全》是 UC Berkeley 在 2024 年春季开设的一门课程，由 **Dawn Song 教授** 和 **Dan Hendrycks** 联合授课，助教为 **Yu Gai**。本课程重点探讨大型语言模型（如 ChatGPT）的基础原理、可解释性、扩展定律以及相关风险。
-
-课程内容包括：
-
- 大型语言模型的基础
- 可解释性
- 扩展定律
- 对抗鲁棒性
- 人工智能对齐与治理
- 隐私、水印与木马
- 代理性、涌现性、推理与数学
- 评估与基准测试
-
--- a/mkdocs.yml
+++ b/mkdocs.yml
@ -280,4 +280,8 @@ nav:
      - "Columbia STAT 8201: Deep Generative Models": "机器学习进阶/STAT8201.md"
      - "U Toronto STA 4273 Winter 2021: Minimizing Expectations": "机器学习进阶/STA4273.md"
      - "Stanford STATS214 / CS229M: Machine Learning Theory": "机器学习进阶/CS229M.md"
+  - 大型语言模型与生成模型:
+    - "CMU CS11-667: Large Language Models Methods and Application": "大型语言模型与生成模型/CS11-667.md"
+    - "Stanford CS324: Large Language Models" : "大型语言模型与生成模型/CS324.md"
+    - "UCB CS194/294-267: Understanding Large Language Models": "大型语言模型与生成模型/CS194-267.md"
  - 后记: "后记.md"