CS 685, Spring 2025, UMass Amherst
Assignments
Schedule
Make sure to reload this page to ensure you're seeing the latest version.
Readings should be done before watching the corresponding lecture videos. See this page or this page for materials (videos / slides / reading) from the Spring 2023/2024 offering.
Week 1 (2/3-5): introduction, language modeling
-
- Course introduction // [video] (last year) // [slides]
- HW 0 released here, due 2/14
- Final projects:
- Organize into groups of 4 by 2/14. If you do not know people, you can post your interests in Piazza to find partners. Once decided, please fill https://forms.gle/PKvJRxZkUMgFrkVG8 telling us your team members (please sending only one response per group).
- Proposal due 3/7 on Gradescope, use this Overleaf template
- [optional reading] The Bitter Lesson
Week 2 (2/10-12): neural language models, optimization and backpropagation
Week 3 (2/19): Attention mechanisms
Week 4 (2/24-26): Transformer
Week 5 (3/3-5): Fine-Tuning and Instruction Tuning
- BERT + Instruction tuning // [video] (last year) // [notes] // [slides]
- [reading] BERT: Pre-training of Deep Bidirectional Transformers... (Devlin et al., 2019)
- [reading] Exploring the Limits of Transfer Learning... (Raffel et al., JMLR 2020, "T5")
- [reading] A Closer Look at the Limitations of Instruction Tuning (Ghosh et al., ICML 2024)
Week 6 (3/10-12): LLM Alignment