Deep learning, represented by large language models, is revolutionizing human lives. However, trustworthiness threats in deep learning widely exist, posing great challenges to AI safety, security, and reliability. This course introduces state-of-the-art frontiers on deep learning research for a wide range of trustworthiness issues, including threat discovery, mitigation, and certification methods through seminar-style presentations and hands-on projects.
This is a seminar-style course for trustworthy deep learning. The first half of the course is an overview of deep learning and preliminaries for trustworthy AI methods, including training of neural networks, common neural network architectures, large language models, the definition of AI attacks, defenses, and certification and verification in the context of AI. The second half of the course visits representative and recent research papers in the field through student presentations, covering topics like evasion attacks and defenses, robustness certification, differential privacy, membership inference attacks, watermarks, detection of AI-generated contents, machine unlearning, prompt injection attacks, model stealing, and finetuning attacks.
Schedule and Syllabus #
Slides will be updated as the term progresses. All slides are available in this OneDrive folder. The slides are password encrypted - password posted on CourSys.
| Week | Date | Topics (Tentative) | Assignment & Due | Reading |
|---|---|---|---|---|
| Week 1 (5/10 - 5/16) | Tue (5/12) 1h | (Lecture) Syllabus, Introduction to Deep Learning I | Homework 0 Release | See References in slides |
| Thur (5/14) 2h | Lecture cancelled | |||
| Week 2 (5/17 - 5/23) | Tue (5/19) 1h | (Lecture) Introduction to Deep Learning II | See References in Slides | |
| Thur (5/21) 2h | (Lecture) Introduction to Deep Learning III | See References in Slides | ||
| Week 3 (5/24 - 5/30) | Tue (5/26) 1h | (Lecture) Introduction to Deep Learning III (Continued) | See References in Slides | |
| Thur (5/28) 2h | (Lecture) Course Presentation Instructions; Introduction to Deep Learning IV; Trustworthy Deep Learning Overview | Presentation Signing-up Sheet Release Homework 0 Due (5/30) |
See References in Slides | |
| Week 4 (5/31 - 6/6) | Tue (6/2) 1h | (Lecture) Robustness Threats in Deep Learning - Attacks I | See References in Slides | |
| Thur (6/4) 2h | (Lecture) Robustness Threats in Deep Learning - Attacks II | See References in Slides | ||
| Week 5 (6/7 - 6/13) | Tue (6/9) 1h | (Lecture) Robustness Threats in Deep Learning - Defenses | See References in Slides | |
| Thur (6/11) 2h | (Lecture) Robustness Threats in Deep Learning - Certification I | Course Project Release: Development kit | See References in Slides | |
| Week 6 (6/14) - 6/20) | Tue (6/16) 1h | (Lecture) Introduction to Course Project | See References in Slides | |
| Thur (6/18) 2h | (Lecture) Robustness Threats in Deep Learning - Certification II | Presentation Signing-up Due | See References in Slides | |
| Week 7 (6/21 - 6/27) | Tue (6/23) 1h | (Lecture) Large Language Models: Overview | CMU 11-711 ANLP Stanford CS336 |
|
| Thur (6/25) 2h | (Lecture) LLM and Agentic Trustworthiness: Overview | DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models AI Risk Management Should Incorporate Both Safety and Security Recommendations for Technical AI Safety Research Directions DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents |
||
| Week 8 (6/28 - 7/4) | Tue (6/30) 1h | Presentation | Notes Submission Required | |
| Thur (7/2) 2h | Presentation | Notes Submission Required | ||
| Week 9 (7/5 - 7/11) | Tue (7/7) 1h | Presentation | Notes Submission Required | |
| Thur (7/9) 2h | Presentation | Notes Submission Required | ||
| Week 10 (7/12 - 7/18) | Tue (7/14) 1h | Presentation | Notes Submission Required | |
| Thur (7/16) 2h | Presentation | Notes Submission Required | ||
| Week 11 (7/17 - 7/25) | Tue (7/21) 1h | Presentation | Notes Submission Required | |
| Thur (7/23) 2h | Presentation | Notes Submission Required | ||
| Week 12 (7/26 - 8/1) | Tue (7/28) 1h | Presentation | Notes Submission Required | |
| Thur (7/30) 2h | Presentation | Notes Submission Required | ||
| Week 13 (8/2 - 8/8) | Tue (8/4) 1h | Presentation | Notes Submission Required Course project Due |
|
| Thur (8/6) 2h | Presentation (Lecture) Course Project Discussion, Closing Remarks |
Notes Submission Required |
||
| Week 14 (8/9 - 8/15) | Grade Released |
Assignments and Project #
- Homework 0
- Deadline: 23:59,
May 23, 2026 (end of week 2)May 30, 2026 (end of week 3) - Grades released
- Deadline: 23:59,
- Presentation:
- Signing-up spreadsheet and instructions released on May 28 on CourSys for signing up
- Signing-up due date: Jun 18 (Week 6 Thursday)
- Course project:
- Released on Week 5 (Jun 11)
- Development kit
- Instruction Slides
- Due: Beginning of Week 13 (Aug 4)
- Note Submission:
- Submission links dynamically released on CourSys
- Only for student presentation dates
- Due 7 days after each presentation date
- Submit on CourSys
- Up to 3 exemptions
Grading #
10% Homework 0 (raw score) + 40% course project (1.1 × raw score with no cap) + 30% paper presentation + 20% notes of peer evaluation and summary
Information Platform #
Prerequisites #
There is no formal pre-requisite. Background in algorithms, calculus, linear algebra (e.g., MATH 151, MATH 152, MATH 232, CMPT 225), CMPT 410/726 Machine Learning strongly recommended. It is also recommended to have a background in CMPT 412/762 Computer Vision and CMPT 713 NLP.
Textbook and Reading Materials #
There is no primary reference material. We will read an assortment of research papers during lectures.
- Deep Learning Book
- By Ian Goodfellow, Yoshua Bengio, and Aaron Courville
- Recommended for students to gain background in deep learning before taking the course.
- Online course Intro to ML Safety
- By Dan Hendrycks at the Center for AI Safety
- Optional, advanced reading for interested students
- A well-developed course recommended for those who want to learn general machine learning safety from a systematic and interdisciplinary perspective.
Ethics Statement #
This course will include topics related computer security and privacy. As part of this investigation we may cover technologies whose abuse could infringe on the rights of others. As computer scientists, we rely on the ethical use of these technologies. Unethical use includes circumvention of an existing security or privacy mechanisms for any purpose, or the dissemination, promotion, or exploitation of vulnerabilities of these services. Any activity outside the letter or spirit of these guidelines will be reported to the proper authorities and may result in dismissal from the class and possibly more severe academic and legal sanctions.
Academic Integrity Policy #
- Some examples of unacceptable behaviour in homeworks and course projects:
- Handing in assignments that are not 100% your own work (in design, implementation, wording, etc.), without proper citation. There must be a README file in your submission with citations to any external code used.
- Sharing code fragments with others in class (for group project, with others who are not in the same group) is not allowed.
- Keep discussions to high level information rather than specific code hints.
- Copying and then obfuscating code is a serious academic honesty violation.
- Submitting work that has been submitted before, for any course at any institution.
- If you are unclear on what academic honesty is, see Simon Fraser University’s Policy S10-01.
- All instances of academic dishonesty will be dealt with very severely.
- In general, minimum requested penalties will be as follows:
- For assignments and course projcets: a mark of -50% on the assignment. So, academic dishonesty on an assignment worth 10% of your final mark will result in a zero on the assignment, and a penalty of 5% from your final grade.
- Please note that these are minimum penalties. At the instructor’s option, more severe penalties may be given/requested. All instances of academic dishonesty will be noted on your University record.
- The instructor may use an automated service that will check for plagiarism.
LLM Policy:
LLMs can be used for study core concepts and usage of packages at a high level. Also, LLMs can be used as code completion assistants. However, LLMs are not allowed to finish the assignment as a whole or any significant component of the assignment from scratch. Violators will be viewed as breach of academic integrity.
Acknowledgement #
The course is developed from CS562 and CS598GS at UIUC. Part of the content is adapted from Intro to ML Safety. Some course policies are developed from CMPT 413 Natural Language Processing.