teaching
CS7170 Seminar in Artificial Intelligence: Frontier in AI for Science (Spring 2025)
- Time: Tuesday 11:45 am - 1:25 pm, Thursday 2:50 pm - 4:30 pm
- Location: Hastings Suite 113
- Lecturer: Wengong Jin
- Pre-requisites: N/A
Course Description
This course provides an introduction of how Artificial Intelligence (AI) and Machine Learning (ML) techniques are transforming scientific discovery. Students will learn various deep learning architectures, such as graph neural networks, transformers, diffusion models, and explore how these models can be applied across scientific disciplines such as chemistry, biology, and material science. Students from non-CS disciplines are encouraged to attend.
Course Deliverables
- Participation and Discussions (20%).
- Research frontier presentation (40%) (see schedule below).
- Project presentation (40%): Present a project in any topic related to AI for science. If you are working on any research projects in the intersection of science and AI, you are welcome to present your existing research work.
Schedule (tentative)
Each class will cover a deep learning topic and demonstrate how it can be applied to different problems in scientific disciplines. The class is divided into two parts. The first part focuses on state-of-the-art AI methods and their applications in science, with the goal of helping students understand the current state of the field (AI in science). The second part focuses on research frontiers, with the goal of prompting students think about directions that may shape the future of this field. This part will be presentations from students or guest speakers. The class will end with a few project presentations.
Date | AI topics | Example Applications | Reading |
---|---|---|---|
1/7 | Introduction | Miscellaneous | Antibiotic discovery; Protein folding; Material design; |
1/9 | Graph neural network I | Molecule/material property prediction | MPNN; GAT; Chemprop; Crystal GCN; |
1/14 | Graph neural network II | Molecule property prediction | GNNExplainer; RationaleRL; Pretraining GNN |
1/16 | Graph generation | De novo drug design | GraphRNN; Junction Tree VAE; |
1/21 | Transformer I | Protein language models | ESM; ESM-3 |
1/23 | Transformer II | DNA language models | MAMBA; Evo |
1/28 | Equivariant neural networks I | Molecular property prediction | |
1/30 | Equivariant neural networks II | Protein structure modeling | |
2/4 | Diffusion models I (Gaussian diffusion) | Image generation | |
2/6 | Diffusion models II (Flow matching) | Protein folding | |
2/11 | Diffusion models III (Discrete diffusion) | Protein design | |
2/13 | Reinforcement learning I (PPO) | De novo drug design | |
2/18 | Reinforcement learning II (Monte Carlo tree search) | Chemical retrosynthesis | |
2/20 | Self-supervised learning | Biomedical image analysis | |
2/25 | Explainable AI | Miscellaneous | |
2/27 | Project Proposal presentation | Miscellaneous | |
3/4-3/6 | Spring break (no class) | ||
3/11 | Guest Lecture: Wenhao Gao (MIT) | Chemistry | |
3/13 | Research frontier II | ||
3/18 | Research frontier III | ||
3/20 | Research frontier IV | ||
3/25 | Research frontier V | ||
3/27 | Research frontier VI | ||
4/1 | Research frontier VII | ||
4/3 | Research frontier VIII | ||
4/8 | Project presentation by students | ||
4/10 | Project presentation by students | ||
4/15 | Last day of class |
Research frontier candidate topics
- Scalability: faster equivariant neural networks and diffusion models
- Interpretability: how to understand the rationale behind model predictions?
- Physics-informed neural networks: how to incorporate domain knowledge (e.g. biophysics and chemistry)?
- LLM agent: how to apply LLM agents to accelerate scientific discovery?
- Federated learning: how to share sensitive data for model training?
- Lab-in-the-loop learning: how to learn from experimental feedback?
- (More based on your interest)