teaching | Wengong Jin

CS7170 Seminar in Artificial Intelligence: Frontier in AI for Science (Spring 2025)

Time: Tuesday 11:45 am - 1:25 pm, Thursday 2:50 pm - 4:30 pm
Location: Hastings Suite 113
Lecturer: Wengong Jin
Pre-requisites: N/A

Course Description

This course provides an introduction of how Artificial Intelligence (AI) and Machine Learning (ML) techniques are transforming scientific discovery. Students will learn various deep learning architectures, such as graph neural networks, transformers, diffusion models, and explore how these models can be applied across scientific disciplines such as chemistry, biology, and material science. Students from non-CS disciplines are encouraged to attend.

Course Deliverables

Participation and Discussions (20%).
Research frontier presentation (literature survey) (40%): present a list of papers related to a research topic in the field (~30min)
Project presentation (40%): Present a project in any topic related to AI for science. If you are working on any research projects in the intersection of science and AI, you are welcome to present your existing research work (~30min)
Please submit your research frontier presentation title and project proposal via this Google form (deadline 2/24 EOD)
If you would like to schedule an office hour with me (20min on Tuesday afternoon), please sign up using Calendarly

Schedule

Each class will cover a deep learning topic and demonstrate how it can be applied to different problems in scientific disciplines. The class is divided into two parts. The first part focuses on state-of-the-art AI methods and their applications in science, with the goal of helping students understand the current state of the field (AI in science). The second part focuses on research frontiers, with the goal of prompting students think about directions that may shape the future of this field. This part will be presentations from students or guest speakers. The class will end with a few project presentations.

Date	AI topics	Reading
1/7	Introduction	Antibiotic discovery; Protein folding; Material design;
1/9	Graph neural network (1)	MPNN; GAT; Chemprop; Crystal GCN;
1/14	Graph neural network (2)	GNNExplainer; RationaleRL; Pretraining GNN
1/16	Graph generation	GraphRNN; Junction Tree VAE;
1/21	Transformer (1)	Protein LMs: ESM; ESM-3
1/23	Transformer (2)	DNA LMs: MAMBA; Evo
1/28	Equivariant neural networks
1/30	Molecular dynamics
2/4	Diffusion models (1) (Gaussian diffusion)
2/6	Diffusion models (2) (Flow matching)
2/11	Explainable AI (1)
2/13	Explainable AI (2)
2/18	LLM (1)	RLHF and DPO survey
2/20	LLM (2)	Chain-of-thought reasoning survey
2/25	Guest Lecture: Tinglin Huang (Yale)	FAFormer for RNA; FAFormer for spatial transcriptomics
2/27	Project Proposal presentation
3/4-3/6	Spring break (no class)
3/11	Guest Lecture: Wenhao Gao (MIT)
3/13	Research frontier presentation	Presenter: Franc, Joey Ehlert
3/18	Research frontier presentation	Presenter: Ross Stewart, Karna Mendonca, Circe Hsu
3/20	Research frontier presentation	Presenter: Youran Ye, Ardavan Mehdizadeh, Haneen Abderrazzaq
3/25	Research frontier presentation	Presenter: Jici Jiang, Sarah Szvetecz, Yinyue Zhu
3/27	Guest Lecture: Kyle Swanson (Stanford)
4/1	Guest Lecture: Tian Xie (Microsoft AI for Science)
4/3	Project presentation by students	Jici, Franc, Joey
4/8	Project presentation by students	Ross, Karna
4/10	Project presentation by students	Haneen, Yinyue, Sarah
4/15	Project presentation by students	Youran, Ardavan, Circe

Research frontier presentation: example topics

Scalability: faster equivariant neural networks and diffusion models
Interpretability: how to understand the rationale behind model predictions?
Physics-informed neural networks: how to incorporate domain knowledge (e.g. biophysics and chemistry)?
LLM agent: how to apply LLM agents to accelerate scientific discovery?
Federated learning: how to share sensitive data for model training?
Lab-in-the-loop learning: how to learn from experimental feedback?
(More based on your interest)