EN.601.783

Vision as Bayesian Inference

Tues/Thurs: 9:00-10:15 Spring 2024, Homewood Campus, Hackerman B17

Course Description

This course models vision as Bayesian Inference. It concentrates on visual tasks such as segmenting images, detecting objects in images, and recognizing objects. The course will also cover advanced topics including CNNs, Transformers, NERF, diffusion models, vision-language models and LLMs/ChatGPT. Its goal is to describe the state of the art techniques. The handouts consist of copies of the lecture notes and related papers.

Important information

Instructor: Prof. Alan Yuille, ayuille1@jhu.edu.
TAs: Jieneng Chen (jchen293@jhu.edu, OH: Wed 8-9am), Steven Kan (pkan2@jhu.edu, OH: Tue 12-1pm)
CAs: Junfei Xiao (junfei.xiao@jhu.edu, OH: Wed 1-2pm), Jiahao Wang (jwang384@jhu.edu, OH: Thu 1:30-2:30pm) and Rong Yang (ryang42@jh.edu, OH: Mon 9-10:00am)
Office hours (OH): Malone 216; Any change will be posted in the "Discussions" section.
We encourages students to initiate a discussion related to course in Canvas or piazza.
Grading Plan:
- (tentative) 4 homework assignments (to be posted here) + bonus participation points. Late homework will not be accepted unless permission is obtained well in advance in documented extenuating circumstances. Please email Prof. Yuille (ayuille1@jhu.edu) for approval and cc TAs.
- We encourage students to participate in class, by asking questions to the professor. TAs will periodically record the names of those who ask interesting questions and award them bonus points that contribute to their final grades (1 point per class session, with a maximum of 3 points). Students not asking questions will not experience any negative effects on their grades.

Homework

Homework1 covers the first 3 weeks of the courses. It is released in the section "Assignments". Please submit directly in Canvas.

Reading Material

Course Notes and Handouts.
Papers and Journal Articles.

Tentative Schedule

More will be updated. Please see the schedule in Spring23 for reference. We will add more advanced topics including CNNs, Transformers, NERF, diffusion models, vision-language models and LLMs/ChatGPT.

Lecture	Topics	Handouts	Additional Readings
1 (01/23/2024)	Introduction	Lecture1
2 (01/25/2024)	Image Representation and PCA Sparsity	Lecture2	Eigenfaces Robust Face Recognition Sparse Representation
3 (01/30/2024)	Dictionaries, Mixtures of Gaussians, Mini‐Epitomes, EM	Lecture3 (1) Lecture3 (2)	K-means++ Mini-Epitomes
4 (02/01/2024)	Edge Detection, Generative and Discriminative Models	Lecture4 (1) Lecture4 (2)	ProtoObjects SLIC Lecture4
5 (02/06/2024)	SuperPixel; Decision Theory	Lecture5 (1) Lecture5 (2) Lecture5 (3)
6 (02/08/2024)	Image Segmentation	Lecture6
7 (02/13/2024)	Markov Random Fields and MFT; Examples - GrabCut	Lecture7(1) Lecture7(2) Lecture7(3)
8 (02/15/2024)	Exponential Models with Latent Variables;	Lecture8
9 (02/20/2024)	Boltzmann Machine and HMMs	Lecture9(1) Lecture9(2) Lecture9(3)
10 (02/22/2024)	Regression & Deep Neural Networks	Lecture10(1) Lecture10(2) Lecture10(3)
11 (02/27/2024)	Deformable Part Model	Lecture11
12 (02/29/2024)	SVM	Lecture12
13 (03/07/2024)	Compositional (Semantic) Structure and Unsupervised Graph Structure Learning	Lecture13
14 (03/12/2024)	Compositional Generative Networks	Lecture14(1) Lecture14(2)
15 (03/14/2024)	3D NEMO NeuralSMPL	Lecture15(1) Lecture15(2) Lecture15(3)
03/16 - 03/24	Spring Break
16 03/26/2024	Transformer	Lecture16
17 03/28/2024	SuperPixel Transformer	Lecture17
18 04/02/2024	Vision-Language	Lecture18(1) Lecture18(2) Lecture18(3)
19 04/04/2024	GAN, AutoEncoder, Diffusion models	Lecture19(1) Lecture19(2) Lecture19(3) Lecture19(4)
20 04/09/2024	Adversarial Attack and Examiner	Lecture20(1) Lecture20(2) Lecture20(3) Lecture20(4)
21 04/11/2024	Self-supervised Learning	Lecture21
22 04/16/2024	Synthetic Data and Controllable Generation for 3D Models	Lecture22(1) Lecture22(2) Lecture22(3)
23 04/18/2024	Lambertian Model	Lecture23(1) Lecture23(2) Lecture23(3)
24 04/23/2024	Rendering Techniques (Gaussian Splatting; NERF etc.)	Lecture24(1) Lecture24(2)
25 04/25/2024	Physical Scene; World Models; Activities	Lecture25(1) Lecture25(2) Lecture25(3)