EN.601.783

Vision as Bayesian Inference

Tues/Thurs: 9:00-10:15 Spring 2024, Homewood Campus, Hackerman B17 


Course Description

This course models vision as Bayesian Inference. It concentrates on visual tasks such as segmenting images, detecting objects in images, and recognizing objects. The course will also cover advanced topics including CNNs, Transformers, NERF, diffusion models, vision-language models and LLMs/ChatGPT. Its goal is to describe the state of the art techniques. The handouts consist of copies of the lecture notes and related papers.

Important information

Homework

 

Reading Material

 

Tentative Schedule

Lecture Topics Handouts Additional Readings
1 (01/23/2024) Introduction Lecture1
2 (01/25/2024) Image Representation and PCA Sparsity Lecture2 Eigenfaces
Robust Face Recognition
Sparse Representation
3 (01/30/2024) Dictionaries, Mixtures of Gaussians, Miniā€Epitomes, EM Lecture3 (1)
Lecture3 (2)
K-means++
Mini-Epitomes
4 (02/01/2024)

Edge Detection, Generative and Discriminative Models

Lecture4 (1)
Lecture4 (2)

ProtoObjects
SLIC

Lecture4

5 (02/06/2024) SuperPixel; Decision Theory

Lecture5 (1)
Lecture5 (2)
Lecture5 (3)

6 (02/08/2024) Image Segmentation  Lecture6
7 (02/13/2024)

Markov Random Fields and MFT;

Examples - GrabCut

Lecture7(1)

Lecture7(2)

Lecture7(3)

8 (02/15/2024)

Exponential Models with Latent Variables;

Lecture8

9 (02/20/2024)

Boltzmann Machine and HMMs

Lecture9(1)

Lecture9(2)

Lecture9(3)

10 (02/22/2024)

Regression & Deep Neural Networks

Lecture10(1)

Lecture10(2)

Lecture10(3)

 

11 (02/27/2024)

Deformable Part Model

Lecture11

12 (02/29/2024)

SVM

Lecture12

13

(03/07/2024)

Compositional (Semantic) Structure and Unsupervised Graph Structure Learning

Lecture13

14

(03/12/2024)

Compositional Generative Networks

Lecture14(1)

Lecture14(2)

15

(03/14/2024)

3D NEMO

NeuralSMPL

Lecture15(1)

Lecture15(2)

Lecture15(3)

03/16 - 03/24

Spring Break

 

16

03/26/2024

Transformer

Lecture16

17

03/28/2024

SuperPixel Transformer

Lecture17

18

04/02/2024

Vision-Language

Lecture18(1)

Lecture18(2)

Lecture18(3)

19

04/04/2024

GAN, AutoEncoder, Diffusion models

Lecture19(1)

Lecture19(2)

Lecture19(3)

Lecture19(4)

20

04/09/2024

Adversarial Attack and Examiner

Lecture20(1)

Lecture20(2)

Lecture20(3)

Lecture20(4)

21

04/11/2024

Self-supervised Learning

Lecture21

22

04/16/2024

Synthetic Data and Controllable Generation for 3D Models

Lecture22(1)

Lecture22(2)

Lecture22(3)

23

04/18/2024

Lambertian Model 

Lecture23(1)

Lecture23(2)

Lecture23(3)

24

04/23/2024

Rendering Techniques (Gaussian Splatting; NERF etc.)

Lecture24(1)

Lecture24(2)

25

04/25/2024

Physical Scene; World Models; Activities

Lecture25(1)

Lecture25(2)

Lecture25(3)