EN.601.783

Vision as Bayesian Inference

Tues/Thurs: 9:00-10:15 Spring 2025, Homewood Campus, Hodson 210 


Course Description

This course models vision as Bayesian Inference. It concentrates on visual tasks such as segmenting images, detecting objects in images, and recognizing objects. The course will also cover advanced topics including CNNs, Transformers, NERF, diffusion models, vision-language models and LLMs/ChatGPT. Its goal is to describe the state of the art techniques. The handouts consist of copies of the lecture notes and related papers.

Important information

Homework

 

Reading Material

 

Tentative Schedule

Lecture Topics Handouts Additional Readings
1 (01/21/2025) Introduction Lecture1
2 (01/23/2025) Image Representation and PCA Sparsity Lecture2 Eigenfaces
Robust Face Recognition
Sparse Representation
3 (01/28/2025) Dictionaries, Mixtures of Gaussians, Mini‐Epitomes, EM Lecture3 (1)
Lecture3 (2)
K-means++
Mini-Epitomes
4 (02/01/2025)

Edge Detection, Generative and Discriminative Models

Lecture4 (1)
Lecture4 (2)

ProtoObjects
SLIC

Lecture4

5 (02/04/2025) EM; Image Statistics & New Membrane Models

Lecture5 (EM)
Lecture5 (Image Statistics & New Membrane Models)

6 (02/06/2025) Region Competition & Variational Image Statistics &   Lecture6 (Region Competition & Variational Image Statistics) Lecture6 (Edge Detection)
7 (02/11/2025)

Markov Random Fields and MFT;

Examples - GrabCut

Lecture7(1)

Lecture7(2)

Lecture7(3)

8 (02/13/2025)

Exponential Models with Latent Variables;

Lecture8

9 (02/18/2025)

Boltzmann Machine and HMMs

Lecture9(1)

Lecture9(2)

Lecture9(3)

10 (02/20/2025)

Regression & Deep Neural Networks

Lecture10(1)

Lecture10(2)

Lecture10(3)

 

11 (02/25/2025)

Deformable Part Model

Lecture11

12 (02/27/2025)

SVM

Lecture12

13

(03/05/2025)

Compositional (Semantic) Structure and Unsupervised Graph Structure Learning

Lecture13

14

(03/10/2025)

Compositional Generative Networks

Lecture14(1)

Lecture14(2)

15

(03/12/2025)

3D NEMO

NeuralSMPL

Lecture15(1)

Lecture15(2)

Lecture15(3)

03/16 - 03/24

Spring Break

 

16

03/24/2025

Transformer

Lecture16

17

03/26/2025

SuperPixel Transformer

Lecture17

18

03/31/2025

Vision-Language

Lecture18(1)

Lecture18(2)

Lecture18(3)

19

04/02/2025

GAN, AutoEncoder, Diffusion models

Lecture19(1)

Lecture19(2)

Lecture19(3)

Lecture19(4)

20

04/07/2025

Adversarial Attack and Examiner

Lecture20(1)

Lecture20(2)

Lecture20(3)

Lecture20(4)

21

04/09/2025

Self-supervised Learning

Lecture21

22

04/14/2025

Synthetic Data and Controllable Generation for 3D Models

Lecture22(1)

Lecture22(2)

Lecture22(3)

23

04/16/2025

Lambertian Model 

Lecture23(1)

Lecture23(2)

Lecture23(3)

24

04/21/2025

Rendering Techniques (Gaussian Splatting; NERF etc.)

Lecture24(1)

Lecture24(2)

25

04/23/2025

Physical Scene; World Models; Activities

Lecture25(1)

Lecture25(2)

Lecture25(3)