Tues/Thurs: 9:00-10:15 Spring 2024, Homewood Campus, Hackerman B17
This course models vision as Bayesian inference. It concentrates on visual tasks such as segmenting images, detecting objects in images, and recognizing objects. The course will also cover advanced topics including CNNs, Transformers, NeRF, diffusion models, vision-language models, and LLMs/ChatGPT. Its goal is to describe state-of-the-art techniques. The handouts consist of copies of the lecture notes and related papers.
Lecture | Topics | Handouts | Additional Readings |
1 (01/23/2024) | Introduction | Lecture1 | |
2 (01/25/2024) | Image Representation and PCA Sparsity | Lecture2 | Eigenfaces Robust Face Recognition Sparse Representation |
3 (01/30/2024) | Dictionaries, Mixtures of Gaussians, Mini-Epitomes, EM | Lecture3 (1) Lecture3 (2) | K-means++ Mini-Epitomes |
4 (02/01/2024) | Edge Detection, Generative and Discriminative Models | Lecture4 (1) Lecture4 (2) | |
5 (02/06/2024) | SuperPixel; Decision Theory | | |
6 (02/08/2024) | Image Segmentation | Lecture6 | |
7 (02/13/2024) | Markov Random Fields and MFT; Examples - GrabCut | | |
8 (02/15/2024) | Exponential Models with Latent Variables | Lecture8 | |
9 (02/20/2024) | Boltzmann Machine and HMMs | | |
10 (02/22/2024) | Regression & Deep Neural Networks | | |
11 (02/27/2024) | Deformable Part Model | | |
12 (02/29/2024) | SVM | | |
13 (03/07/2024) | Compositional (Semantic) Structure and Unsupervised Graph Structure Learning | | |
14 (03/12/2024) | Compositional Generative Networks | | |
15 (03/14/2024) | 3D NEMO NeuralSMPL | | |
03/16 - 03/24 | Spring Break | | |
16 (03/26/2024) | Transformer | | |
17 (03/28/2024) | SuperPixel Transformer | | |
18 (04/02/2024) | Vision-Language | | |
19 (04/04/2024) | GAN, AutoEncoder, Diffusion models | | |
20 (04/09/2024) | Adversarial Attack and Examiner | | |
21 (04/11/2024) | Self-supervised Learning | | |
22 (04/16/2024) | Synthetic Data and Controllable Generation for 3D Models | | |
23 (04/18/2024) | Lambertian Model | | |
24 (04/23/2024) | Rendering Techniques (Gaussian Splatting; NeRF etc.) | | |
25 (04/25/2024) | Physical Scene; World Models; Activities | | |