Available as a colab notebook here.
Given a single input image (left) and a conditioning transform (not shown), our tiny latent diffusion model generates novel views (pred) of unseen geometry. Note that even when the model has no information on the top or back of the cars, it can still generate feasible predictions that resemble the unseen ground truth images.
Please see our paper for additional details on the project.
Webpage created by Ethan Chun and ChatGPT :)