MSc Thesis Presentation - Olivia Perryman
Name: Olivia Perryman
Date: Wednesday, August 7th
Time: 10:30am
Location: Join Zoom Meeting
https://ubc.zoom.us/j/2368822572?pwd=ODVMV1MySnM2YWNSTnVWT2wwbEVnZz09
Meeting ID: 236 882 2572
Passcode: 587537
Supervisor (s): Helge Rhodin, Kwang Moo Yi
Title of the thesis: Weakly-Supervised Geometry-Aware Novel View Synthesis
Abstract:
Enabling computers to understand and interpret visual information is crucial for the development of more sophisticated and interactive technologies. Learning structure, such as 3D shape, can help computers understand visual information more effectively. Our model disentangles object structure and appearance in a self-supervised manner from multiview images within a single category. We extract a 3D pointcloud from images and reconstruct consistent novel views by rendering the pointclouds from different perspectives. Using a much simpler model and far fewer training examples than costly state-of-the-art diffusion models, we can recover 3D structure from single images of objects and quickly reconstruct them from unseen viewpoints. Our findings suggest that understanding 3D constraints of the real world can enhance performance on visual tasks and make models more robust and generalizable to a wider variety of inputs. This approach enables downstream tasks such as pose transfer, spatially-guided conditional image generation, and paves the way for commonsense reasoning. Our work has potential applications in augmented reality and visual effects, with further exploration of the model's capabilities and integration into broader systems for enhanced visual understanding.