Abstract
We propose to repaint an image-to-video diffusion model to synthesize light fields that are geometrically consistent. Despite significant advancements in diffusion models for novel view synthesis, applying these models to generate a light field, i.e., fronto-parallel multiple views, has been challenging because of persistent visual and geometric consistency issues. By utilizing advances in video diffusion, we extend the temporal consistency of video diffusion to the geometric consistency of multi-view settings. We fine-tune the image-to-video diffusion model framework for optimized multi-view diffusion by incorporating multi-view data with camera parameters. Furthermore, we propose integrating a repaint method during the sampling (denoising process) to achieve enhanced accurate camera control in multi-view diffusion, improving consistency by maintaining the known region in the input image. This approach enables the application of light field synthesis that requires precise camera control and demonstrates the ability of diffusion models to generate light fields with wide baselines, leveraging their unique generative power.
Original language | English |
---|---|
Title of host publication | Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings |
Editors | Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 145-160 |
Number of pages | 16 |
ISBN (Print) | 9783031784552 |
DOIs | |
State | Published - 2025 |
Event | 27th International Conference on Pattern Recognition, ICPR 2024 - Kolkata, India Duration: 1 Dec 2024 → 5 Dec 2024 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 15318 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 27th International Conference on Pattern Recognition, ICPR 2024 |
---|---|
Country/Territory | India |
City | Kolkata |
Period | 1/12/24 → 5/12/24 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
Keywords
- Light Field
- Novel View Synthesis
- Video Diffusion Model