Improving alignment and controllability in GANs and diffusion models


Li, Yumeng


[img] PDF
DissertationLi.pdf - Published

Download (105MB)

URN: urn:nbn:de:bsz:180-madoc-692757
Document Type: Doctoral dissertation
Year of publication: 2025
Place of publication: Mannheim
University: Universität Mannheim
Evaluator: Keuper, Margret
Date of oral examination: 2025
Publication language: English
Institution: School of Business Informatics and Mathematics > Machine Learning (Keuper 2024-)
License: CC BY 4.0 Creative Commons Attribution 4.0 International (CC BY 4.0)
Subject: 004 Computer science, internet
Keywords (English): generative models , GANs , diffusion models , GenAI , computer vision
Abstract: Visual generative modeling is a transformative area that aims to synthesize diverse, realistic-looking visual content, e.g., images and videos. These models are widely applied in various domains, ranging from creative art design and the visual effects industry to data augmentation for downstream computer vision tasks. Over the past decade, this field has made tremendous progress, with significant advancements evolving from Generative Adversarial Networks (GANs) to diffusion models. Despite achieving higher fidelity and improved training stability, it remains challenging to control the synthesis process and generate content precisely as desired. To this end, this thesis presents several new techniques aimed at improving alignment and controllability in GANs and diffusion models across various tasks, such as GAN inversion, layout-to-image, text-to-image, and text-to-video generation. Further, these enhancements make the models more effective in a wide range of real-world applications.




Dieser Eintrag ist Teil der Universitätsbibliographie.

Das Dokument wird vom Publikationsserver der Universitätsbibliothek Mannheim bereitgestellt.




Metadata export


Citation


+ Search Authors in

BASE: Li, Yumeng

Google Scholar: Li, Yumeng

ORCID: Li, Yumeng ORCID: 0000-0002-5562-1707

+ Download Statistics

Downloads per month over past year

View more statistics



You have found an error? Please let us know about your desired correction here: E-Mail


Actions (login required)

Show item Show item