TBU Publications
Repository of TBU Publications

OS-QLR: One-Shot Quantized Latent Refinement for Fast and Efficient Image Generation

DSpace Repository

Show simple item record


dc.title OS-QLR: One-Shot Quantized Latent Refinement for Fast and Efficient Image Generation en
dc.contributor.author Li, Peng
dc.contributor.author Šenkeřík, Roman
dc.contributor.author Viktorin, Adam
dc.relation.ispartof Communications in Computer and Information Science
dc.identifier.issn 1865-0929 Scopus Sources, Sherpa/RoMEO, JCR
dc.identifier.isbn 978-303215637-2
dc.date.issued 2026
utb.relation.volume 2829 CCIS
dc.citation.spage 650
dc.citation.epage 665
dc.event.title 17th International Joint Conference on Computational Intelligence, IJCCI 2025
dc.event.location Marbella
utb.event.state-en Spain
utb.event.state-cs Španělsko
dc.event.sdate 2025-10-22
dc.event.edate 2025-10-24
dc.type conferenceObject
dc.language.iso en
dc.publisher Springer Science and Business Media Deutschland GmbH
dc.identifier.doi 10.1007/978-3-032-15638-9_38
dc.relation.uri https://link.springer.com/chapter/10.1007/978-3-032-15638-9_38
dc.subject CIFAR-10 en
dc.subject Computational Efficiency en
dc.subject FashionMNIST en
dc.subject Generative AI en
dc.subject Image Generation en
dc.subject One-Shot Quantized Latent Refinement en
dc.subject VQ-VAE en
dc.description.abstract This paper introduces One-Shot Quantized Latent Refinement (OS-QLR), a novel two-stage generative framework designed for high-quality image generation and improved computational efficiency. OS-QLR first learns a compact, discrete latent representation using a Vector Quantized Variational Autoencoder (VQ-VAE). It then employs a single-step refinement network within this latent space to produce clean, plausible samples from noisy or random inputs. Experimental results on the FashionMNIST and CIFAR-10 datasets show that OS-QLR consistently delivers superior image quality, featuring sharper details, fewer artifacts, and significantly lower Fréchet Inception Distance scores compared to unrefined VQ-VAE models. Additionally, OS-QLR demonstrates strong performance even with various levels of latent space corruption. Importantly, the training process for OS-QLR is greatly accelerated, taking only hours instead of the days or even weeks required by Diffusion Models, Generative Adversarial Networks (GANs), and Autoregressive image generation models. The non-iterative sampling method allows for rapid image generation, making OS-QLR a compelling and efficient alternative to current computationally intensive generative models. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026. en
utb.faculty Faculty of Applied Informatics
dc.identifier.uri http://hdl.handle.net/10563/1012805
utb.identifier.scopus 2-s2.0-105030320236
utb.source d-scopus
dc.date.accessioned 2026-04-30T12:07:57Z
dc.date.available 2026-04-30T12:07:57Z
dc.description.sponsorship The research presented in this paper was partially supported by the Internal Grant Agency of Tomas Bata University in Zlin under project number IGA/CebiaTech/2023/004, and by resources of the A.I.Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin (ailab.fai.utb.cz).
utb.contributor.internalauthor Li, Peng
utb.contributor.internalauthor Šenkeřík, Roman
utb.contributor.internalauthor Viktorin, Adam
utb.fulltext.sponsorship The research presented in this paper was partially supported by the Internal Grant Agency of Tomas Bata University in Zlin under project number IGA/CebiaTech/2023/004, and by resources of the A.I.Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin (ailab.fai.utb.cz).
utb.fulltext.projects IGA/CebiaTech/2023/004
Find Full text

Files in this item

Files Size Format View

There are no files associated with this item.

Show simple item record