OS-QLR: One-Shot Quantized Latent Refinement for Fast and Efficient Image Generation

Li, Peng; Šenkeřík, Roman; Viktorin, Adam

dc.title	OS-QLR: One-Shot Quantized Latent Refinement for Fast and Efficient Image Generation	en
dc.contributor.author	Li, Peng
dc.contributor.author	Šenkeřík, Roman
dc.contributor.author	Viktorin, Adam
dc.relation.ispartof	Communications in Computer and Information Science
dc.identifier.issn	1865-0929 Scopus Sources, Sherpa/RoMEO, JCR
dc.identifier.isbn	978-303215637-2
dc.date.issued	2026
utb.relation.volume	2829 CCIS
dc.citation.spage	650
dc.citation.epage	665
dc.event.title	17th International Joint Conference on Computational Intelligence, IJCCI 2025
dc.event.location	Marbella
utb.event.state-en	Spain
utb.event.state-cs	Španělsko
dc.event.sdate	2025-10-22
dc.event.edate	2025-10-24
dc.type	conferenceObject
dc.language.iso	en
dc.publisher	Springer Science and Business Media Deutschland GmbH
dc.identifier.doi	10.1007/978-3-032-15638-9_38
dc.relation.uri	https://link.springer.com/chapter/10.1007/978-3-032-15638-9_38
dc.subject	CIFAR-10	en
dc.subject	Computational Efficiency	en
dc.subject	FashionMNIST	en
dc.subject	Generative AI	en
dc.subject	Image Generation	en
dc.subject	One-Shot Quantized Latent Refinement	en
dc.subject	VQ-VAE	en
dc.description.abstract	This paper introduces One-Shot Quantized Latent Refinement (OS-QLR), a novel two-stage generative framework designed for high-quality image generation and improved computational efficiency. OS-QLR first learns a compact, discrete latent representation using a Vector Quantized Variational Autoencoder (VQ-VAE). It then employs a single-step refinement network within this latent space to produce clean, plausible samples from noisy or random inputs. Experimental results on the FashionMNIST and CIFAR-10 datasets show that OS-QLR consistently delivers superior image quality, featuring sharper details, fewer artifacts, and significantly lower Fréchet Inception Distance scores compared to unrefined VQ-VAE models. Additionally, OS-QLR demonstrates strong performance even with various levels of latent space corruption. Importantly, the training process for OS-QLR is greatly accelerated, taking only hours instead of the days or even weeks required by Diffusion Models, Generative Adversarial Networks (GANs), and Autoregressive image generation models. The non-iterative sampling method allows for rapid image generation, making OS-QLR a compelling and efficient alternative to current computationally intensive generative models. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.	en
utb.faculty	Faculty of Applied Informatics
dc.identifier.uri	http://hdl.handle.net/10563/1012805
utb.identifier.scopus	2-s2.0-105030320236
utb.source	d-scopus
dc.date.accessioned	2026-04-30T12:07:57Z
dc.date.available	2026-04-30T12:07:57Z
dc.description.sponsorship	The research presented in this paper was partially supported by the Internal Grant Agency of Tomas Bata University in Zlin under project number IGA/CebiaTech/2023/004, and by resources of the A.I.Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin (ailab.fai.utb.cz).
utb.contributor.internalauthor	Li, Peng
utb.contributor.internalauthor	Šenkeřík, Roman
utb.contributor.internalauthor	Viktorin, Adam
utb.fulltext.sponsorship	The research presented in this paper was partially supported by the Internal Grant Agency of Tomas Bata University in Zlin under project number IGA/CebiaTech/2023/004, and by resources of the A.I.Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin (ailab.fai.utb.cz).
utb.fulltext.projects	IGA/CebiaTech/2023/004