17 May 2024 Single-image translation based on improved spatial attention and hybrid convolution
Pengbo Zhou, Zhiqiang Yang, Longquan Yan, Guohua Geng, Mingquan Zhou
Author Affiliations +
Abstract

Our primary objective is to address the challenges of unsupervised image-to-image translation, particularly in the domain of applying different visual styles to given content images. However, popular single-image translation techniques face issues such as poor quality, excessive image noise, and discrepancies between generated images and human perception. To overcome these challenges, we propose an innovative approach utilizing a dual-branch attention-guided paradigm to achieve high-quality single-image translation. Our method employs a multiscale pyramid structure and utilizes the generator to perform image transformation after downsampling the input image. In addition, we introduce dual-branch spatial attention modules and hybrid convolution modules to enhance the quality of generated images, mitigate noise, and align more closely with human visual perception. This approach helps enhance the focus on the main subject while minimizing interference from background information, thereby producing excellent image translation results. Furthermore, comprehensive experimental validation and comparisons with benchmark datasets, including the Terra Cotta Warriors dataset, affirm the effectiveness of our method. Specifically, our research significantly improves performance, as evidenced by the decrease in the single-image Fréchet inception distance value to 1.83. This result underscores the superior performance of our method compared with the state-of-the-art approaches.

© 2024 SPIE and IS&T
Pengbo Zhou, Zhiqiang Yang, Longquan Yan, Guohua Geng, and Mingquan Zhou "Single-image translation based on improved spatial attention and hybrid convolution," Journal of Electronic Imaging 33(3), 033020 (17 May 2024). https://doi.org/10.1117/1.JEI.33.3.033020
Received: 21 February 2024; Accepted: 17 April 2024; Published: 17 May 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Image processing

Image quality

Data modeling

Education and training

Performance modeling

Visualization

Back to Top