UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation (2303.16280v3)

Published 28 Mar 2023 in cs.CV

Abstract: An unpaired image-to-image (I2I) translation technique seeks to find a mapping between two domains of data in a fully unsupervised manner. While initial solutions to the I2I problem were provided by generative adversarial neural networks (GANs), diffusion models (DMs) currently hold the state-of-the-art status on the I2I translation benchmarks in terms of Frechet inception distance (FID). Yet, DMs suffer from limitations, such as not using data from the source domain during the training or maintaining consistency of the source and translated images only via simple pixel-wise errors. This work improves a recent UVCGAN model and equips it with modern advancements in model architectures and training procedures. The resulting revised model significantly outperforms other advanced GAN- and DM-based competitors on a variety of benchmarks. In the case of Male-to-Female translation of CelebA, the model achieves more than 40% improvement in FID score compared to the state-of-the-art results. This work also demonstrates the ineffectiveness of the pixel-wise I2I translation faithfulness metrics and suggests their revision. The code and trained models are available at https://github.com/LS4GAN/uvcgan2

Citations (8)

View on Semantic Scholar

Summary

The paper presents UVCGAN v2, a novel cycle-consistent GAN that significantly improves unpaired image translation performance.
It introduces advanced architecture modifications and training strategies to boost image quality and translation reliability.
The method outperforms previous models on benchmark datasets, offering actionable insights for practical computer vision applications.

Summary of "LaTeX Author Guidelines for WACV Proceedings"

This document provides comprehensive guidance for authors intending to submit manuscripts to the Workshop on Applications of Computer Vision (WACV) proceedings using the LaTeX document preparation system. Authored by experienced members of the academic community, the guidelines are meticulously tailored to ensure uniformity and clarity in the presentation of scholarly work. The document covers a wide array of considerations including formatting, structure, and submission protocols.

Key Details

The guidelines commence with an abstract that, while formatted in a standardized manner, typically does not include substantive content. The introduction underscores updates and modifications from previous guidelines, aiming to eliminate common ambiguities encountered by authors.

The paper mandates that all manuscripts be submitted in English and addresses issues of dual submission by directing authors to the relevant policies on the WACV website. Importantly, the length of submissions is strictly capped at eight pages, excluding references. Papers exceeding this limit will be automatically rejected, with no allowances for overlength, emphasizing the importance of conciseness and adherence to formatting rules.

The inclusion of a printed ruler in the review version assists reviewers in providing precise feedback. Authors are encouraged to maintain consistent formatting when preparing their submissions, with specific instructions on type style, use of equations, and cross-referencing.

Blind Review Process

An essential aspect covered is the requirement for blind review compliance. Authors are advised to avoid self-referential language that might compromise anonymity. This section reinforces the academic standard for citation discussion, cautioning authors against inadvertently revealing their identities through references to previous work or acknowledgments.

Formatting Specifications

The article delineates detailed specifications for formatting texts, figures, tables, and equations within the manuscript. Notable points include the restriction of the text to a two-column layout and specific margin measurements suited for either US or A4 paper sizes. Font styles and sizes are prescribed, notably Times in various point sizes for different elements within the paper.

Mathematical and Technical Documentation

The guide enforces the proper numbering of equations for ease of reference, aligning with best practices for mathematical presentation in academic papers. Additionally, the document emphasizes consistency in the presentation of figures and tables, ensuring these elements are legible in both electronic and printed forms.

Use of Graphics and Color Vision Considerations

Regarding illustrations and graphs, authors are prompted to focus on clarity and accessibility, particularly for readers with color vision deficiencies. Authors should introduce additional features in graphs to accommodate such considerations, thereby ensuring the accessibility of the document to a broader audience.

Administrative and Procedural Requirements

The final sections of the document cover administrative tasks related to submission, including the necessity for a signed IEEE copyright release form. This underscores the legal and procedural elements adjacent to the academic submission process.

Implications and Future Potential

The detailed nature of these guidelines not only seeks to uphold consistency across submissions but also serves as an educational tool for authors, particularly those newer to the field of computer vision. These guidelines reflect broader industry trends towards standardizing submissions for consistency in review and publication. Moving forward, the protocols outlined may evolve with technological advancements in document preparation and review processes, potentially incorporating automated checks for compliance and more dynamic review capabilities.

In summary, the WACV LaTeX Author Guidelines serve as an essential resource for authors, promoting orderly and professional presentation of research through careful adherence to established norms. These guidelines not only facilitate the review process but also contribute to the overall readability and integrity of the conference proceedings.

PDF Markdown

Related Papers

GitHub

GitHub - LS4GAN/uvcgan2: UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation (144 stars)

YouTube

Show All Videos