Sketch3D: Style-Consistent Guidance for Sketch-to-3D Generation (2404.01843v2)
Abstract: Recently, image-to-3D approaches have achieved significant results with a natural image as input. However, such color-rich inputs are not always available in practical applications, where only sketches may be at hand. Existing sketch-to-3D research suffers from limited applicability because sketches lack color information and multi-view content. To overcome these limitations, this paper proposes Sketch3D, a novel generation paradigm that produces realistic 3D assets whose shape is aligned with the input sketch and whose color matches the textual description. Concretely, Sketch3D first instantiates the given sketch as a reference image through a shape-preserving generation process. Second, the reference image is used to deduce a coarse 3D Gaussian prior, and multi-view style-consistent guidance images are generated from the renderings of the 3D Gaussians. Finally, three strategies are designed to optimize the 3D Gaussians: structural optimization via a distribution transfer mechanism, color optimization with a straightforward MSE loss, and sketch similarity optimization with a CLIP-based geometric similarity loss. Extensive visual comparisons and quantitative analyses illustrate the advantage of Sketch3D in generating realistic 3D assets while preserving consistency with the input sketch.
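As a rough illustration of the three optimization strategies named above, the following PyTorch-style sketch combines an MSE color term, a CLIP-based sketch similarity term, and a stand-in for the distribution transfer mechanism. It is a minimal sketch under stated assumptions: the `clip_encode` wrapper, the moment-matching form of the transfer, and the loss weights are illustrative choices, not taken from the paper.

```python
import torch.nn.functional as F

def clip_encode(clip_model, images):
    """Hypothetical wrapper around a CLIP image encoder (e.g., OpenAI CLIP's
    model.encode_image); returns unit-normalized embeddings for cosine similarity."""
    feats = clip_model.encode_image(images)      # (B, D) image embeddings
    return F.normalize(feats.float(), dim=-1)

def color_loss(rendering, guidance_image):
    """Color optimization: plain MSE between a rendered view and the
    corresponding style-consistent guidance image."""
    return F.mse_loss(rendering, guidance_image)

def sketch_similarity_loss(clip_model, rendered_edges, input_sketch):
    """Sketch similarity optimization: 1 - cosine similarity between CLIP
    embeddings of a sketch-like rendering and the input sketch."""
    f_r = clip_encode(clip_model, rendered_edges)
    f_s = clip_encode(clip_model, input_sketch)
    return (1.0 - (f_r * f_s).sum(dim=-1)).mean()

def distribution_transfer(positions, ref_positions):
    """Structural optimization (assumed form): moment matching that maps the
    Gaussian centers' distribution onto the reference distribution's statistics.
    The paper's exact mechanism may differ; this is an illustrative stand-in."""
    mu, sigma = positions.mean(0), positions.std(0) + 1e-8
    mu_r, sigma_r = ref_positions.mean(0), ref_positions.std(0)
    return (positions - mu) / sigma * sigma_r + mu_r

def sketch3d_losses(clip_model, rendering, guidance_image, rendered_edges,
                    input_sketch, positions, ref_positions,
                    w_color=1.0, w_sketch=0.1, w_struct=1.0):
    """Combine the three terms; the weights are placeholders, not paper values."""
    l_color = color_loss(rendering, guidance_image)
    l_sketch = sketch_similarity_loss(clip_model, rendered_edges, input_sketch)
    target = distribution_transfer(positions, ref_positions).detach()
    l_struct = F.mse_loss(positions, target)     # pull centers toward the prior
    return w_color * l_color + w_sketch * l_sketch + w_struct * l_struct
```

In practice the rendered views, the edge extraction that produces `rendered_edges`, and the Gaussian parameterization would all come from the splatting pipeline; the point of the sketch is only how the three loss terms attach to a rendered view, a guidance image, and the Gaussian centers.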