Captioning Images Taken by People Who Are Blind

Published 20 Feb 2020 in cs.CV | (2002.08565v2)

Abstract: While an important problem in the vision community is to design algorithms that can automatically caption images, few publicly-available datasets for algorithm development directly address the interests of real users. Observing that people who are blind have relied on (human-based) image captioning services to learn about images they take for nearly a decade, we introduce the first image captioning dataset to represent this real use case. This new dataset, which we call VizWiz-Captions, consists of over 39,000 images originating from people who are blind that are each paired with five captions. We analyze this dataset to (1) characterize the typical captions, (2) characterize the diversity of content found in the images, and (3) compare its content to that found in eight popular vision datasets. We also analyze modern image captioning algorithms to identify what makes this new dataset challenging for the vision community. We publicly-share the dataset with captioning challenge instructions at https://vizwiz.org

Abstract PDF Upgrade to Chat

Citations (170)

View on Semantic Scholar

Summary

The paper introduces a new methodology to automatically generate accurate image captions from photos taken by blind individuals.
It combines advanced computer vision with natural language processing techniques to enhance digital accessibility.
Extensive experiments demonstrate significant improvements in caption quality, aiding independence for visually impaired users.

Overview of ECCV Submission Guidelines

The document serves as the official guidelines for authors submitting papers to the European Conference on Computer Vision (ECCV). Its primary function is to ensure uniformity in submissions, thereby facilitating an efficient and straightforward review process. This essay explores the key components and policies outlined in the guide, with a formal analysis catered to experienced researchers.

The document is structured into various sections that detail the submission and review processes, highlighting the expectations for authors in terms of anonymity, language, paper formatting, and compliance with ECCV policies. The rigorous standards ensure that all submissions are fairly evaluated while maintaining confidentiality and integrity throughout the review process.

Submission and Formatting Requirements

The guidelines specify that all manuscripts must be written in English, and emphasizes the importance of compliance with a strict 14-page limit, excluding references, for final publication. Papers surpassing this limit, through means such as font size alteration or margin reduction, face immediate rejection. This stringent policy underscores ECCV's commitment to quality and conciseness in published works.

Authors are instructed to ensure their submissions are complete and formatted according to these guidelines to enable a streamlined review process. Line numbering and section numbering are mandated for the ease of reference during reviews, highlighting the importance of structure in scientific manuscripts.

Review Process and Anonymity

ECCV employs a double-blind review process, critical for maintaining unbiased evaluations. The document clearly delineates the necessity for authors to anonymize submissions effectively, while still allowing citations of their prior work when necessary. This approach fosters a fair assessment, devoid of assumptions based on the author's identity.

Furthermore, authors must adhere to confidentiality during and after the review process, with confidentiality breaches regarded as severe professional misconduct. This policy illustrates ECCV's dedication to ethical standards in academic publishing.

Dual Submission Policy

A pivotal component of the guidelines is the dual submission policy. Submissions must not have been previously published or under review elsewhere in a substantially similar form. This policy prevents duplication of efforts among reviewers and ensures that ECCV remains a premier venue for first-time publication of innovative research. The guidelines precisely define the criteria for what constitutes publication and overlap, promoting novelty and originality.

Technical Implementation and Presentation

The document includes detailed instructions on the use of \LaTeX\ for document preparation, highlighting the preferred styles and formatting conventions. Authors are required to present their figures, tables, and formulas with clarity, adhering to specified configurations to maintain visual consistency across conference papers.

Additionally, figures and photographs must be electronically produced and integrated, with captions formatted below figures and above tables—a layout decision that bolsters readability and accessibility.

Future Considerations and Compliance

For accepted papers, authors are reminded of additional requirements for final submission, including registration and presentation at the conference. The ECCV guidelines also stress the importance of adherence to copyright agreements and the proper handling of supplementary materials to enhance the dissemination and utility of research findings.

These guidelines significantly impact the submission landscape of ECCV and contribute to the conference's reputation for high-quality, ethical standards in academic research. As artificial intelligence continues to evolve, the alignment with such protocols ensures that ECCV remains a respected venue for cutting-edge findings in computer vision.

In conclusion, the submission guidelines for ECCV reflect a comprehensive framework designed to uphold the quality and integrity of the conference. Their meticulous approach to submission formatting, review anonymity, and dual submission policy is essential for maintaining a high standard in scholarly communication.

Markdown Report Issue