Image Aesthetic Assessment: An Experimental Survey (1610.00838v2)

Published 4 Oct 2016 in cs.CV

Abstract: This survey aims at reviewing recent computer vision techniques used in the assessment of image aesthetic quality. Image aesthetic assessment aims at computationally distinguishing high-quality photos from low-quality ones based on photographic rules, typically in the form of binary classification or quality scoring. A variety of approaches has been proposed in the literature trying to solve this challenging problem. In this survey, we present a systematic listing of the reviewed approaches based on visual feature types (hand-crafted features and deep features) and evaluation criteria (dataset characteristics and evaluation metrics). Main contributions and novelties of the reviewed approaches are highlighted and discussed. In addition, following the emergence of deep learning techniques, we systematically evaluate recent deep learning settings that are useful for developing a robust deep model for aesthetic scoring. Experiments are conducted using simple yet solid baselines that are competitive with the current state-of-the-arts. Moreover, we discuss the possibility of manipulating the aesthetics of images through computational approaches. We hope that our survey could serve as a comprehensive reference source for future research on the study of image aesthetic assessment.

Citations (251)

View on Semantic Scholar

Summary

The paper provides a comprehensive survey of traditional and deep learning methodologies used in image aesthetic assessment, including hand-engineered features and CNN-based techniques.
Experiments show that fine-tuning and multi-task learning strategies significantly improve CNN performance for aesthetic classification, offering insights into category-specific learning.
The survey highlights challenges like subjectivity and data limitations, suggests exploring varied datasets and user-specific models, and demonstrates applications like aesthetic-driven image cropping.

An Experimental Survey on Image Aesthetic Assessment

The paper entitled "Image Aesthetic Assessment: An Experimental Survey" provides a comprehensive review of various methodologies in the field of image aesthetic assessment, a challenging domain within computer vision. The task involves computationally distinguishing between high-quality and low-quality photos based on photographic rules through binary classification or quality scoring. The survey explores numerous traditional and contemporary techniques aimed at solving this complex problem, arranging them based on visual feature types and evaluation criteria.

Traditional Approaches

The foundational approaches presented in the paper largely rely on hand-engineered features, meticulously crafted to mimic the underlying photographic rules. These typically include low-level descriptors such as brightness, contrast, and saliency, as well as higher-level indicators like image composition and regional clarity. The survey encompasses a detailed account of these features and methodologies, including their effectiveness and limitations.

Emergence of Deep Learning

A pivotal shift highlighted in the paper is the advent of deep learning, which has significantly improved the performance of image aesthetic assessment models. The paper examines how Convolutional Neural Networks (CNNs) and deep features have been leveraged to surpass the capabilities of conventional features. Techniques such as transfer learning and the use of large datasets have allowed for robust model training, leading to advancements in aesthetic quality classification.

Experimental Findings

In the survey, the authors conduct extensive experiments using baseline CNNs to elucidate various aspects of deep learning methodologies in image aesthetic assessment. They delve into training strategies like fine-tuning from pre-trained models, mini-batch formation tactics, and the utilization of auxiliary tasks to enhance learning performance. The results underscore the effectiveness of well-considered fine-tuning processes and multi-task learning strategies in boosting model robustness and accuracy.

Furthermore, the paper reveals that aesthetic classification using CNNs not only achieves competitive results with existing state-of-the-art methods but also offers insights into category-specific learning complexities. The researchers provide a layer-wise analysis to explore how different features contribute to aesthetic assessment across various image genres.

Image Aesthetic Manipulation

The survey also touches upon image aesthetic manipulation, specifically focusing on automatic image cropping as a practical application. The authors demonstrate the potential of CNNs trained for aesthetic assessment in enhancing image quality through aesthetically driven cropping, using convolutional response maps. This contribution showcases the broader applicability of aesthetic analysis beyond mere classification.

Implications and Future Directions

The research indicates that while significant strides have been made, challenges remain in achieving universal aesthetic judgment due to subjective nature and data limitations. The authors advocate for more varied and extensive datasets and suggest exploring user-specific models to accommodate individual aesthetic preferences. Future research could benefit from investigating advanced architectures and deeper models to further refine aesthetic understanding and assessment.

In conclusion, the survey offers a thorough account of the state-of-the-art in image aesthetic assessment, synthesizing past work and current trends, while providing practical insights and directions for future exploration. The integration of traditional feature engineering with cutting-edge deep learning techniques marks a promising pathway toward more refined aesthetic evaluation systems.

PDF Markdown