Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
175 tokens/sec
GPT-4o
8 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Seeing the Intangible: Survey of Image Classification into High-Level and Abstract Categories (2308.10562v2)

Published 21 Aug 2023 in cs.CV and cs.CY

Abstract: The field of Computer Vision (CV) is increasingly shifting towards ``high-level'' visual sensemaking tasks, yet the exact nature of these tasks remains unclear and tacit. This survey paper addresses this ambiguity by systematically reviewing research on high-level visual understanding, focusing particularly on Abstract Concepts (ACs) in automatic image classification. Our survey contributes in three main ways: Firstly, it clarifies the tacit understanding of high-level semantics in CV through a multidisciplinary analysis, and categorization into distinct clusters, including commonsense, emotional, aesthetic, and inductive interpretative semantics. Secondly, it identifies and categorizes computer vision tasks associated with high-level visual sensemaking, offering insights into the diverse research areas within this domain. Lastly, it examines how abstract concepts such as values and ideologies are handled in CV, revealing challenges and opportunities in AC-based image classification. Notably, our survey of AC image classification tasks highlights persistent challenges, such as the limited efficacy of massive datasets and the importance of integrating supplementary information and mid-level features. We emphasize the growing relevance of hybrid AI systems in addressing the multifaceted nature of AC image classification tasks. Overall, this survey enhances our understanding of high-level visual reasoning in CV and lays the groundwork for future research endeavors.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (119)
  1. A Methodology for Semantic Enrichment of Cultural Heritage Images Using Artificial Intelligence Technologies. Journal of Imaging 7, 8 (2021), 121. https://doi.org/10.3390/jimaging7080121
  2. ArtEmis: Affective Language for Visual Art. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (Nashville, TN, USA). Computer Vision Foundation / IEEE, 11569–11579. https://doi.org/10.1109/cvpr46437.2021.01140
  3. Integrating knowledge and reasoning in image understanding. In 28th International Joint Conference on Artificial Intelligence, IJCAI 2019. International Joint Conferences on Artificial Intelligence, 6252–6259.
  4. Youssef Ahres and Nikolaus Volk. 2016. Abstract Concept and Emotion Detection in Tagged Images with CNNs. Unpublished Report, accessed from http://cs231n. stanford. edu/reports/2016/pdfs/008_ Report. pdf (2016), 8.
  5. Taylor Arnold and Lauren Tilton. 2019. Distant viewing: analyzing large visual corpora. Digital Scholarship in the Humanities 34, Supplement_1 (2019), i3–i16. https://doi.org/10.1093/llc/fqz013
  6. Multimodal Analysis of Cohesion in Multi-party Interactions. In Lrec. Marseille, France, 498–507.
  7. Deep learning architectures for computer vision applications: a study. In Advances in data and information sciences. Springer, 601–612.
  8. Lawrence W Barsalou. 2003. Abstraction in perceptual symbol systems. Philosophical Trans. of the Royal Society B: Biological Sciences 358, 1435 (2003), 1177–1187.
  9. Words as social tools: Language, sociality and inner grounding in abstract concepts. Physics of Life Reviews 29 (2019), 120–153. https://doi.org/10.1016/j.plrev.2018.12.001
  10. Varieties of abstract concepts: development, use and representation in the brain. Philosophical Transactions of the Royal Society B: Biological Sciences 373, 1752 (2018), 20170121. https://doi.org/10.1098/rstb.2017.0121
  11. Anna M. Borghi and Ferdinand Binkofski. 2014. Words as social tools: An embodied view on abstract concepts. Vol. 2. Springer.
  12. Event Recognition in Photo Collections with a Stopwatch HMM. In 2013 IEEE International Conference on Computer Vision. Ieee, Sydney, Australia, 1193–1200. https://doi.org/10.1109/iccv.2013.151
  13. Identifying liars through automatic decoding of children’s facial expressions. Child development 91, 4 (2020), e995–e1011.
  14. Concreteness ratings for 40 thousand generally known English word lemmas. Behavior research methods 46 (2014), 904–911.
  15. Fatality killed the cat or: BabelPic, a multimodal dataset for non-concrete concepts. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 4680–4686. https://doi.org/10.18653/v1/2020.acl-main.425
  16. Emotional modelling and classification of a large-scale collection of scene images in a cluster environment. Plos One 13, 1 (2018), e0191064. https://doi.org/10.1371/journal.pone.0191064
  17. A Deep Learning Perspective on Beauty, Sentiment, and Remembrance of Art. IEEE Access 7 (2019), 73694–73710. https://doi.org/10.1109/access.2019.2921101
  18. We are Humor Beings: Understanding and Predicting Visual Humor. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Las Vegas, NV, USA, 4603–4612. https://doi.org/10.1109/cvpr.2016.498
  19. Multi-task Recurrent Neural Network for Immediacy Prediction. In 2015 IEEE International Conference on Computer Vision (ICCV). Ieee, Santiago, Chile, 3352–3360. https://doi.org/10.1109/iccv.2015.383
  20. NUS-WIDE: a real-world web image database from National University of Singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval (Civr ’09). Association for Computing Machinery, New York, NY, USA, 1–9. https://doi.org/10.1145/1646396.1646452
  21. Learning to Act Properly: Predicting and Explaining Affordances from Images. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Ieee, Salt Lake City, UT, 975–983. https://doi.org/10.1109/cvpr.2018.00108
  22. Ensemble Learning on Visual and Textual Data for Social Image Emotion Classification. International Journal of Machine Learning and Cybernetics 10, 8 (2019), 2057–2070. https://doi.org/10.1007/s13042-017-0734-0
  23. Unveiling the multimedia unconscious: Implicit cognitive processes and multimedia content analysis. In Proceedings of the 21st ACM international conference on Multimedia. 213–222.
  24. Recognition of Affective and Grammatical Facial Expressions: A Study for Brazilian Sign Language. In Computer Vision – ECCV 2020 Workshops (Lecture Notes in Computer Science), Adrien Bartoli and Andrea Fusiello (Eds.). Springer International Publishing, Cham, 218–236. https://doi.org/10.1007/978-3-030-66096-3_16
  25. Studying Aesthetics in Photographic Images Using a Computational Approach. In Computer Vision – ECCV 2006 (Lecture Notes in Computer Science), Aleš Leonardis, Horst Bischof, and Axel Pinz (Eds.). Springer, Berlin, Heidelberg, 288–301. https://doi.org/10.1007/11744078_23
  26. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248–255.
  27. Are You Really Smiling at Me? Spontaneous versus Posed Enjoyment Smiles. In Computer Vision – ECCV 2012 (Lecture Notes in Computer Science), Andrew Fitzgibbon, Svetlana Lazebnik, Pietro Perona, Yoichi Sato, and Cordelia Schmid (Eds.). Springer, Berlin, Heidelberg, 525–538. https://doi.org/10.1007/978-3-642-33712-3_38
  28. John P Eakins. 2000. Retrieval of still images by content. In European Summer School on Information Retrieval. Springer, 111–138.
  29. Jim Edwards. 2014. We are Now Posting a Staggering 1.8 Billion Photos to Social Media Every Day.
  30. Peter Enser and Peter Enser. 1999. Visual image retrieval: seeking the alliance of concept-based and content-based paradigms.
  31. ChaLearn Looking at People 2015: Apparent Age and Cultural Event Recognition Datasets and Results. In 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). Ieee, Santiago, Chile, 243–251. https://doi.org/10.1109/iccvw.2015.40
  32. Andrew C. Gallagher and Tsuhan Chen. 2009. Understanding Images of Groups of People. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. Ieee, Miami, FL, 256–263. https://doi.org/10.1109/cvpr.2009.5206828
  33. Noa Garcia and George Vogiatzis. 2018. How to Read Paintings: Semantic Art Understanding with Multi-Modal Retrieval. 0–0.
  34. Cross-lingual Visual Verb Sense Disambiguation. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 1998–2004.
  35. Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings. In Proceedings of NAACL-HLT. 182–192.
  36. Shreya Ghosh and Abhinav Dhall. 2019. Role of Group Level Affect to Find the Most Influential Person in Images. In Computer Vision – ECCV 2018 Workshops (Lecture Notes in Computer Science), Laura Leal-Taixé and Stefan Roth (Eds.). Springer International Publishing, Cham, 518–533. https://doi.org/10.1007/978-3-030-11012-3_39
  37. An End-To-End Network for Generating Social Relationship Graphs. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Long Beach, CA, USA, 11178–11187. https://doi.org/10.1109/cvpr.2019.01144
  38. Predicting Facial Beauty without Landmarks. In Computer Vision – ECCV 2010 (Lecture Notes in Computer Science), Kostas Daniilidis, Petros Maragos, and Nikos Paragios (Eds.). Springer, Berlin, Heidelberg, 434–447. https://doi.org/10.1007/978-3-642-15567-3_32
  39. Howard Greisdorf and Brian O’Connor. 2002. Modelling what users see when they look at images: a cognitive viewpoint. Journal of Documentation 58, 1 (2002), 6–29. https://doi.org/10.1108/00220410210425386
  40. Detecting Persuasive Atypicality by Modeling Contextual Compatibility. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). Ieee, Montreal, QC, Canada, 952–962. https://doi.org/10.1109/iccv48922.2021.00101
  41. Jointly Embedding Knowledge Graphs and Logical Rules. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Austin, Texas, 192–202. https://doi.org/10.18653/v1/D16-1019
  42. Mind the gap: another look at the problem of the semantic gap in image retrieval. In Multimedia Content Analysis, Management, and Retrieval 2006 (San Jose, CA), Edward Y Chang, Alan Hanjalic, and Nicu Sebe (Eds.), Vol. 6073. International Society for Optics and Photonics, Spie, 607309. https://doi.org/10.1117/12.647755
  43. The Semantic Content of Abstract Concepts: A Property Listing Study of 296 Abstract Words. Frontiers in Psychology 9 (2018), 1748. https://doi.org/10.3389/fpsyg.2018.01748
  44. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961–2969.
  45. Wei-Lin Hsiao and Kristen Grauman. 2017. Learning the Latent “Look”: Unsupervised Discovery of a Style-Coherent Embedding from Fashion Images. In 2017 IEEE International Conference on Computer Vision (ICCV). Ieee, Venice, 4213–4222. https://doi.org/10.1109/iccv.2017.451
  46. X. Huang and A. Kovashka. 2016. Inferring Visual Persuasion via Body Language, Setting, and Deep Features. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 778–784. https://doi.org/10.1109/cvprw.2016.102
  47. Hayley Hung and Daniel Gatica-Perez. 2010. Estimating cohesion in small groups using audio-visual nonverbal behavior. IEEE Transactions on Multimedia 12, 6 (2010), 563–575.
  48. Automatic Understanding of Image and Video Advertisements. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1705–1715.
  49. Intentonomy: a Dataset and Study towards Human Intent Understanding. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Nashville, TN, USA, 12981–12991. https://doi.org/10.1109/cvpr46437.2021.01279
  50. Visual Persuasion: Inferring Communicative Intents of Images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 216–223.
  51. Automated Facial Trait Judgment and Election Outcome Prediction: Social Dimensions of Face. In 2015 IEEE International Conference on Computer Vision (ICCV). Ieee, Santiago, Chile, 3712–3720. https://doi.org/10.1109/iccv.2015.423
  52. Corinne Jörgensen. 2003. Image Retrieval: Theory and Research. Scarecrow Press.
  53. Nasrin Kalanat and Adriana Kovashka. 2022. Symbolic image detection using scene and knowledge graphs. arXiv preprint arXiv:2206.04863 (2022).
  54. Alex Kendall and Roberto Cipolla. 2017. Geometric loss functions for camera pose regression with deep learning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5974–5983.
  55. Looking Beyond the Visible Scene. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. Ieee, Columbus, OH, USA, 3710–3717. https://doi.org/10.1109/cvpr.2014.474
  56. Hipster Wars: Discovering Elements of Fashion Styles. In Computer Vision – ECCV 2014 (Lecture Notes in Computer Science), David Fleet, Tomas Pajdla, Bernt Schiele, and Tinne Tuytelaars (Eds.). Springer International Publishing, Cham, 472–488. https://doi.org/10.1007/978-3-319-10590-1_31
  57. Douwe Kiela and Léon Bottou. 2014. Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, 36–45. https://doi.org/10.3115/v1/D14-1005
  58. Sewa db: A rich database for audio-visual emotion and sentiment research in the wild. IEEE transactions on pattern analysis and machine intelligence 43, 3 (2019), 1022–1040.
  59. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25 (2012).
  60. The Open Images Dataset V4. International Journal of Computer Vision 128, 7 (2020), 1956–1981. https://doi.org/10.1007/s11263-020-01316-z
  61. Angeliki Lazaridou et al. 2015. Combining Language and Vision with a Multimodal Skip-gram Model. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics.
  62. Entangled Transformer for Image Captioning. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Ieee, Seoul, Korea (South), 8927–8936. https://doi.org/10.1109/iccv.2019.00902
  63. Dual-Glance Model for Deciphering Social Relationships. In 2017 IEEE International Conference on Computer Vision (ICCV). Ieee, Venice, 2669–2678. https://doi.org/10.1109/iccv.2017.289
  64. Visual Social Relationship Recognition. International Journal of Computer Vision 128, 6 (2020), 1750–1764. https://doi.org/10.1007/s11263-020-01295-1
  65. Situation Recognition with Graph Neural Networks. In 2017 IEEE International Conference on Computer Vision (ICCV). Ieee, Venice, 4183–4192. https://doi.org/10.1109/iccv.2017.448
  66. Graph-Based Social Relation Reasoning. In Computer Vision – ECCV 2020 (Lecture Notes in Computer Science), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer International Publishing, Cham, 18–34. https://doi.org/10.1007/978-3-030-58555-6_2
  67. Exploiting Feature Hierarchies with Convolutional Neural Networks for Cultural Event Recognition. In 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). Ieee, Santiago, Chile, 274–279. https://doi.org/10.1109/iccvw.2015.44
  68. Attend and imagine: Multi-label image classification with visual attention and recurrent neural networks. IEEE Transactions on Multimedia 21, 8 (2019), 1971–1981.
  69. The Role of Facial Regions in Evaluating Social Dimensions. In Computer Vision – ECCV 2012. Workshops and Demonstrations (Lecture Notes in Computer Science), Andrea Fusiello, Vittorio Murino, and Rita Cucchiara (Eds.). Springer, Berlin, Heidelberg, 210–219. https://doi.org/10.1007/978-3-642-33868-7_21
  70. Saif Mohammad and Svetlana Kiritchenko. 2018a. WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki, Japan.
  71. Saif M. Mohammad and Svetlana Kiritchenko. 2018b. WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Kôiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga (Eds.). European Language Resources Association (ELRA).
  72. Affectnet: A database for facial expression, valence, and arousal computing in the wild. IEEE Transactions on Affective Computing 10, 1 (2017), 18–31.
  73. Survey on Visual Sentiment Analysis. IET Image Processing 14, 8 (2020), 1440–1456. https://doi.org/10.1049/iet-ipr.2019.1270
  74. Erwin Panofsky and Benjamin Drechsel. 1955. Meaning in the visual arts. University of Chicago Press Chicago.
  75. Grounded situation recognition. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IV 16 (Lecture Notes in Computer Science), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer, Springer International Publishing, Cham, 314–332. https://doi.org/10.1007/978-3-030-58548-8_19
  76. Rahul Raguram and Svetlana Lazebnik. 2008. Computing iconic summaries of general visual concepts. In 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Ieee, Anchorage, AK, USA, 1–8. https://doi.org/10.1109/cvprw.2008.4562959
  77. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779–788.
  78. Joseph Redmon and Ali Farhadi. 2017. YOLO9000: better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7263–7271.
  79. DLDR: Deep Linear Discriminative Retrieval for Cultural Event Classification from a Single Image. In 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). Ieee, Santiago, Chile, 295–302. https://doi.org/10.1109/iccvw.2015.47
  80. Tracking historical changes in trustworthiness using machine learning analyses of facial cues in paintings. Nature Communications 11, 1 (2020), 4728. https://doi.org/10.1038/s41467-020-18566-7
  81. Social Profiling through Image Understanding: Personality Inference Using Convolutional Neural Networks. Computer Vision and Image Understanding 156 (2017), 34–50. https://doi.org/10.1016/j.cviu.2016.10.013
  82. What Do You Do? Occupation Recognition in a Photo via Social Context. In 2013 IEEE International Conference on Computer Vision. Ieee, Sydney, Australia, 3631–3638. https://doi.org/10.1109/iccv.2013.451
  83. SemEval-2020 Task 8: Memotion Analysis-the Visuo-Lingual Metaphor!. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. 759–773.
  84. Sara Shatford. 1986. Analyzing the Subject of a Picture: A Theoretical Approach. (1986). https://doi.org/10.1300/j104v06n03_04
  85. Content-based image retrieval at the end of the early years. IEEE Transactions on pattern analysis and machine intelligence 22, 12 (2000), 1349–1380. https://doi.org/10.1109/34.895972
  86. From Groups to Leaders and Back. In Group and Crowd Behavior for Computer Vision. Elsevier, 161–182. https://doi.org/10.1016/b978-0-12-809276-7.00010-2
  87. Sebastian Stabinger and Antonio Rodriguez-Sanchez. 2017. Evaluation of deep learning on an abstract image classification dataset. In Proceedings of the IEEE International Conference on Computer Vision Workshops. 2767–2772.
  88. Artpedia: A new visual-semantic dataset with visual and contextual sentences in the artistic domain. In Image Analysis and Processing–ICIAP 2019: 20th International Conference, Trento, Italy, September 9–13, 2019, Proceedings, Part II 20. Springer, 729–740.
  89. Mohammed Suhail and Leonid Sigal. 2019. Mixture-Kernel Graph Attention Network for Situation Recognition. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Ieee, Seoul, Korea (South), 10362–10371. https://doi.org/10.1109/iccv.2019.01046
  90. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era. In 2017 IEEE International Conference on Computer Vision (ICCV). Ieee, Venice, 843–852. https://doi.org/10.1109/iccv.2017.97
  91. A Domain Based Approach to Social Relation Recognition. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Honolulu, HI, 3481–3490. https://doi.org/10.1109/cvpr.2017.54
  92. Christopher Thomas and Adriana Kovashka. 2019. Predicting the Politics of an Image Using Webly Supervised Data. In Advances in neural information processing systems, Vol. 32. Curran Associates, Inc. https://doi.org/10.48550/arxiv.1911.00147
  93. Christopher Thomas and Adriana Kovashka. 2020. Preserving Semantic Neighborhoods for Robust Cross-Modal Retrieval. In Computer Vision – ECCV 2020 (Lecture Notes in Computer Science), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer International Publishing, Cham, 317–335. https://doi.org/10.1007/978-3-030-58523-5_19
  94. Christopher Thomas and Adriana Kovashka. 2021. Predicting Visual Political Bias Using Webly Supervised Data and an Auxiliary Task. International Journal of Computer Vision 129, 11 (2021), 2978–3003. https://doi.org/10.1007/s11263-021-01506-3
  95. Estimation of Continuous Valence and Arousal Levels from Faces in Naturalistic Conditions. Nature Machine Intelligence 3, 1 (2021), 42–50. https://doi.org/10.1038/s42256-020-00280-0
  96. Cross-Media Learning for Image Sentiment Analysis in the Wild. In 2017 IEEE International Conference on Computer Vision Workshops (ICCVW). Ieee, Venice, 308–317. https://doi.org/10.1109/iccvw.2017.45
  97. Modular design patterns for hybrid learning and reasoning systems: a taxonomy, patterns and use cases. Applied Intelligence 51, 9 (2021), 6528–6546.
  98. Computer Vision and Human Behaviour, Emotion and Cognition Detection: A Use Case on Student Engagement. Mathematics 9, 3 (2021), 287. https://doi.org/10.3390/math9030287
  99. Elizabeth B. Varghese and Sabu M. Thampi. 2018. A Deep Learning Approach to Predict Crowd Behavior Based on Emotion. In Smart Multimedia (Lecture Notes in Computer Science), Anup Basu and Stefano Berretti (Eds.). Springer International Publishing, Cham, 296–307. https://doi.org/10.1007/978-3-030-04375-9_25
  100. Automatic emotion recognition for groups: a review. IEEE Transactions on Affective Computing (2021), 1–1. https://doi.org/10.1109/taffc.2021.3065726
  101. Varieties of abstract concepts and their multiple dimensions. Language and Cognition 11, 3 (2019), 403–430. https://doi.org/10.1017/langcog.2019.23
  102. Seeing People in Social Context: Recognizing People and Social Relationships. In Computer Vision – ECCV 2010 (Lecture Notes in Computer Science), Kostas Daniilidis, Petros Maragos, and Nikos Paragios (Eds.). Springer, Berlin, Heidelberg, 169–182. https://doi.org/10.1007/978-3-642-15555-0_13
  103. Better Exploiting OS-CNNs for Better Event Recognition in Images. In 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). Ieee, Santiago, Chile, 287–294. https://doi.org/10.1109/iccvw.2015.46
  104. Weining Wang and Qianhua He. 2008. A survey on emotional semantic image retrieval. In 2008 15th IEEE International Conference on Image Processing. 117–120. https://doi.org/10.1109/icip.2008.4711705
  105. Deep Spatial Pyramid Ensemble for Cultural Event Recognition. In 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). Ieee, Santiago, Chile, 280–286. https://doi.org/10.1109/iccvw.2015.45
  106. Understanding and Mapping Natural Beauty. In 2017 IEEE International Conference on Computer Vision (ICCV). Ieee, Venice, 5590–5599. https://doi.org/10.1109/iccv.2017.596
  107. Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning. IEEE Access 7 (2019), 172683–172693. https://doi.org/10.1109/access.2019.2956775
  108. Attention-Aware Polarity Sensitive Embedding for Affective Image Retrieval. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Ieee, Seoul, Korea (South), 1140–1150. https://doi.org/10.1109/iccv.2019.00123
  109. Situation Recognition: Visual Semantic Role Labeling for Image Understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Las Vegas, NV, USA, 5534–5542. https://doi.org/10.1109/cvpr.2016.597
  110. K. Ye and A. Kovashka. 2018. ADVISE: Symbolism and External Knowledge for Decoding Advertisements. In Computer Vision – ECCV 2018, Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss (Eds.). Vol. 11219 Lncs. Springer International Publishing, Cham, 868–886. https://doi.org/10.1007/978-3-030-01267-0_51
  111. Interpreting the Rhetoric of Visual Advertisements. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 4 (2019), 1308–1323. https://doi.org/10.1109/tpami.2019.2947440
  112. Eiling Yee. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34, 10 (2019), 1257–1265. https://doi.org/10.1080/23273798.2019.1660797
  113. Recognize Complex Events from Static Images by Fusing Deep Channels. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Ieee, Boston, MA, USA, 1600–1609. https://doi.org/10.1109/cvpr.2015.7298768
  114. Automated decoding of facial expressions reveals marked differences in children when telling antisocial versus prosocial lies. Journal of Experimental Child Psychology 150 (2016), 165–179. https://doi.org/10.1016/j.jecp.2016.05.007
  115. A review on automatic image annotation techniques. Pattern Recognition 45, 1 (2012), 346–362. https://doi.org/10.1016/j.patcog.2011.05.013
  116. Learning Social Relation Traits from Face Images. In 2015 IEEE International Conference on Computer Vision (ICCV). Ieee, Santiago, Chile, 3631–3639. https://doi.org/10.1109/iccv.2015.414
  117. From Facial Expression Recognition to Interpersonal Relation Prediction. International Journal of Computer Vision 126, 5 (2018), 550–569. https://doi.org/10.1007/s11263-017-1055-1
  118. Affective Image Content Analysis: A Comprehensive Survey. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization, Stockholm, Sweden, 5534–5541. https://doi.org/10.24963/ijcai.2018/780
  119. Computational emotion analysis from images: Recent advances and future directions. Human Perception of Visual Information: Psychological and Computational Perspectives (2022), 85–113.
Citations (1)

Summary

We haven't generated a summary for this paper yet.