2000 character limit reached
Toward a Human-Level Video Understanding Intelligence (2110.04203v2)
Published 8 Oct 2021 in cs.AI, cs.CV, and cs.HC
Abstract: We aim to develop an AI agent that can watch video clips and have a conversation with human about the video story. Developing video understanding intelligence is a significantly challenging task, and evaluation methods for adequately measuring and analyzing the progress of AI agent are lacking as well. In this paper, we propose the Video Turing Test to provide effective and practical assessments of video understanding intelligence as well as human-likeness evaluation of AI agents. We define a general format and procedure of the Video Turing Test and present a case study to confirm the effectiveness and usefulness of the proposed test.
- Yu-Jung Heo (14 papers)
- Minsu Lee (13 papers)
- Seongho Choi (9 papers)
- Woo Suk Choi (3 papers)
- Minjung Shin (9 papers)
- Minjoon Jung (6 papers)
- Jeh-Kwang Ryu (6 papers)
- Byoung-Tak Zhang (83 papers)