OMNI: Open-endedness via Models of human Notions of Interestingness (2306.01711v3)
Abstract: Open-ended algorithms aim to learn new, interesting behaviors forever. That requires a vast environment search space, but such a space contains infinitely many possible tasks. Even after filtering for tasks the current agent can learn (i.e., tasks with learning progress), countless learnable yet uninteresting tasks remain (e.g., minor variations of previously learned tasks). An Achilles' heel of open-endedness research is the inability to quantify (and thus prioritize) tasks that are not just learnable, but also $\textit{interesting}$ (e.g., worthwhile and novel). We propose solving this problem via $\textit{Open-endedness via Models of human Notions of Interestingness}$ (OMNI). The insight is that we can utilize foundation models (FMs) as a model of interestingness (MoI), because they $\textit{already}$ internalize human concepts of interestingness from training on vast amounts of human-generated data, in which humans naturally write about what they find interesting or boring. We show that FM-based MoIs improve open-ended learning by focusing on tasks that are both learnable $\textit{and interesting}$, outperforming baselines based on uniform task sampling or learning progress alone. This approach has the potential to dramatically advance the ability to intelligently select which tasks to focus on next (i.e., auto-curricula), and could be seen as AI selecting its own next task to learn, facilitating self-improving AI and AI-Generating Algorithms. Project website at https://www.jennyzhangzt.com/omni/
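The core mechanism described above — weighting each candidate task by its learning progress, gated by a foundation-model judgment of interestingness — can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the `model_of_interestingness` stub below returns hand-written scores, whereas in OMNI that call is a prompt to a foundation model; the task names and learning-progress values are likewise hypothetical.

```python
# Sketch of OMNI-style task sampling: learning progress gated by an
# interestingness score. All task names, scores, and the MoI stub are
# illustrative assumptions, not values from the paper.

def model_of_interestingness(task: str) -> float:
    """Stub MoI returning a score in [0, 1]. In OMNI this would be a
    foundation-model query over the agent's learning history."""
    scores = {
        "collect wood": 0.9,           # novel, worthwhile task
        "collect wood (variant)": 0.1, # minor variation of a learned task
        "craft table": 0.8,
    }
    return scores.get(task, 0.5)


def sampling_weights(tasks, learning_progress, moi=model_of_interestingness):
    """Return a probability distribution over tasks, weighting each by
    learning progress * interestingness, so the auto-curriculum focuses
    on tasks that are both learnable AND interesting."""
    raw = [learning_progress[t] * moi(t) for t in tasks]
    total = sum(raw)
    if total == 0.0:
        return [1.0 / len(tasks)] * len(tasks)  # fall back to uniform
    return [w / total for w in raw]


tasks = ["collect wood", "collect wood (variant)", "craft table"]
# Both "collect wood" tasks show identical learning progress, so a
# progress-only curriculum cannot tell them apart; the MoI can.
lp = {"collect wood": 0.5, "collect wood (variant)": 0.5, "craft table": 0.3}
weights = sampling_weights(tasks, lp)
```

Note the key contrast with a learning-progress-only baseline: the two "collect wood" tasks have identical learning progress, so only the MoI term down-weights the uninteresting variant.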