Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 41 tok/s
Gemini 2.5 Pro 46 tok/s Pro
GPT-5 Medium 21 tok/s Pro
GPT-5 High 20 tok/s Pro
GPT-4o 91 tok/s Pro
Kimi K2 178 tok/s Pro
GPT OSS 120B 474 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

Towards Explainable Strategy Templates using NLP Transformers (2311.14061v1)

Published 23 Nov 2023 in cs.AI

Abstract: This paper bridges the gap between mathematical heuristic strategies learned from Deep Reinforcement Learning (DRL) in automated agent negotiation, and comprehensible, natural language explanations. Our aim is to make these strategies more accessible to non-experts. By leveraging traditional NLP techniques and LLMs equipped with Transformers, we outline how parts of DRL strategies composed of parts within strategy templates can be transformed into user-friendly, human-like English narratives. To achieve this, we present a top-level algorithm that involves parsing mathematical expressions of strategy templates, semantically interpreting variables and structures, generating rule-based primary explanations, and utilizing a Generative Pre-trained Transformer (GPT) model to refine and contextualize these explanations. Subsequent customization for varied audiences and meticulous validation processes in an example illustrate the applicability and potential of this approach.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Concurrent Bilateral Negotiation for Open E-Markets: The CONAN Strategy. Knowledge Information Systems 56, 2 (2018), 463–501.
  2. Decoupling negotiating agents to explore the space of negotiation strategies. In Novel Insights in Agent-based Complex Automated Negotiation. Springer, 61–83.
  3. Pallavi Bagga. 2021. Agent Learning for Automated Bilateral Negotiations. Ph. D. Dissertation. Royal Holloway, University of London.
  4. ANEGMA: an automated negotiation model for e-markets. Journal of Autonomous Agents and Multi-Agent Systems 35 (2021).
  5. Learnable strategies for bilateral agent negotiation over multiple issues. arXiv preprint arXiv:2009.08302 (2020).
  6. Pareto Bid Estimation for Multi-Issue Bilateral Negotiation under User Preference Uncertainty. In 2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE, 1–6.
  7. Deep learnable strategy templates for multi-issue bilateral negotiation. In Proc. of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022), P. Faliszewski, V. Mascardi, C. Pelachaud, and M.E. Taylor (Eds.).
  8. Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020).
  9. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation 6, 2 (2002), 182–197.
  10. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv preprint arXiv:1810.04805 (2018).
  11. Optimal negotiation strategies for agents with incomplete information. In ATAL’01. Springer, 377–392.
  12. A comparative study of game theoretic and evolutionary models of bargaining for software agents. Artificial Intelligence Review 23, 2 (2005), 187–205.
  13. Matthew Honnibal and Ines Montani. 2015. spaCy: Industrial-strength Natural Language Processing in Python. https://spacy.io.
  14. Ching-Lai Hwang and Kwangsun Yoon. 1981. Methods for multiple attribute decision making. In Multiple attribute decision making. Springer, 58–191.
  15. FinRL: Deep reinforcement learning framework to automate trading in quantitative finance. In Proceedings of the second ACM international conference on AI in finance. 1–9.
  16. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019).
  17. SymPy: symbolic computing in Python. PeerJ Computer Science 3 (2017), e103. https://www.sympy.org
  18. NASA Software. 1985. C Language Integrated Production System (CLIPS). NASA Lyndon B. Johnson Space Center, Houston, Texas. https://www.clipsrules.net/ Version 6.31.
  19. OpenAI. 2023. GPT-4: Technical Report. arXiv preprint arXiv:4812508 (2023). https://cdn.openai.com/papers/gpt-4.pdf
  20. Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research 21, 1 (2020), 5485–5551.
  21. Ariel Rubinstein. 1982. Perfect equilibrium in a bargaining model. Econometrica: Journal of the Econometric Society (1982), 97–109.
  22. Automating supply chain negotiations using autonomous agents: a case study in transportation logistics. In Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems. 1506–1513.
  23. Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems 32 (2019).
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-Up Questions

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube