Meta Adaptation using Importance Weighted Demonstrations (1911.10322v2)
Abstract: Imitation learning has gained immense popularity because of its high sample-efficiency. However, in real-world scenarios, where the trajectory distribution of most of the tasks dynamically shifts, model fitting on continuously aggregated data alone would be futile. In some cases, the distribution shifts, so much, that it is difficult for an agent to infer the new task. We propose a novel algorithm to generalize on any related task by leveraging prior knowledge on a set of specific tasks, which involves assigning importance weights to each past demonstration. We show experiments where the robot is trained from a diversity of environmental tasks and is also able to adapt to an unseen environment, using few-shot learning. We also developed a prototype robot system to test our approach on the task of visual navigation, and experimental results obtained were able to confirm these suppositions.
- “Global overview of Imitation Learning” In CoRR abs/1801.06503, 2018 arXiv: http://arxiv.org/abs/1801.06503
- “End to End Learning for Self-Driving Cars” In CoRR abs/1604.07316, 2016 arXiv: http://arxiv.org/abs/1604.07316
- Sonia Chernova and Manuela M. Veloso “Confidence-based policy learning from demonstration using Gaussian mixture models” In 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), Honolulu, Hawaii, USA, May 14-18, 2007, 2007, pp. 233 DOI: 10.1145/1329125.1329407
- “End-to-End Driving Via Conditional Imitation Learning” In 2018 IEEE International Conference on Robotics and Automation, ICRA 2018, Brisbane, Australia, May 21-25, 2018, 2018, pp. 1–9 DOI: 10.1109/ICRA.2018.8460487
- “One-Shot Imitation Learning” In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA, 2017, pp. 1087–1098 URL: http://papers.nips.cc/paper/6709-one-shot-imitation-learning
- “Model-based imitation learning by probabilistic trajectory matching” In 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, May 6-10, 2013, 2013, pp. 1922–1927 DOI: 10.1109/ICRA.2013.6630832
- Chelsea Finn, Pieter Abbeel and Sergey Levine “Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks” In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, 2017, pp. 1126–1135 URL: http://proceedings.mlr.press/v70/finn17a.html
- “One-Shot Visual Imitation Learning via Meta-Learning” In 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, November 13-15, 2017, Proceedings, 2017, pp. 357–368 URL: http://proceedings.mlr.press/v78/finn17a.html
- Roy Fox, Ari Pakman and Naftali Tishby “Taming the Noise in Reinforcement Learning via Soft Updates” In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, UAI 2016, June 25-29, 2016, New York City, NY, USA, 2016 URL: http://auai.org/uai2016/proceedings/papers/219.pdf
- “Reinforcement Learning from Imperfect Demonstrations” In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings, 2018 URL: https://openreview.net/forum?id=HytbCQG8z
- “Lightweight Learner for Shared Knowledge Lifelong Learning” In CoRR abs/2305.15591, 2023 DOI: 10.48550/arXiv.2305.15591
- “Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning” In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings, 2017 URL: https://openreview.net/forum?id=Hyq4yhile
- “Recurrent World Models Facilitate Policy Evolution” In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada., 2018, pp. 2455–2467 URL: http://papers.nips.cc/paper/7512-recurrent-world-models-facilitate-policy-evolution
- He He, Hal Daumé III and Jason Eisner “Imitation Learning by Coaching” In Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012, Lake Tahoe, Nevada, United States., 2012, pp. 3158–3166 URL: http://papers.nips.cc/paper/4545-imitation-learning-by-coaching
- “Deep Q-learning From Demonstrations” In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2-7, 2018, 2018, pp. 3223–3230 URL: https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16976
- “Learning from Demonstrations for Real World Reinforcement Learning” In CoRR abs/1704.03732, 2017 arXiv: http://arxiv.org/abs/1704.03732
- “Evolved Policy Gradients” In CoRR abs/1802.04821, 2018 arXiv: http://arxiv.org/abs/1802.04821
- Bingyi Kang, Zequn Jie and Jiashi Feng “Policy Optimization with Demonstrations” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 2474–2483 URL: http://proceedings.mlr.press/v80/kang18a.html
- “DART: Noise Injection for Robust Imitation Learning” In 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, November 13-15, 2017, Proceedings, 2017, pp. 143–156 URL: http://proceedings.mlr.press/v78/laskey17a.html
- “Hierarchical Imitation and Reinforcement Learning” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 2923–2932 URL: http://proceedings.mlr.press/v80/le18a.html
- Kiran Kumar Lekkala and Vinay Kumar Mittal “Accurate and augmented navigation for quadcopter based on multi-sensor fusion” In 2016 IEEE Annual India Conference (INDICON), 2016, pp. 1–6 IEEE
- Kiran Kumar Lekkala and Vinay Kumar Mittal “Artificial intelligence for precision movement robot” In 2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN), 2015, pp. 378–383 IEEE
- Kiran Kumar Lekkala and Vinay Kumar Mittal “PID controlled 2D precision robot” In 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT), 2014, pp. 1141–1145 IEEE
- “An Algorithmic Perspective on Imitation Learning” In Foundations and Trends in Robotics 7.1-2, 2018, pp. 1–179 DOI: 10.1561/2300000053
- “Agile Autonomous Driving using End-to-End Deep Imitation Learning” In Robotics: Science and Systems XIV, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 26-30, 2018, 2018 DOI: 10.15607/RSS.2018.XIV.056
- “Curiosity-driven Exploration by Self-supervised Prediction” In International Conference on Machine Learning (ICML), 2017
- “Dataset Shift in Machine Learning” The MIT Press, 2009
- “Learning to Reweight Examples for Robust Deep Learning” In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, 2018, pp. 4331–4340 URL: http://proceedings.mlr.press/v80/ren18a.html
- Stéphane Ross, Geoffrey J. Gordon and Drew Bagnell “A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning” In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11-13, 2011, 2011, pp. 627–635 URL: http://proceedings.mlr.press/v15/ross11a/ross11a.pdf
- “Meta learning Framework for Automated Driving” In CoRR abs/1706.04038, 2017 arXiv: http://arxiv.org/abs/1706.04038
- Jürgen Schmidhuber “Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990-2010)” In IEEE Trans. Autonomous Mental Development 2.3, 2010, pp. 230–247 DOI: 10.1109/TAMD.2010.2056368
- Jürgen Schmidhuber “Reinforcement Learning with Interacting Continually Running Fully Recurrent Networks” In International Neural Network Conference: July 9–13, 1990 Palais Des Congres — Paris — France Dordrecht: Springer Netherlands, 1990, pp. 817–820 DOI: 10.1007/978-94-009-0643-3˙97
- “Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction” In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, 2017, pp. 3309–3318 URL: http://proceedings.mlr.press/v70/sun17d.html
- “Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning” In CoRR abs/1812.00971, 2018 arXiv: http://arxiv.org/abs/1812.00971
- “Shared Multi-Task Imitation Learning for Indoor Self-Navigation” In IEEE Global Communications Conference, GLOBECOM 2018, Abu Dhabi, United Arab Emirates, December 9-13, 2018, 2018, pp. 1–7 DOI: 10.1109/GLOCOM.2018.8647614
- “Ferroelectric fet based context-switching fpga enabling dynamic reconfiguration for adaptive deep learning machines” In arXiv preprint arXiv:2212.00089, 2022
- “One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks” In CoRR abs/1810.11043, 2018 arXiv: http://arxiv.org/abs/1810.11043