SAILOR: Perceptual Anchoring For Robotic Cognitive Architectures (2303.08204v2)
Abstract: Symbolic anchoring is a crucial problem in the field of robotics, as it enables robots to obtain symbolic knowledge from the perceptual information acquired through their sensors. In cognitive-based robots, this process of processing sub-symbolic data from real-world sensors to obtain symbolic knowledge is still an open problem. To address this issue, this paper presents SAILOR, a framework for providing symbolic anchoring in the ROS 2 ecosystem. SAILOR aims to maintain the link between symbolic data and perceptual data in real robots over time, increasing the intelligent behavior of robots. It provides a semantic world modeling approach using two deep learning-based sub-symbolic robotic skills: object recognition and matching function. The object recognition skill allows the robot to recognize and identify objects in its environment, while the matching function enables the robot to decide if new perceptual data corresponds to existing symbolic data. This paper provides a description of the proposed method and the development of the framework, as well as its integration in MERLIN2 (a hybrid cognitive architecture fully functional in robots running ROS 2).
- A survey of cognitive architectures in the past 20 years. \JournalTitleIEEE transactions on cybernetics 48, 3280–3290 (2018).
- 40 years of cognitive architectures: core cognitive abilities and practical applications. \JournalTitleArtificial Intelligence Review 53, 17–94 (2020).
- A review of 40 years of cognitive architecture research: Focus on perception, attention, learning and applications. \JournalTitlearXiv preprint arXiv:1610.08602 1–74 (2016).
- Pddl2. 1: An extension to pddl for expressing temporal planning domains. \JournalTitleJournal of artificial intelligence research 20, 61–124 (2003).
- Cashmore, M. et al. Rosplan: Planning in the robot operating system. In Proceedings of the international conference on automated planning and scheduling, vol. 25, 333–341 (2015).
- Plansys2: A planning system framework for ros2. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 9742–9749 (IEEE, 2021).
- Forward-chaining partial-order planning. In Proceedings of the International Conference on Automated Planning and Scheduling, vol. 20, 42–49 (2010).
- An introduction to the anchoring problem. \JournalTitleRobotics and autonomous systems 43, 85–96 (2003).
- Harnad, S. The symbol grounding problem. \JournalTitlePhysica D: Nonlinear Phenomena 42, 335–346 (1990).
- Vogt, P. The physical symbol grounding problem. \JournalTitleCognitive Systems Research 3, 429–457 (2002).
- Anchoring symbols to vision data by fuzzy logic. In Symbolic and Quantitative Approaches to Reasoning and Uncertainty 5, 104–115 (Springer, 1999).
- Grounding commonsense knowledge in intelligent systems. \JournalTitleJournal of Ambient Intelligence and Smart Environments 1, 311–321 (2009).
- Lowe, D. G. Distinctive image features from scale-invariant keypoints. \JournalTitleInternational journal of computer vision 60, 91–110 (2004).
- Semantic world modeling using probabilistic multiple hypothesis anchoring. \JournalTitleRobotics and Autonomous Systems 61, 95–105 (2013).
- Learning actions to improve the perceptual anchoring of objects. \JournalTitleFrontiers in Robotics and AI 3, 76 (2017).
- Context-aware 3d object anchoring for mobile robots. \JournalTitleRobotics and Autonomous Systems 110, 12–32 (2018).
- Kuhn, H. W. The hungarian method for the assignment problem. \JournalTitleNaval research logistics quarterly 2, 83–97 (1955).
- Semantic relational object tracking. \JournalTitleIEEE Transactions on Cognitive and Developmental Systems 12, 84–97 (2019).
- Object detection using deep learning, cnns and vision transformers: A review. \JournalTitleIEEE Access (2023).
- Imagenet classification with deep convolutional neural networks. \JournalTitleCommunications of the ACM 60, 84–90 (2017).
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778 (2016).
- Faster r-cnn: Towards real-time object detection with region proposal networks. \JournalTitleAdvances in neural information processing systems 28 (2015).
- You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, 779–788 (2016).
- Carion, N. et al. End-to-end object detection with transformers. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16, 213–229 (Springer, 2020).
- YOLO by Ultralytics (2023).
- Object tracking: A survey. \JournalTitleAcm computing surveys (CSUR) 38, 13–es (2006).
- Image retrieval: Ideas, influences, and trends of the new age. \JournalTitleACM Computing Surveys (Csur) 40, 1–60 (2008).
- Siamese network features for image matching. In 2016 23rd international conference on pattern recognition (ICPR), 378–383 (IEEE, 2016).
- Sketch-based image retrieval via siamese convolutional neural network. In 2016 IEEE international conference on image processing (ICIP), 2460–2464 (IEEE, 2016).
- Product image retrieval using category-aware siamese convolutional neural network feature. \JournalTitleJournal of King Saud University-Computer and Information Sciences 34, 2680–2687 (2022).
- Sharma, N. et al. Siamese convolutional neural network-based twin structure model for independent offline signature verification. \JournalTitleSustainability 14, 11484 (2022).
- Zhang, K. et al. Content-based image retrieval with a convolutional siamese neural network: Distinguishing lung cancer and tuberculosis in ct images. \JournalTitleComputers in biology and medicine 140, 105096 (2022).
- Anchoring symbols to sensor data: preliminary report. In AAAI/IAAI, 129–135 (2000).
- Fuzzy anchoring. In 10th IEEE International Conference on Fuzzy Systems.(Cat. No. 01CH37297), vol. 1, 111–114 (IEEE, 2001).
- Maintaining coherent perceptual information using anchoring. In INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, vol. 19, 1477 (LAWRENCE ERLBAUM ASSOCIATES LTD, 2005).
- Zhang, Y. et al. Bytetrack: Multi-object tracking by associating every detection box. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXII, 1–21 (Springer, 2022).
- Resnet in resnet: Generalizing residual architectures. \JournalTitlearXiv preprint arXiv:1603.08029 (2016).
- Adam: A method for stochastic optimization. \JournalTitleCornell University - arXiv DOI: 10.48550/arxiv.1412.6980 (2015).
- Kant: A tool for grounding and knowledge management. In Ferrández Vicente, J. M., Álvarez-Sánchez, J. R., de la Paz López, F. & Adeli, H. (eds.) Bio-inspired Systems and Applications: from Robotics to Ambient Intelligence, 452–461 (Springer International Publishing, Cham, 2022).
- Vision meets robotics: The kitti dataset. \JournalTitleInternational Journal of Robotics Research (IJRR) (2013).
- Caesar, H. et al. nuscenes: A multimodal dataset for autonomous driving. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11621–11631 (2020).
- 3d multi-object tracking with differentiable pose estimation. \JournalTitlearXiv preprint arXiv:2206.13785 (2022).
- Generating symbolic representation from sensor data: inferring knowledge in robotics competitions. In 2018 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), 261–266 (IEEE, 2018).
- Merlin2: Machined ros 2 planing. \JournalTitleSoftware Impacts 15, 100477, DOI: https://doi.org/10.1016/j.simpa.2023.100477 (2023).
- Categorical, A. Glossary of terms. \JournalTitleMachine Learning 30, 271–274 (1998).