Vision-based Safe Autonomous UAV Docking with Panoramic Sensors (2305.16008v2)
Abstract: The remarkable growth of unmanned aerial vehicles (UAVs) has also sparked concerns about safety measures during their missions. To advance towards safer autonomous aerial robots, this work presents a vision-based solution to ensuring safe autonomous UAV landings with minimal infrastructure. During docking maneuvers, UAVs pose a hazard to people in the vicinity. In this paper, we propose the use of a single omnidirectional panoramic camera pointing upwards from a landing pad to detect and estimate the position of people around the landing area. The images are processed in real-time in an embedded computer, which communicates with the onboard computer of approaching UAVs to transition between landing, hovering or emergency landing states. While landing, the ground camera also aids in finding an optimal position, which can be required in case of low-battery or when hovering is no longer possible. We use a YOLOv7-based object detection model and a XGBooxt model for localizing nearby people, and the open-source ROS and PX4 frameworks for communication, interfacing, and control of the UAV. We present both simulation and real-world indoor experimental results to show the efficiency of our methods.
- Uav in the advent of the twenties: Where we stand and what is next. ISPRS journal of photogrammetry and remote sensing, 184:215–242, 2022.
- Development of a low-cost agricultural remote sensing system based on an autonomous unmanned aerial vehicle (UAV). Biosystems engineering, 108(2):174–190, 2011.
- Persistent UAV delivery logistics: MILP formulation and efficient heuristic. Computers & Industrial Engineering, 120:418–428, 2018.
- Air risk maps for unmanned aircraft in urban environments. In 2022 International Conference on Unmanned Aircraft Systems (ICUAS), pages 1073–1082. IEEE, 2022.
- Embedded vision systems: A review of the literature. In Applied Reconfigurable Computing. Architectures, Tools, and Applications: 14th International Symposium, ARC 2018, Santorini, Greece, May 2-4, 2018, Proceedings 14, pages 204–216. Springer, 2018.
- Vision-based autonomous landing system for unmanned aerial vehicle: A survey. In 2014 International Conference on Multisensor Fusion and Information Integration for Intelligent Systems (MFI), pages 1–8. IEEE, 2014.
- Autonomous control for micro-flying robot and small wireless helicopter xrb. In 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 2906–2911. IEEE, 2006.
- Trinocular ground system to control uavs. In 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 3361–3367. Ieee, 2009.
- A ground-based near infrared camera array system for uav auto-landing in gps-denied environment. Sensors, 16(9):1393, 2016.
- A survey of safe landing zone detection techniques for autonomous unmanned aerial vehicles (UAVs). Expert Systems with Applications, 179:115091, 2021.
- UAV landing using computer vision techniques for human detection. Sensors, 20(3):613, 2020.
- Crowd detection for drone safe landing through fully-convolutional neural networks. In SOFSEM 2020: Theory and Practice of Computer Science: 46th International Conference on Current Trends in Theory and Practice of Informatics, SOFSEM 2020, Limassol, Cyprus, January 20–24, 2020, Proceedings 46, pages 301–312. Springer, 2020.
- Visual-based Safe Landing for UAVs in Populated Areas: Real-time Validation in Virtual Environments. arXiv preprint arXiv:2203.13792, 2022.
- Safeuav: Learning to estimate depth and safe landing areas for uavs from synthetic data. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops, pages 0–0, 2018.
- Generalized object detection on fisheye cameras for autonomous driving: Dataset, representations and baseline. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 2272–2280, 2021.
- Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.
- Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(4):1452–1459, 2021.
- Object detection and localization in 3D environment by fusing raw fisheye image and attitude data. Journal of Visual Communication and Image Representation, 59:128–139, 2019.
- Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pages 2980–2988, 2017.
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR, abs/1704.04861, 2017.
- Woodscape: A multi-task, multi-camera fisheye dataset for autonomous driving. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 9308–9318, 2019.
- KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.
- Are we ready for autonomous driving? the kitti vision benchmark suite. In 2012 IEEE conference on computer vision and pattern recognition, pages 3354–3361. IEEE, 2012.
- YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696, 2022.
- Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Networks, 107:3–11, 2018.
- Monocular depth estimation based on deep learning: An overview. Science China Technological Sciences, 63(9):1612–1627, 2020.
- Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer. IEEE transactions on pattern analysis and machine intelligence, 44(3):1623–1637, 2020.
- Dist-YOLO: Fast Object Detection with Distance Estimation. Applied Sciences, 12(3):1354, 2022.
- Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016.
- A survey on image data augmentation for deep learning. Journal of big data, 6(1):1–48, 2019.
- Data augmentation for object detection: A review. In 2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS), pages 537–543. IEEE, 2021.
- ResFormer: Scaling ViTs with Multi-Resolution Training. arXiv preprint arXiv:2212.00776, 2022.
- Robust multi-resolution pedestrian detection in traffic scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3033–3040, 2013.
- Autoland project: Fixed-wing UAV Landing on a Fast Patrol Boat using Computer Vision. In OCEANS 2019 MTS/IEEE SEATTLE, pages 1–5. IEEE.
- Phuoc Nguyen Thuan (2 papers)
- Tomi Westerlund (62 papers)
- Jorge Peña Queralta (54 papers)