基于语义分割和几何分析的火龙果果实与枝条的三维重建

高嘉正; 李文涛; 罗陈迪; 欧阳春凡; 曾春林; 苏佳荣; 舒锦瑞; 周学成

doi:10.11975/j.issn.1002-6819.202402104

摘要: 针对非结构化环境中采摘机器人缺少足够环境信息的问题，该研究提出了一种用于机器臂避障和路径规划的果实与枝条检测和三维重建方法。采用MobileNetV2取代传统DeepLabV3+主干特征提取网络Xception，并在特征提取模块引入了坐标注意力机制，通过改进网络对采集的RGB图像进行目标检测，并将检测到的火龙果和枝条语义掩膜转换成三维点云。提出一种基于非线性最小二乘法的椭球体拟合方法用于重建火龙果，用有限数量的AABB包围盒获取不规则的枝条的三维空间位姿信息。测试表明，改进后模型的平均交并比（mean intersection over union，mIoU）和平均像素精度（mean pixel accuracy，mPA）分别达到95.59%、98.01%，相较原模型分别提升2.57个百分点和1.44个百分点；平均推理时间和模型内存占用量分别降至94.74ms和22.52MB，分别仅为原模型的59%和11%。三维重建试验表明，火龙果果实重建的短轴尺寸和深度距离的平均绝对误差分别为0.44和2.04mm，枝条重建的样本标准差在各坐标轴上均小于10mm。结果证实了该研究方法可以有效地重建火龙果果实和枝条，可以为火龙果采摘机器人的采摘路径规划避障提供基础。

Abstract: Robotics have been rapidly evolving with the onset of complex tasks for the discrete decision-making or subtle handling in modern agriculture, especially the harvesting of fruits from the orchards. Harvesting robots have also limited to an obstacle under an unstructured environment, such as disorderly grown branches. An added layer of complexity can also reduce the successful harvesting rates during operation. In this study, three-dimensional reconstruction of fruit branches was presented to efficiently avoid the obstacles for robots. The path planning was also determined for the target instruction during detection. Firstly, Intel RealSense D435i depth camera was used as image acquisition equipment to collect the images from dragon fruit plantations in Zengcheng District, Conghua District and Panyu District of Guangzhou. The data set was expanded to 3347 images by operations, such as turning, adding noise and blurring. The MobileNetV2 was selected to replace the traditional feature extraction network, Xception in the DeepLabV3+ backbone. A coordinate attention mechanism was incorporated into the feature extraction module to enhance the performance. The mobile-friendly and lightweight architecture of MobileNetV2 was used to most effectively discern and detect data. Each detected piece of information was converted into a three-dimensional point cloud. Then these point clouds were segmented into simply connected regions by region growing. As such, the robot was operated on the actual physical features of the on-field elements onto a digital platform after transformation. Moreover, a hypothesis was then proposed, indicating that dragon fruits were significantly represented by a three-dimensional ellipsoid. An exhaustive study was conducted to verify this presumption using fitting ellipsoids. Non-linear least square estimation was then utilized to reconstruct the dragon fruit point cloud. The irregular branches were enclosed within a finite set of axis-aligned bounding boxes (AABB), in order to obtain their three-dimensional spatial pose information for the robots to avoid around them. Finally, the harvesting system of dragon fruit was constructed with the three-dimensional reconstruction. Several experiments were conducted to validate the reconstruction efficiency and picking competence, including reconstruction error experiments and simulated picking tests. All coordinate axes were performed in the Cartesian space. The remarkable performance was achieved in the improved DeepLabV3+ model, both in terms of mean intersection over union (mIoU) and mean pixel accuracy (mPA) with a score of 95.59% and 98.01%, respectively, indicating an increased accuracy and precision. In terms of technical efficiency, both inference time and the model memory usage were optimized with the average of 94.74 ms and the only 22.52MB, which was 11% of the original model. The improved model increased the accuracy and the general performance of the system. In practical, the minimal average absolute errors were 0.44 and 2.04 mm, respectively, for the dimensions and the depth of the dragon fruit post-reconstruction. Furthermore, the standard deviation of the three-dimensional reconstructions of the branches was also less than 10mm on all axes. The simulation of the picking test was successfully completed in 86.79% of the experiments. The hypothesis was validated to enhance the efficiency and success rate of the dragon fruit harvesting robot. The dragon fruits and the branches were effectively reconstructed for the planning the path of the robotic system in an authentic environment. The successfully harvesting the fruits was realized to avoid the obstacles in an unstructured environment.

基于语义分割和几何分析的火龙果果实与枝条的三维重建

Three-dimensional reconstruction for dragon fruits and branches using semantic segmentation and geometric analysis