温室甜瓜自动采摘系统目标检测模型及空间定位研究

    Study on Target Detection Model and Spatial Location of Greenhouse Muskmelon Automatic Picking System

    • 摘要:
      目的 提高温室甜瓜采摘机器人在复杂光线变化和枝叶遮挡情况下的检测精度,实现检测目标的空间坐标定位。
      方法 基于YOLOv3,研究优化不同主干网络,头部、颈部网络结构及边界框损失函数组合对模型检测性能的影响,建立甜瓜严重遮挡下的目标检测网络模型YOLOResNet70,然后将模型与Intel RealSense D435i传感器融合进行目标空间定位。
      结果 模型YOLOResNet70采用ResNet70为主干网络,结合SPP (Spatial pyramid pooling)、CIoU (Complete intersection over union)、FPN (Feature pyramid network) 以及NMS (Greedy non-maximum suppression) 时性能最佳,模型平均精度(AP) 达到89.4%,优于Y OLOv3的83.3% 和YOLOv5的82%,其检测速度(61.8帧/s)比YOLOv4(54.1帧/s)快14%。
      结论 通过对不同光照条件下的遮挡甜瓜图像进行检测测试,表明YOLOResNet70模型鲁棒性良好,并且与Intel RealSense D435i深度传感器融合实现了甜瓜的空间定位坐标,与手工测量结果吻合,为甜瓜采摘机器人目标检测和空间定位提供了理论和模型支持。

       

      Abstract:
      Objective The study was conducted to improve the detection accuracy of muskmelon picking robot in greenhouse under complex light changes and branch and leaf occlusion, and realize the spatial coordinate positioning of detection targets.
      Method Based on YOLOv3, the study explored the impacts of optimizing the combination of different backbone networks, head and neck network structures and bounding box loss function on the model detection performance, established a target detection network model YOLOResNet70 under severe muskmelon occlusion, and then fused the model with Intel RealSense D435i depth visual sensor for target space positioning.
      Result With ResNet70 as the backbone network, YOLOResNet70 had the best performance with the combination of SPP (Spatial pyramid pooling), CIoU (Complete Intersection over Union), FPN (Feature Pyramid Network) and NMS (Greedy non-maximum suppression). The average accuracy (AP) of the model reached 89.4%, which was better than 83.3% of YOLOv3 and 82% of YOLOv5, and the detection speed (61.8 frames/s) was 14% faster than that of YOLOv4 (54.1 frames/s).
      Conclusion Through the detection and test of occluded muskmelon images under different lighting conditions, it shows that the YOLOResNet70 model has good robustness, and the model is fused with Intel RealSense D435i depth visual sensor to achieve the spatial positioning coordinates of muskmelon, which is consistent with the manual measurement result. It provides theoretical and model support for target detection and spatial positioning of muskmelon picking robot.

       

    /

    返回文章
    返回