东北大学学报(自然科学版) ›› 2023, Vol. 44 ›› Issue (6): 799-807.DOI: 10.12068/j.issn.1005-3026.2023.06.006

• 机械工程 • 上一篇    下一篇

基于改进Cascade R-CNN模型的机器人抓取检测研究

姜杨, 赵峰禹, 陈枭   

  1. (东北大学 机器人科学与工程学院, 辽宁 沈阳110169)
  • 发布日期:2023-06-20
  • 通讯作者: 姜杨
  • 作者简介:姜杨(1982-),男,辽宁沈阳人,东北大学副教授.
  • 基金资助:
    国家自然科学基金资助项目(U20A20197); 辽宁省重点研发计划项目(2020JH2/10100040).

Research on Robotic Grasping Detection Based on Improved Cascade R-CNN Model

JIANG Yang, ZHAO Feng-yu, CHEN Xiao   

  1. School of Robotics & Engineering, Northeastern University, Shenyang 110169, China.
  • Published:2023-06-20
  • Contact: JIANG Yang
  • About author:-
  • Supported by:
    -

摘要: 为提高多物体抓取检测网络的抓取检测准确率,提出一种基于改进Cascade R-CNN模型的机械臂抓取检测算法.首先,引入ResNeXt结构能够在不加大网络设计难度的前提下提高了模型的准确率;引入带空洞卷积的空间金字塔池化模块以解决分辨率较低的问题;接着对抓取框回归分支和角度分类分支以分治方法进行优化.其次,针对多物体抓取数据集缺乏的问题,构建多目标抓取数据集(multi-object grasping dataset,MOGD),有效地扩充了多物体抓取检测数据集.最后,基于改进Cascade R-CNN模型设计抓取检测网络,实验结果表明,改进后的算法效率更高,PI-Cascade R-CNN实验准确率为93%,较Cascade R-CNN提升1.5个百分点.

关键词: 抓取检测;空洞卷积;Cascade R-CNN;多物体检测;机器人抓取

Abstract: In order to improve the grasping detection accuracy of the multi-object grasping detection network, a robotic arm grasping detection algorithm based on the improved Cascade R-CNN model is proposed. Firstly, the introduction of the ResNeXt structure can improve the accuracy of the model without increasing the difficulty of network design. The atrous spatial pyramid pooling module is introduced to solve the problem of low resolution. Then, the grasping box regression branch and the angle classification branch are optimized by the divide and conquer method. Secondly, aiming at the lack of multi-object grasping datasets, a multi-object grasping dataset (MOGD) is constructed, which effectively expands the multi-object grasping detection dataset. Finally, a grasping detection network is designed based on the improved Cascade R-CNN model. The experimental results show that the improved algorithm is more efficient. The experimental accuracy of the PI-Cascade R-CNN is 93 %, which is 1.5 percentage higher than that of the Cascade R-CNN.

Key words: grasping detection; atrous convolution; Cascade R-CNN; multi-object detection; robotic grasping

中图分类号: