基于双维自注意力的跨阶段融合燃料棒外观缺陷检测方法

Cross Stage Fusion Fuel Rod Appearance Defect Detection Method Based on Dual Dimensional Self Attention

  • 摘要: 压水堆核燃料棒是核电站普遍采用的一种燃料元件,燃料棒的生产质量关乎核电站的运行安全,对燃料棒的外观进行检测极其重要。针对生产过程中燃料棒外观缺陷背景复杂、特征提取难的特点,本文提出了一种基于双维自注意力的跨阶段融合模型(DSCFM),引入了一个新颖的多尺度特征融合结构(MFFS)作为模型的颈部结构,用于高效地处理和融合不同层次的特征信息。此外,为了进一步挖掘和利用底层的结构信息,并对深层信息进行有效整合,提出了一个具有双维度特性的自注意力特征融合模块(DSAF)。最后,结合多尺度反卷积结构,实现了对特征的有效上采样和优化,显著提高了模型对不同尺度信息的捕获能力。在燃料棒外观缺陷数据集上进行验证,实验表明DSCFM相较于其他检测模型,能够快速精准地识别缺陷,其mAP值达到了82.0%。

     

    Abstract: Pressurized water reactor nuclear fuel rods are a type of fuel element commonly used in nuclear power plants. The production quality of fuel rods is related to the safe operation of nuclear power plants. It is very important to inspect the appearance quality of fuel rods. In view of the complex background and difficult feature extraction of fuel rod appearance defects in the produce scene, this paper proposed a novel cross stage fusion model based on dual dimensional self attention (DSCFM) . The model used Extended-ELAN as the backbone network to extract features and designed a novel multi-scale feature fusion structure (MFFS) as the neck structure of the model to efficiently process and fuse feature information at different levels. The purpose of the MFFS design is to optimize the information flow between features at different levels. A large amount of detailed information was retained through a detailed fusion mechanism, while strengthening the model’s ability to understand the complexity of the scene. In addition, to further mine and utilize the underlying structural information and effectively integrate deep information, a self attention feature fusion module with dual dimensional characteristics (DSAF) was proposed. DSAF expanded the deep and underlying features into dual dimensional feature maps, and used its own transposed matrix to generate channel and spatial attention maps, thereby accurately enhancing the expression of key information, while suppressing irrelevant noise and optimizing the feature fusion process. Through this dual dimensional self attention mechanism, DSAF dynamically adjusts feature responses, effectively captures long-term dependencies, and enhances the model’s adaptability and interpretation capabilities for complex scenes. Finally, combined with a multi-scale deconvolution structure, the DSCFM achieves effective upsampling and optimization of features, significantly improving the model’s ability to capture information at different scales and its robustness in various visual tasks. The results are verified on a fuel rod appearance defect dataset, and the experiments show that compared with other detection models, the DSCFM can quickly and accurately identify defects, with an mAP of 82.0% and a recall rate of 77.9%.

     

/

返回文章
返回