文章摘要
刘承伟,洪峰,冯海泓,胡梦璐.结合多尺度卷积网络和双端注意力机制的水声目标识别[J].声学技术,2023,42(2):161~167
结合多尺度卷积网络和双端注意力机制的水声目标识别
Underwater acoustic target recognition based on dual attention networks and multiresolution convolutional neural networks
投稿时间:2021-08-03  修订日期:2021-10-15
DOI:10.16300/j.cnki.1000-3630.2023.02.006
中文关键词: 水下目标识别  注意力机制  多尺度卷积  特征融合
英文关键词: underwater target recognition  attention mechanism  multi-scale convolution  feature aggregation
基金项目:国家自然科学基金资助项目(11574249,11074202);青岛海洋科学与技术试点国家实验室“问海计划”(2021WHZZB1005)。
作者单位
刘承伟 中国科学院声学研究所东海研究站, 上海 201805
中国科学院大学, 北京 100049 
洪峰 中国科学院声学研究所东海研究站, 上海 201805 
冯海泓 中国科学院声学研究所东海研究站, 上海 201805
中国科学院大学, 北京 100049 
胡梦璐 中国科学院声学研究所东海研究站, 上海 201805
中国科学院大学, 北京 100049 
摘要点击次数: 335
全文下载次数: 405
中文摘要:
      水声目标识别是被动声呐系统的主要应用之一。为了进一步提升小样本条件下水下目标的识别率,文章提出一种基于多尺度卷积和双端注意力机制相融合的方法。首先,提取梅尔倒谱系数,色度谱和计算谱对比度等特征,建立基于多类别特征子集的三维聚合特征。其次,采用多尺度卷积滤波器算子构造多分辨率卷积神经网络,以更好地适应三维聚合特征的时频结构。另外,采用双端注意力模型捕获样本的全局依赖和局部特性。采用基于指数加权的对数交叉熵函数作为损失函数,提升样本数较少类别的识别率。实验结果表明,该方法在ShipsEar数据上的平均识别率为95.5%,取得了较好的分类效果。
英文摘要:
      Underwater acoustic target recognition (UATR) based on radiated noise is one of the main passive sonar applications. To further improve the classification accuracy of underwater target with small sample, a novel method based on dual attention networks (DAN) and a multiresolution convolutional neural network (DAN-MCNN) is proposed. Firstly, the three-dimensional (3D) aggregated features are designed by the multi-class feature subsets, which are composed of MFCC, Log-Mel spectrogram, chroma, spectral contrast, and tonnetz. Then, based on the frequency perception mechanism of the human ear and the auditory attention mechanism, a multi-resolution pooling and convolution scheme is adopted to construct the MCNN architecture, which can better adapt to the time-frequency structure of the 3D aggregated characteristics. Besides, the DAN module is used to capture the global dependence and local characteristics of samples. An exponentially weighted categorical cross-entropy (EWCE) is taken as the loss function to improve the recognition rate of categories with fewer samples. The experimental results show that the proposed approach achieves average recognition accuracy of 95.5% in the ShipsEar dataset, which is the best classification result.
查看全文   查看/发表评论  下载PDF阅读器
关闭