
Article Abstract

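LIU Chengwei, HONG Feng, FENG Haihong, HU Menglu. Underwater acoustic target recognition based on dual attention networks and multiresolution convolutional neural networks [J]. Technical Acoustics (声学技术), 2023, 42(2): 161-167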

Underwater acoustic target recognition combining a multi-scale convolutional network and a dual attention mechanism

Underwater acoustic target recognition based on dual attention networks and multiresolution convolutional neural networks

Submitted: 2021-08-03 Revised: 2021-10-15

DOI: 10.16300/j.cnki.1000-3630.2023.02.006

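Keywords (translated from Chinese): underwater target recognition; attention mechanism; multi-scale convolution; feature fusion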

English keywords: underwater target recognition; attention mechanism; multi-scale convolution; feature aggregation

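Funding: National Natural Science Foundation of China (11574249, 11074202); "Wenhai Program" of the Pilot National Laboratory for Marine Science and Technology (Qingdao) (2021WHZZB1005).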

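Author	Affiliation
LIU Chengwei	East China Sea Research Station, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201805; University of Chinese Academy of Sciences, Beijing 100049
HONG Feng	East China Sea Research Station, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201805
FENG Haihong	East China Sea Research Station, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201805; University of Chinese Academy of Sciences, Beijing 100049
HU Menglu	East China Sea Research Station, Institute of Acoustics, Chinese Academy of Sciences, Shanghai 201805; University of Chinese Academy of Sciences, Beijing 100049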


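Abstract (translated from Chinese):

Underwater acoustic target recognition is one of the main applications of passive sonar systems. To further improve the recognition rate of underwater targets under small-sample conditions, this paper proposes a method that fuses multi-scale convolution with a dual attention mechanism. First, features such as Mel-frequency cepstral coefficients, the chroma spectrum, and spectral contrast are extracted, and a three-dimensional aggregated feature is built from the multi-class feature subsets. Second, multi-scale convolution filter operators are used to construct a multi-resolution convolutional neural network that better fits the time-frequency structure of the three-dimensional aggregated feature. In addition, a dual attention model captures the global dependencies and local characteristics of the samples. An exponentially weighted logarithmic cross-entropy function serves as the loss function, improving the recognition rate for classes with few samples. Experimental results show that the method achieves an average recognition rate of 95.5% on the ShipsEar dataset, a good classification result.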
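The abstract names an exponentially weighted cross-entropy loss but does not give its formula. The following is a minimal sketch of one plausible reading, not the authors' definition: each class receives a weight that decays exponentially with its share of the training set, so rarer classes contribute more to the loss. The function names, the `alpha` parameter, and the weighting rule `exp(-alpha * frequency)` are all assumptions made for illustration.

```python
import numpy as np


def class_weights(counts, alpha=1.0):
    """Hypothetical exponential weighting: rarer classes get larger weights.

    counts: number of training samples per class.
    alpha:  assumed decay rate; not taken from the paper.
    """
    counts = np.asarray(counts, dtype=float)
    freq = counts / counts.sum()      # class share of the training set
    w = np.exp(-alpha * freq)         # decreases as the class gets more common
    return w / w.mean()               # normalise so the mean weight is 1


def weighted_cross_entropy(probs, labels, weights, eps=1e-12):
    """Mean of -w[y] * log p(y) over a batch of predicted probabilities."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    weights = np.asarray(weights, dtype=float)
    p_true = probs[np.arange(len(labels)), labels]  # probability of the true class
    return float(np.mean(-weights[labels] * np.log(p_true + eps)))
```

With all weights set to 1 this reduces to ordinary categorical cross-entropy, which makes it easy to compare against an unweighted baseline.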

English abstract:

Underwater acoustic target recognition (UATR) based on radiated noise is one of the main applications of passive sonar. To further improve the classification accuracy for underwater targets under small-sample conditions, a novel method combining dual attention networks (DAN) with a multiresolution convolutional neural network (MCNN), termed DAN-MCNN, is proposed. First, three-dimensional (3D) aggregated features are built from multi-class feature subsets composed of MFCC, Log-Mel spectrogram, chroma, spectral contrast, and tonnetz features. Then, drawing on the frequency perception mechanism of the human ear and the auditory attention mechanism, a multi-resolution pooling and convolution scheme is adopted to construct the MCNN architecture, which adapts better to the time-frequency structure of the 3D aggregated features. In addition, the DAN module is used to capture the global dependencies and local characteristics of the samples. An exponentially weighted categorical cross-entropy (EWCE) is used as the loss function to improve the recognition rate of categories with fewer samples. The experimental results show that the proposed approach achieves an average recognition accuracy of 95.5% on the ShipsEar dataset, which is the best classification result.
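The multi-resolution convolution idea can be illustrated with a small NumPy sketch: the same time-frequency map is filtered by parallel kernels of different sizes, and the branch outputs are stacked as channels. This is only an illustration under stated assumptions, not the MCNN architecture from the paper: the kernel sizes (3, 5, 7), the random untrained filters, the ReLU activation, and the function names are all hypothetical, and the pooling part of the scheme is omitted.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def conv2d_same(x, kernel):
    """2D cross-correlation with zero padding; output has the shape of x.

    Assumes an odd kernel size in both dimensions.
    """
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(x, ((ph, ph), (pw, pw)))
    windows = sliding_window_view(padded, (kh, kw))  # shape (H, W, kh, kw)
    return np.einsum("hwij,ij->hw", windows, kernel)


def multi_scale_block(x, kernel_sizes=(3, 5, 7), seed=0):
    """Filter x at several kernel sizes and stack the branches as channels."""
    rng = np.random.default_rng(seed)
    branches = []
    for k in kernel_sizes:
        # Random, untrained filter standing in for a learned one (assumption).
        kernel = rng.standard_normal((k, k)) / k**2
        branches.append(np.maximum(conv2d_same(x, kernel), 0.0))  # ReLU
    return np.stack(branches, axis=0)  # shape (len(kernel_sizes), H, W)
```

Small kernels respond to fine local structure and large kernels to broader patterns, so stacking the branches gives later layers access to several resolutions at once.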
