CN111709315A - Underwater acoustic target radiation noise identification method based on field adaptation - Google Patents
Underwater acoustic target radiation noise identification method based on field adaptation Download PDFInfo
- Publication number
- CN111709315A CN111709315A CN202010463032.0A CN202010463032A CN111709315A CN 111709315 A CN111709315 A CN 111709315A CN 202010463032 A CN202010463032 A CN 202010463032A CN 111709315 A CN111709315 A CN 111709315A
- Authority
- CN
- China
- Prior art keywords
- target
- domain
- label
- sample
- target domain
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 230000005855 radiation Effects 0.000 title claims abstract description 35
- 230000006978 adaptation Effects 0.000 title claims description 28
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 52
- 238000012549 training Methods 0.000 claims abstract description 49
- 238000013507 mapping Methods 0.000 claims abstract description 8
- 230000006870 function Effects 0.000 claims description 52
- 238000009826 distribution Methods 0.000 claims description 35
- 210000002569 neuron Anatomy 0.000 claims description 15
- 238000011176 pooling Methods 0.000 claims description 12
- 238000005457 optimization Methods 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 5
- 238000004422 calculation algorithm Methods 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 3
- 230000003247 decreasing effect Effects 0.000 claims description 3
- 230000008569 process Effects 0.000 claims description 3
- 238000012544 monitoring process Methods 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 5
- 238000013508 migration Methods 0.000 description 5
- 230000005012 migration Effects 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/08—Feature extraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Probability & Statistics with Applications (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computational Linguistics (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Signal Processing (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
The invention discloses a field-adaptive underwater acoustic target radiation noise identification method, which comprises the steps of constructing a convolutional neural network shared by a source domain and a target domain, pre-training the convolutional neural network by using source domain label samples, respectively calculating source domain classification loss and target domain pseudo label classification loss by using the source domain label samples and the target domain samples, multilayer calculating the multi-core maximum mean difference distance of the source domain label samples and the target domain samples in the convolutional neural network, obtaining network total loss according to the source domain classification loss, the target domain pseudo label classification loss and the multi-core maximum mean difference distance, endowing a network-predicted target domain sample label to a label-free sample through pseudo label learning, enabling the label-free target domain underwater acoustic target data to have the capability of monitoring and training a model, and realizing the mapping of the class characteristics of the source domain underwater acoustic target data and the target domain underwater acoustic target data to the same label space, the problem of accurately identifying the underwater acoustic target sample when the target domain underwater acoustic target sample is not labeled and the data volume is rare is effectively solved.
Description
Technical Field
The invention belongs to the field of underwater acoustic target detection and identification, and particularly relates to an underwater acoustic target radiation noise identification method based on field adaptation.
Background
In recent years, the exploration of industrial science and technology in the marine field is deepened in various countries in the world, the emphasis on the underwater sound target in the marine field is gradually increased, the research on an accurate and efficient identification method for the underwater sound target becomes more important, and China still lags behind in this respect. Therefore, with the remarkable changes of the international strategic situation and the surrounding safety environment, research on underwater acoustic target identification needs to be advanced.
Document CN201910661350, a big data-based underwater acoustic target intelligent identification method, first collects a large amount of underwater acoustic target original signals, then extracts typical features of the signals manually by means of signal processing and the like, then constructs a training set from the extracted features to train the built neural network model, and finally inputs the training set to obtain an underwater acoustic target classification result. However, the method firstly needs to manually extract features, the automation degree of underwater sound target identification is not high, and end-to-end identification cannot be realized. Meanwhile, when the marginal distribution or conditional distribution of the target recognition domain data and the marginal distribution or conditional distribution used for model training data in the prior art are not completely the same, the method is easy to cause a lower recognition accuracy problem. Due to the interference of complex ocean background noise and the like, the classification method of underwater sound target radiation noise proposed by many scholars in recent years cannot obtain ideal classification performance.
Disclosure of Invention
The invention aims to provide an underwater sound target radiation noise identification method based on field adaptation so as to overcome the defects of the prior art.
In order to achieve the purpose, the invention adopts the following technical scheme:
a method for recognizing radiation noise of an underwater sound target based on field adaptation comprises the following steps:
step 1), constructing a convolutional neural network shared by a source domain and a target domain;
step 2), pre-training the convolutional neural network constructed in the step 1) by using the source domain label sample to obtain a pre-training model, and initializing a target domain sample pseudo label through the pre-training model;
step 3), respectively calculating source domain classification loss and target domain pseudo label classification loss by using the source domain label samples and the target domain samples, simultaneously calculating the cross-domain multi-core maximum mean difference distance of the source domain label samples and the target domain samples in a convolutional neural network in a multi-layer mode, and obtaining network total loss according to the source domain classification loss, the target domain pseudo label classification loss and the multi-core maximum mean difference distance;
and 4) training and optimizing the convolutional neural network aiming at the total network loss obtained in the step 3), continuously updating a target domain sample pseudo label according to the change of the model parameter of the convolutional neural network while training, circularly iterating until the loss function is converged to obtain an optimal underwater acoustic target radiation noise identification model, and identifying the target sample to be detected by adopting the optimal underwater acoustic target radiation noise identification model to obtain an underwater acoustic target radiation noise classification result.
Further, the convolutional neural network comprises 2 convolutional layers, 2 maximum pooling layers, 2 full-link layers and 1 output layer; the 1 st layer and the 3 rd layer of the convolutional neural network are convolutional layers, and the number of convolutional kernels corresponding to each convolutional layer is 10; the 2 nd layer and the 4 th layer of the convolutional neural network are maximum pooling layers, the size of the pooling window of each maximum pooling layer is 1X2, and the step length is 1; the 5 th layer and the 6 th layer of the convolutional neural network are full connection layers, the number of neurons of the 5 th layer is 2480, and the number of neurons of the 6 th layer is 256; layer 7 is the output layer and the activation function is the softmax function.
Further, in the step 2), the source domain label sample is input into the convolutional neural network constructed in the step 1), iterative optimization is carried out on the convolutional neural network model parameters through an optimization algorithm until the loss function is converged, and a pre-training model is obtained.
Further, in step 2), inputting the unlabeled target domain sample into a convolutional neural network, then outputting the probability that the unlabeled target domain sample is in each category, and taking the category corresponding to the maximum probability as the target domain sample pseudo label of the unlabeled target domain sample; when the target domain sample pseudo label is initialized, all non-label samples in the target domain sample are input into a pre-training model, each non-label sample is subjected to forward propagation to obtain a network output layer neuron value, and the output layer neuron value is converted into probability distribution of corresponding sample types through a softmax function.
Further, the method specifically comprises the following steps: the softmax function is expressed as follows:
wherein e is a natural index, FiIs the i-th neuron value of the output layer, SiTaking the probability value of the i-th class label for the label-free sample;
then obtaining the probability distribution of the target domain samples in the mark space: p ═ S1S2…Sk];
Wherein k is the total number of categories;
and finally, outputting the target domain sample pseudo label through the following formula:
wherein the argmax function returns the index of the maximum value in the array,is a binarized pseudo label for the target domain sample.
Further, a cross entropy function is adopted as a loss function of the source domain label samples, and the source domain classification loss is as follows:
wherein,the probability of the prediction of the ith sample of the source domain being a true label,the true label of the corresponding source domain label exemplar, nSThe number of samples of the source domain small batch of the input network in each iteration period.
Further, a cross entropy function is adopted as a loss function in the target domain sample training, and the target domain pseudo label classification loss is as follows:
wherein,the probability of being a true label for the prediction of the jth sample of the target domain,pseudo-label of the target domain sample for the jth sample of the target domain, nTThe number of samples of the target domain small batch input into the network in each iteration period is counted.
Further, the difference distance of the maximum mean values of the multiple nuclei is as follows:
wherein, XS、XTRespectively a source domain and a target domainIs characterized in that n and m are the small-batch sample numbers of the source domain and the target domain respectively,representation-to-core mappingThe relevant feature mapping, H is the regenerated kernel Hilbert space,is defined as a convex combination of a plurality of kernel functions, wherein:
wherein, βuAs a weighting coefficient, kuIs a Gaussian kernel function, and m is the number of kernel functions;
the total loss of the network is:
wherein,in order to classify the loss for the source domain,for the target domain pseudo label classification loss, α, β are the hyper-parameters set before training, L is the set composed of network convolution layer and full link layer,output characteristics, k, of the source domain and the target domain of the l-th layer of the network, respectivelylFor the compromise factor:
further, the training process of the convolutional neural network comprises the following steps:
1. calculating network total loss using chain-type derivative ruleThe model parameters are adjusted by the initial learning rate of 0.001 and the learning rate is continuously decreased;
2. updating the target domain sample pseudo label according to the adjusted model parameter;
3. and (3) continuously repeating the step 1 to the step 2, and circularly iterating until the loss function is converged to finally obtain the optimal underwater sound target radiation noise identification model.
Compared with the prior art, the invention has the following beneficial technical effects:
the invention discloses a field-adaptive underwater acoustic target radiation noise identification method, which comprises the steps of constructing a convolutional neural network shared by a source domain and a target domain, pre-training the convolutional neural network by using source domain label samples, respectively calculating source domain classification loss and target domain pseudo label classification loss by using the source domain label samples and the target domain samples, simultaneously calculating multi-core maximum mean difference distance of the source domain label samples and the target domain samples in a multi-layer mode in the convolutional neural network, obtaining network total loss according to the source domain classification loss, the target domain pseudo label classification loss and the multi-core maximum mean difference distance, initializing target domain sample pseudo labels through a pre-training model, endowing the non-labeled samples with network-predicted target domain sample labels through pseudo label learning, and enabling the non-labeled target domain underwater acoustic target data to have the capability of a supervision training model, finally, mapping of the classification features of the source domain underwater acoustic target data and the target domain underwater acoustic target data to the same label space is achieved, simultaneous adaptation of cross-domain marginal probability distribution and conditional probability distribution of the source domain data and the target domain data is achieved through multi-core maximum mean difference and pseudo label learning, the problem of accurate identification of the target domain underwater acoustic target samples when the target domain underwater acoustic target samples are not labeled and the data volume is small is effectively solved, and finally the classification performance of the target identification field underwater acoustic target is improved.
Furthermore, from the perspective of field adaptation, a convolutional neural network is constructed, MK-MMD is used as a measurement criterion of marginal distribution difference of a source domain and a target domain, MK-MMD distance is adapted to multiple layers in a depth network at the same time, the test performance of the convolutional neural network is superior to that of an MMD field adaptation method widely used at present, the marginal distribution distance between cross domains is effectively reduced, and accumulated source domain underwater acoustic target label data is applied to underwater acoustic target data identification of a target attention field to realize target identification.
Further, the source domain classification loss and the target domain pseudo label classification loss are respectively calculated through forward propagation and are merged into the network total loss for training so as to draw the condition distribution difference between the source domain and the target domain.
Furthermore, original bandwidth parameters in the kernel function are replaced by calculating the square of the average value difference of the characteristics of the input source domain and the target domain, the problems that the experimental performance fluctuation is large and the like caused by different artificially set Gaussian kernel function bandwidth parameters in the calculation of the MK-MMD are solved, and the MK-MMD distance is fused into the total loss of the network to draw the boundary distribution difference across the domains.
Drawings
FIG. 1 is a flow chart in an embodiment of the present invention.
Fig. 2 is a specific structural diagram of a convolutional neural network in an embodiment of the present invention.
Fig. 3 is a diagram of the total loss of the network according to the method of the present invention.
Fig. 4 is a schematic diagram of five types of underwater acoustic targets related to a training data set in the embodiment of the present invention.
FIG. 5 is a visualization diagram of sample migration characteristics t-SNE of a source domain and a target domain after domain adaptation in the embodiment of the present invention.
Detailed Description
The invention is described in further detail below with reference to the accompanying drawings:
the following detailed description of specific embodiments of the present invention is made with reference to the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
As shown in fig. 1, the method for identifying radiation noise of underwater acoustic target based on field adaptation of the present invention includes the following steps:
step 1: constructing a convolutional neural network shared by a source domain and a target domain;
in the method of the present invention, the specific structure of the constructed convolutional neural network is shown in fig. 2. The convolutional neural network comprises 7 layers, wherein the 7 layers comprise 2 convolutional layers, 2 maximum pooling layers, 2 full-connection layers and 1 output layer; the layer 1 and the layer 3 of the convolutional neural network are convolutional layers, the number of convolutional kernels corresponding to the convolutional layers is 10, the sizes of the convolutional kernels are 1X5 and 1X3 respectively, the step length is 1, and the activation function is a Leaky _ ReLU function;
the 2 nd layer and the 4 th layer of the convolutional neural network are maximum pooling layers, the size of the pooling window of each maximum pooling layer is 1X2, and the step length is 1; the 5 th and 6 th layers of the convolutional neural network are full connection layers, the number of neurons of the 5 th layer is 2480, the number of neurons of the 6 th layer is 256, and the activation function is a Leaky _ ReLU function; layer 7 is the output layer and the activation function is the softmax function.
The method and the device share the same implementation field of parameters of all network layers (convolutional layers and full-connection layers) when the source domain label sample and the target domain sample are transmitted in the forward direction of the network. The field adaptation is realized by constructing a convolution neural network shared by a source field and a target field, namely, the aim of migrating the knowledge of the source field data to the target field is fulfilled by drawing the distribution difference of the source field data and the target field data in the same characteristic space. In short, the related knowledge is obtained in a field learning, and the application to a new field is completed through a field adaptation method. Therefore, different from the traditional deep learning, the training samples of the field adaptation method are divided into source field label data and target field label-free data, the source field label data is used for solving the problems that the sample size is rare and no reliable label exists in the target identification field, and the method is the final target of the method.
2, pre-training the convolutional neural network constructed in the step 1 by using a source domain label sample to obtain a pre-training model, and initializing a target domain sample pseudo label through the pre-training model;
the training set of the convolutional neural network is composed of source domain label samples and target domain samples without labels.
In each experiment, the training set comprises source domain label samples and target domain samples, wherein the used source domain label samples and target domain samples are respectively from radiated noise signals collected by the same sonar collecting equipment at corresponding ports when the 5-class underwater acoustic targets are in different sailing states. The schematic diagram of 5-class underwater sound target types is shown in fig. 4, and the schematic diagram is respectively a motorboat, a pilot ship, a passenger ship, an ocean-going passenger ship and a ro-ro ship. 1095 source domain label samples are collected, wherein each ship type comprises 219 samples, and each target domain sample comprises 485 samples, wherein each ship type comprises 97 samples; each sample is a radiated noise signal of length 1000 points, and is a one-dimensional time series.
And (3) during pre-training, inputting the source domain label sample into the convolutional neural network constructed in the step (1), and performing iterative optimization on the model parameters of the convolutional neural network through an optimization algorithm until the loss function is converged to obtain a pre-training model.
Inputting an unlabeled target domain sample into a convolutional neural network, then outputting the probability that the unlabeled target domain sample is in each category, and taking the category corresponding to the maximum probability as a target domain sample pseudolabel of the unlabeled target domain sample; when the target domain sample pseudo label is initialized, all non-label samples in the target domain sample are input into a pre-training model, each non-label sample is subjected to forward propagation to obtain a network output layer neuron value, and the output layer neuron value is converted into probability distribution of corresponding sample types through a softmax function. The softmax function is expressed as follows:
wherein e is a natural index, FiIs the i-th neuron value of the output layer, SiTaking the probability value of the i-th class label for the label-free sample;
then obtaining the probability distribution of the target domain samples in the mark space: p ═ S1S2…Sk];
Wherein k is the total number of categories;
and finally, outputting the target domain sample pseudo label through the following formula to finish initialization of the target domain sample pseudo label:
wherein the argmax function returns the index of the maximum value in the array,is a binarized pseudo label for the target domain sample.
And step 3: respectively calculating source domain classification loss and target domain pseudo label classification loss by using a source domain label sample and a target domain sample, simultaneously calculating the cross-domain multi-core maximum mean difference distance of the source domain label sample and the target domain sample in a convolutional neural network in a multi-layer mode, and obtaining network total loss according to the source domain classification loss, the target domain pseudo label classification loss and the multi-core maximum mean difference distance;
in the example, a cross entropy function is adopted as a loss function when a source domain label sample and a target domain sample are trained; the source domain classification loss calculation is as follows:
wherein,the probability of the prediction of the ith sample of the source domain being a true label,the true label of the corresponding source domain label exemplar, nSInputting the source domain small batch sample number of the network in each iteration period;
the target domain pseudo label classification loss calculation formula is as follows:
wherein,the probability of being a true label for the prediction of the jth sample of the target domain,pseudo-label of the target domain sample for the jth sample of the target domain, nTThe number of samples of the target domain small batch input into the network in each iteration period is counted.
Adapting the cross-domain probability distribution to obtain the total network loss, and inputting the source domain label sample into a pre-training model to obtain the source domain classification loss; and inputting the target domain sample into a pre-training model according to the initialized target domain sample pseudo label to obtain the target domain pseudo label classification loss.
The cross-domain probability distribution of the source domain data and the target domain data comprises marginal distribution and conditional distribution;
the adaptation of the condition distribution means that under the condition that sample class labels are considered, the migration characteristics of the source domain label samples and the target domain samples are mapped to the same label space by simultaneously optimizing the source domain classification loss and the target domain pseudo label classification loss. When the conditional distribution of cross-domain data is specifically adapted, the source domain label samples and the target domain samples are simultaneously input into a pre-training model, the source domain classification loss and the target domain pseudo label classification loss are respectively calculated through forward propagation, and the total network loss for training is integrated to draw the conditional distribution difference between the source domain and the target domain.
The marginal distribution difference of the cross-domain data refers to the distribution distance of the features of all samples in the source domain data and the target domain data in a high-dimensional space without considering the sample class labels. The adaptation to the boundary distribution is achieved by minimizing the MK-MMD between the source domain label samples and the target domain samples in the training. In domain adaptation, since the marginal distributions of the source domain data and the target domain data are different, in order to enable adaptation between the cross-domain data, the distribution difference between them needs to be evaluated: the multi-core maximum mean difference (MK-MMD) is characterized in that different domain samples are evenly embedded into a regeneration core Hilbert space (RKHS) by fusing a plurality of appropriate kernel functions, and then the marginal distribution difference between cross domains is measured. Meanwhile, in order to solve the problems of large experimental performance fluctuation and the like caused by different artificially set Gaussian kernel function bandwidth parameters in MK-MMD calculation, a median heuristic bandwidth estimation method is selected, and original bandwidth parameters in the kernel function are replaced by calculating the square of the average value difference of input source domain and target domain characteristics. Calculating the MK-MMD distance of cross-domain in a convolutional neural network in a multi-layer mode, and integrating the MK-MMD distance into the total loss of the network to draw up the marginal distribution difference of the cross-domain;
the expression for the MK-MMD distance is as follows:
wherein, Xs、XTRespectively the characteristics of a source domain and a target domain, n and m respectively represent the small batch sample numbers of the source domain and the target domain,representation-to-core mappingThe relevant feature mapping, H is the regenerated kernel Hilbert space,defined as a convex combination of a plurality of kernel functions, expressed as follows:
wherein, βuAs a weighting coefficient, kuIs a gaussian kernel function, and m is the number of kernel functions.
Finally, as shown in fig. 3, the total loss of the network is:
wherein,in order to classify the loss for the source domain,for the target domain pseudo label classification loss, α and β are hyper-parameters set before training, L is a set formed by the 1 st, 3 rd, 5 th and 6 th layers of the network,output characteristics, k, of the source domain and the target domain of the l-th layer of the network, respectivelylFor the compromise factor, it is as follows:
and 4, step 4: and (3) training and optimizing the network total loss convolutional neural network obtained in the step (3), continuously updating the pseudo label of the target domain sample according to the change of the model parameter of the convolutional neural network while training, circularly iterating until the loss function is converged to obtain an optimal underwater acoustic target radiation noise identification model, and identifying the target sample to be detected by adopting the optimal underwater acoustic target radiation noise identification model to obtain an underwater acoustic target radiation noise classification result.
In this example, an Adam optimizer is used that updates the model parameters in the convolutional neural network by calculating an update step by comprehensively considering the first moment estimate and the second moment estimate of the gradient. The method has the advantages of simple implementation, high calculation efficiency, less memory requirement, no influence of gradient expansion transformation on updating of model parameters, good interpretive performance of hyper-parameters, no adjustment or little fine adjustment in general and the like.
The training process of the convolutional neural network comprises the following steps:
1) calculating the total loss of the network by using the chain-type derivation ruleA gradient of (a); adjusting the model parameters by continuously decreasing the initial learning rate of 0.001;
2) updating the target domain sample pseudo label according to the adjusted model parameter;
3) and continuously repeating the step 1) to the step 2), and circularly iterating until the loss function is converged to finally obtain the optimal underwater sound target radiation noise identification model.
Specifically, the target domain samples are input into the optimal underwater acoustic target radiation noise identification model obtained through training in the step 4, and then the underwater acoustic target radiation noise classification result can be obtained.
And (3) inputting all target domain samples serving as a test set into the network trained in the step 5, outputting 5 values by each sample through forward propagation, wherein each value represents the probability that the sample to be tested belongs to each class, and the higher the value is, the higher the probability that the sample to be tested belongs to the corresponding underwater sound target class is, so that the classification purpose is achieved. For example, an output vector of (0.1, 0.2, 0.1, 0.0, 0.6) represents that the probability that the target domain sample to be measured belongs to the category 5 underwater sound target class is the largest.
Finally, as shown in fig. 5, migration features of source domain and target domain samples extracted at a layer before an output layer are visualized through t-SNE, and it is proved that the method provided by the invention can effectively adapt to distribution differences among different domain data to improve target domain data prediction performance.
In summary, the invention provides a field adaptation-based underwater acoustic target radiation noise identification method aiming at the problem of different probability distributions of source domain data and target domain data in the traditional underwater acoustic target identification field, and compared with the traditional underwater acoustic target identification technology, the method provided by the invention better adapts marginal distribution and conditional distribution between the source domain data and the target domain data, effectively solves the problem of accurately identifying target domain underwater acoustic target samples when the target domain underwater acoustic target samples are not labeled and the data amount is rare, and finally improves the classification performance of the target identification field underwater acoustic targets. The classification method is adopted to perform classification experiments on five different types of underwater acoustic target radiation noise signals, and compared with two methods based on parameter migration and MMD field adaptation, experimental results prove that the accuracy of underwater acoustic target identification performed by the method reaches 93.8%, the highest accuracy of the other two migration methods using the same data set is 81.7%, and the accuracy is remarkably improved, so that the method can better perform classification identification on the underwater acoustic target radiation noise signals of ships and warships and the like.
The above-mentioned contents are only for explaining the technical idea of the invention of the present application, and can not be used as the basis for limiting the protection scope of the invention, and any modifications and substitutions made on the technical solution according to the design concept and technical features proposed by the present invention are within the protection scope of the claims of the present invention.
Claims (9)
1. A method for identifying underwater sound target radiation noise based on field adaptation is characterized by comprising the following steps:
step 1), constructing a convolutional neural network shared by a source domain and a target domain;
step 2), pre-training the convolutional neural network constructed in the step 1) by using the source domain label sample to obtain a pre-training model, and initializing a target domain sample pseudo label through the pre-training model;
step 3), respectively calculating source domain classification loss and target domain pseudo label classification loss by using the source domain label samples and the target domain samples, simultaneously calculating the cross-domain multi-core maximum mean difference distance of the source domain label samples and the target domain samples in a convolutional neural network in a multi-layer mode, and obtaining network total loss according to the source domain classification loss, the target domain pseudo label classification loss and the multi-core maximum mean difference distance;
and 4) training and optimizing the convolutional neural network aiming at the total network loss obtained in the step 3), continuously updating a target domain sample pseudo label according to the change of the model parameter of the convolutional neural network while training, circularly iterating until the loss function is converged to obtain an optimal underwater acoustic target radiation noise identification model, and identifying the target sample to be detected by adopting the optimal underwater acoustic target radiation noise identification model to obtain an underwater acoustic target radiation noise classification result.
2. The method for recognizing the radiation noise of the underwater acoustic target based on the field adaptation as claimed in claim 1, wherein the convolutional neural network comprises 2 convolutional layers, 2 max pooling layers, 2 full-link layers and 1 output layer; the 1 st layer and the 3 rd layer of the convolutional neural network are convolutional layers, and the number of convolutional kernels corresponding to each convolutional layer is 10; the 2 nd layer and the 4 th layer of the convolutional neural network are maximum pooling layers, the size of the pooling window of each maximum pooling layer is 1X2, and the step length is 1; the 5 th layer and the 6 th layer of the convolutional neural network are full connection layers, the number of neurons of the 5 th layer is 2480, and the number of neurons of the 6 th layer is 256; layer 7 is the output layer and the activation function is the softmax function.
3. The method for recognizing the radiation noise of the underwater acoustic target based on the field adaptation as claimed in claim 1, wherein in the step 2), the source domain label sample is input into the convolutional neural network constructed in the step 1), and iterative optimization is performed on the convolutional neural network model parameters through an optimization algorithm until the loss function is converged to obtain a pre-training model.
4. The method for identifying the underwater acoustic target radiation noise based on the field adaptation as claimed in claim 1, wherein in the step 2), the unlabeled target domain sample is input into a convolutional neural network, then the probability that the unlabeled target domain sample is in each category is output, and the category corresponding to the maximum probability value is taken as the target domain sample pseudo label of the unlabeled target domain sample; when the target domain sample pseudo label is initialized, all non-label samples in the target domain sample are input into a pre-training model, each non-label sample is subjected to forward propagation to obtain a network output layer neuron value, and the output layer neuron value is converted into probability distribution of corresponding sample types through a softmax function.
5. The method for identifying the underwater acoustic target radiation noise based on the field adaptation as claimed in claim 4, is characterized in that: the softmax function is expressed as follows:
wherein e is a natural index, FiIs the i-th neuron value of the output layer, SiTaking the probability value of the i-th class label for the label-free sample;
then obtaining the probability distribution of the target domain samples in the mark space: p ═ S1S2…Sk];
Wherein k is the total number of categories;
and finally, outputting the target domain sample pseudo label through the following formula:
6. The method for recognizing the radiation noise of the underwater acoustic target based on the field adaptation as claimed in claim 1, wherein a cross entropy function is adopted as a loss function of the source domain label samples, and the source domain classification loss is as follows:
7. The method for identifying the underwater acoustic target radiation noise based on the field adaptation as claimed in claim 1, wherein a cross entropy function is adopted as a loss function in the target domain sample training, and the target domain pseudo label classification loss is as follows:
wherein,the probability of being a true label for the prediction of the jth sample of the target domain,pseudo-label of the target domain sample for the jth sample of the target domain, nTThe number of samples of the target domain small batch input into the network in each iteration period is counted.
8. The method for recognizing the radiation noise of the underwater acoustic target based on the field adaptation as claimed in claim 1, wherein the difference distance of the maximum mean of the multiple cores is as follows:
wherein, XS、XTRespectively the characteristics of a source domain and a target domain, n and m respectively represent the small batch sample numbers of the source domain and the target domain,representation-to-core mapping The relevant feature mapping, H is the regenerated kernel Hilbert space,is defined as a convex combination of a plurality of kernel functions, wherein:
wherein, βuAs a weighting coefficient, kuIs a Gaussian kernel function, and m is the number of kernel functions;
the total loss of the network is:
wherein,in order to classify the loss for the source domain,for the target domain pseudo label classification loss, α, β are the hyper-parameters set before training, L is the set composed of network convolution layer and full link layer,output characteristics, k, of the source domain and the target domain of the l-th layer of the network, respectivelylFor the compromise factor:
9. the method for recognizing the radiation noise of the underwater acoustic target based on the field adaptation as claimed in claim 1, wherein the training process of the convolutional neural network comprises the following steps:
1. calculating network total loss using chain-type derivative ruleThe model parameters are adjusted by the initial learning rate of 0.001 and the learning rate is continuously decreased;
2. updating the target domain sample pseudo label according to the adjusted model parameter;
3. and (3) continuously repeating the step 1 to the step 2, and circularly iterating until the loss function is converged to finally obtain the optimal underwater sound target radiation noise identification model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010463032.0A CN111709315A (en) | 2020-05-27 | 2020-05-27 | Underwater acoustic target radiation noise identification method based on field adaptation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010463032.0A CN111709315A (en) | 2020-05-27 | 2020-05-27 | Underwater acoustic target radiation noise identification method based on field adaptation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111709315A true CN111709315A (en) | 2020-09-25 |
Family
ID=72538604
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010463032.0A Pending CN111709315A (en) | 2020-05-27 | 2020-05-27 | Underwater acoustic target radiation noise identification method based on field adaptation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111709315A (en) |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112270208A (en) * | 2020-09-28 | 2021-01-26 | 天津大学 | Cross data domain micro-representation classification method based on feature subspace separation |
CN112287994A (en) * | 2020-10-26 | 2021-01-29 | 北京嘀嘀无限科技发展有限公司 | Pseudo label processing method, device, equipment and computer readable storage medium |
CN112507621A (en) * | 2020-12-16 | 2021-03-16 | 电子科技大学 | Radar interference semi-supervised identification method based on label propagation |
CN112580518A (en) * | 2020-12-22 | 2021-03-30 | 杭州电子科技大学 | Cross-tested EEG cognitive state identification method based on prototype clustering domain adaptive algorithm |
CN112669331A (en) * | 2020-12-25 | 2021-04-16 | 上海交通大学 | Target data migration iterative learning method and target data migration iterative learning system |
CN112686333A (en) * | 2021-01-19 | 2021-04-20 | 科润智能控制股份有限公司 | Switch cabinet partial discharge mode identification method based on depth subdomain adaptive migration network |
CN112861616A (en) * | 2020-12-31 | 2021-05-28 | 电子科技大学 | Passive field self-adaptive target detection method |
CN113033088A (en) * | 2021-03-23 | 2021-06-25 | 浙江工业大学 | Wind driven generator system fault diagnosis method based on depth subdomain adaptive migration network |
CN113095381A (en) * | 2021-03-29 | 2021-07-09 | 西安交通大学 | Underwater sound target identification method and system based on improved DBN |
CN113168567A (en) * | 2018-10-29 | 2021-07-23 | Hrl实验室有限责任公司 | System and method for small sample transfer learning |
CN113269024A (en) * | 2021-03-31 | 2021-08-17 | 中南大学 | Unsupervised domain self-adaptive network polarization SAR terrain classification method and device considering polarization statistical characteristics |
CN113360659A (en) * | 2021-07-19 | 2021-09-07 | 云南大学 | Cross-domain emotion classification method and system based on semi-supervised learning |
CN113420775A (en) * | 2021-03-31 | 2021-09-21 | 中国矿业大学 | Image classification method under extremely small quantity of training samples based on adaptive subdomain field adaptation of non-linearity |
CN113449631A (en) * | 2021-06-25 | 2021-09-28 | 中南大学 | Image classification method and system |
CN113705769A (en) * | 2021-05-17 | 2021-11-26 | 华为技术有限公司 | Neural network training method and device |
CN113902104A (en) * | 2021-11-01 | 2022-01-07 | 南京工程学院 | Non-invasive load monitoring method combining unsupervised domain self-adaptive strategy and attention mechanism |
CN113989595A (en) * | 2021-11-05 | 2022-01-28 | 西安交通大学 | Federal multi-source domain adaptation method and system based on shadow model |
CN114372559A (en) * | 2021-12-28 | 2022-04-19 | 大连理工大学 | Construction method of deep adaptive network based on robust soft label |
CN114444605A (en) * | 2022-01-30 | 2022-05-06 | 南京邮电大学 | Unsupervised domain adaptation method based on double-unbalance scene |
CN114912482A (en) * | 2022-04-30 | 2022-08-16 | 中国人民解放军海军航空大学 | Method and device for identifying radiation source |
CN114997308A (en) * | 2022-06-01 | 2022-09-02 | 北京航空航天大学 | Cloud edge collaborative self-adaptive industrial data classification method and system |
WO2022193973A1 (en) * | 2021-03-17 | 2022-09-22 | 腾讯科技(深圳)有限公司 | Image processing method and apparatus, electronic device, computer readable storage medium, and computer program product |
CN115471662A (en) * | 2022-11-03 | 2022-12-13 | 深圳比特微电子科技有限公司 | Training method, recognition method, device and storage medium of semantic segmentation model |
CN116108353A (en) * | 2023-04-12 | 2023-05-12 | 厦门大学 | Small sample deep learning underwater sound target recognition method based on data packet |
CN117113061A (en) * | 2023-09-14 | 2023-11-24 | 中国人民解放军军事科学院系统工程研究院 | Cross-receiver radiation source fingerprint identification method and device |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109948741A (en) * | 2019-03-04 | 2019-06-28 | 北京邮电大学 | A kind of transfer learning method and device |
CN110321926A (en) * | 2019-05-24 | 2019-10-11 | 北京理工大学 | A kind of moving method and system based on depth residual GM network |
-
2020
- 2020-05-27 CN CN202010463032.0A patent/CN111709315A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109948741A (en) * | 2019-03-04 | 2019-06-28 | 北京邮电大学 | A kind of transfer learning method and device |
CN110321926A (en) * | 2019-05-24 | 2019-10-11 | 北京理工大学 | A kind of moving method and system based on depth residual GM network |
Non-Patent Citations (6)
Title |
---|
BIN YANG等: ""A Transfer Learning Method for Intelligent Fault Diagnosis from Laboratory Machines to Real-Case Machines"", 《2018 INTERNATIONAL CONFERENCE ON SENSING, DIAGNOSTICS, PROGNOSTICS, AND CONTROL》 * |
SHAN LI等: ""Deep Emotion Transfer Network for Cross-database Facial Expression Recognition"", 《2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR)》 * |
周牧等: ""基于多核最大均值差异迁移学习的WLAN室内入侵检测方法"", 《电子与信息学报》 * |
桂存斌: ""基于最小化最大平均差异损失的无监督领域自适应"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
王翎等: ""结合域混淆与MK-MMD的深度适应网络"", 《小型微型计算机系统》 * |
秦磊: ""基于深度学习的通信辐射源个体识别"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113168567A (en) * | 2018-10-29 | 2021-07-23 | Hrl实验室有限责任公司 | System and method for small sample transfer learning |
CN112270208B (en) * | 2020-09-28 | 2023-07-07 | 天津大学 | Cross data domain micro-expression classification method based on feature subspace separation |
CN112270208A (en) * | 2020-09-28 | 2021-01-26 | 天津大学 | Cross data domain micro-representation classification method based on feature subspace separation |
CN112287994A (en) * | 2020-10-26 | 2021-01-29 | 北京嘀嘀无限科技发展有限公司 | Pseudo label processing method, device, equipment and computer readable storage medium |
CN112507621A (en) * | 2020-12-16 | 2021-03-16 | 电子科技大学 | Radar interference semi-supervised identification method based on label propagation |
CN112507621B (en) * | 2020-12-16 | 2023-04-07 | 电子科技大学 | Radar interference semi-supervised identification method based on label propagation |
CN112580518B (en) * | 2020-12-22 | 2024-04-05 | 杭州电子科技大学 | Cross-test EEG cognitive state recognition method based on prototype clustering domain adaptation algorithm |
CN112580518A (en) * | 2020-12-22 | 2021-03-30 | 杭州电子科技大学 | Cross-tested EEG cognitive state identification method based on prototype clustering domain adaptive algorithm |
CN112669331A (en) * | 2020-12-25 | 2021-04-16 | 上海交通大学 | Target data migration iterative learning method and target data migration iterative learning system |
CN112861616A (en) * | 2020-12-31 | 2021-05-28 | 电子科技大学 | Passive field self-adaptive target detection method |
CN112686333A (en) * | 2021-01-19 | 2021-04-20 | 科润智能控制股份有限公司 | Switch cabinet partial discharge mode identification method based on depth subdomain adaptive migration network |
WO2022193973A1 (en) * | 2021-03-17 | 2022-09-22 | 腾讯科技(深圳)有限公司 | Image processing method and apparatus, electronic device, computer readable storage medium, and computer program product |
CN113033088A (en) * | 2021-03-23 | 2021-06-25 | 浙江工业大学 | Wind driven generator system fault diagnosis method based on depth subdomain adaptive migration network |
CN113095381A (en) * | 2021-03-29 | 2021-07-09 | 西安交通大学 | Underwater sound target identification method and system based on improved DBN |
CN113095381B (en) * | 2021-03-29 | 2024-04-05 | 西安交通大学 | Underwater sound target identification method and system based on improved DBN |
CN113269024A (en) * | 2021-03-31 | 2021-08-17 | 中南大学 | Unsupervised domain self-adaptive network polarization SAR terrain classification method and device considering polarization statistical characteristics |
CN113420775A (en) * | 2021-03-31 | 2021-09-21 | 中国矿业大学 | Image classification method under extremely small quantity of training samples based on adaptive subdomain field adaptation of non-linearity |
CN113420775B (en) * | 2021-03-31 | 2024-03-29 | 中国矿业大学 | Picture classification method based on nonlinearity self-adaptive subdomain field adaptation under extremely small quantity of training samples |
CN113269024B (en) * | 2021-03-31 | 2022-04-26 | 中南大学 | Unsupervised domain self-adaptive network polarization SAR terrain classification method and device considering polarization statistical characteristics |
CN113705769B (en) * | 2021-05-17 | 2024-09-13 | 华为技术有限公司 | Neural network training method and device |
CN113705769A (en) * | 2021-05-17 | 2021-11-26 | 华为技术有限公司 | Neural network training method and device |
CN113449631A (en) * | 2021-06-25 | 2021-09-28 | 中南大学 | Image classification method and system |
CN113360659A (en) * | 2021-07-19 | 2021-09-07 | 云南大学 | Cross-domain emotion classification method and system based on semi-supervised learning |
CN113902104A (en) * | 2021-11-01 | 2022-01-07 | 南京工程学院 | Non-invasive load monitoring method combining unsupervised domain self-adaptive strategy and attention mechanism |
CN113989595A (en) * | 2021-11-05 | 2022-01-28 | 西安交通大学 | Federal multi-source domain adaptation method and system based on shadow model |
CN113989595B (en) * | 2021-11-05 | 2024-05-07 | 西安交通大学 | Shadow model-based federal multi-source domain adaptation method and system |
CN114372559B (en) * | 2021-12-28 | 2024-10-29 | 大连理工大学 | Method for constructing depth adaptation network based on robust soft tag |
CN114372559A (en) * | 2021-12-28 | 2022-04-19 | 大连理工大学 | Construction method of deep adaptive network based on robust soft label |
CN114444605B (en) * | 2022-01-30 | 2024-04-05 | 南京邮电大学 | Unsupervised domain adaptation method based on double unbalanced scene |
CN114444605A (en) * | 2022-01-30 | 2022-05-06 | 南京邮电大学 | Unsupervised domain adaptation method based on double-unbalance scene |
CN114912482A (en) * | 2022-04-30 | 2022-08-16 | 中国人民解放军海军航空大学 | Method and device for identifying radiation source |
CN114997308A (en) * | 2022-06-01 | 2022-09-02 | 北京航空航天大学 | Cloud edge collaborative self-adaptive industrial data classification method and system |
CN115471662B (en) * | 2022-11-03 | 2023-05-02 | 深圳比特微电子科技有限公司 | Training method, recognition method, device and storage medium for semantic segmentation model |
CN115471662A (en) * | 2022-11-03 | 2022-12-13 | 深圳比特微电子科技有限公司 | Training method, recognition method, device and storage medium of semantic segmentation model |
CN116108353A (en) * | 2023-04-12 | 2023-05-12 | 厦门大学 | Small sample deep learning underwater sound target recognition method based on data packet |
CN117113061B (en) * | 2023-09-14 | 2024-02-23 | 中国人民解放军军事科学院系统工程研究院 | Cross-receiver radiation source fingerprint identification method and device |
CN117113061A (en) * | 2023-09-14 | 2023-11-24 | 中国人民解放军军事科学院系统工程研究院 | Cross-receiver radiation source fingerprint identification method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111709315A (en) | Underwater acoustic target radiation noise identification method based on field adaptation | |
CN109993280B (en) | Underwater sound source positioning method based on deep learning | |
CN112364779B (en) | Underwater sound target identification method based on signal processing and deep-shallow network multi-model fusion | |
CN109086824B (en) | Seabed substrate sonar image classification method based on convolutional neural network | |
CN110245608B (en) | Underwater target identification method based on half tensor product neural network | |
CN110109059B (en) | Radar radiation source signal identification method based on deep learning network | |
CN111160176B (en) | Fusion feature-based ground radar target classification method for one-dimensional convolutional neural network | |
CN110728360A (en) | Micro-energy device energy identification method based on BP neural network | |
CN106529428A (en) | Underwater target recognition method based on deep learning | |
Yang et al. | Underwater acoustic research trends with machine learning: general background | |
CN112001270A (en) | Ground radar automatic target classification and identification method based on one-dimensional convolutional neural network | |
CN109581339B (en) | Sonar identification method based on automatic adjustment self-coding network of brainstorming storm | |
CN110991257B (en) | Polarized SAR oil spill detection method based on feature fusion and SVM | |
CN114595732B (en) | Radar radiation source sorting method based on depth clustering | |
CN106628097A (en) | Ship equipment fault diagnosis method based on improved radial basis function neutral network | |
CN113076920B (en) | Intelligent fault diagnosis method based on asymmetric domain confrontation self-adaptive model | |
CN113780242A (en) | Cross-scene underwater sound target classification method based on model transfer learning | |
CN117009916A (en) | Actuator fault diagnosis method based on multi-sensor information fusion and transfer learning | |
CN115170943A (en) | Improved visual transform seabed substrate sonar image classification method based on transfer learning | |
CN113111786A (en) | Underwater target identification method based on small sample training image convolutional network | |
CN114814776B (en) | PD radar target detection method based on graph attention network and transfer learning | |
CN113221758A (en) | Underwater acoustic target identification method based on GRU-NIN model | |
CN113064133A (en) | Sea surface small target feature detection method based on time-frequency domain depth network | |
CN113239809A (en) | Underwater sound target identification method based on multi-scale sparse SRU classification model | |
CN113095381B (en) | Underwater sound target identification method and system based on improved DBN |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200925 |