Xiaolong CHEN,Xiaoqian MU,Jian GUAN,Ningbo LIU,Wei ZHOU
Marine Target Detection Research Group,Naval Aviation University,Yantai 264001,China
Abstract:As a classic deep learning target detection algorithm,Faster R-CNN (region convolutional neural network) has been widely used in high-resolution synthetic aperture radar (SAR) and inverse SAR (ISAR) image detection.However,for most common low-resolution radar plane position indicator (PPI) images,it is difficult to achieve good performance.In this paper,taking navigation radar PPI images as an example,a marine target detection method based on the Marine-Faster R-CNN algorithm is proposed in the case of complex background (e.g.,sea clutter) and target characteristics.The method performs feature extraction and target recognition on PPI images generated by radar echoes with the convolutional neural network (CNN).First,to improve the accuracy of detecting marine targets and reduce the false alarm rate,Faster R-CNN was optimized as the Marine-Faster R-CNN in five respects:new backbone network,anchor size,dense target detection,data sample balance,and scale normalization.Then,JRC(Japan Radio Co.,Ltd.)navigation radar was used to collect echo data under different conditions to build a marine target dataset.Finally,comparisons with the classic Faster R-CNN method and the constant false alarm rate(CFAR) algorithm proved that the proposed method is more accurate and robust,has stronger generalization ability,and can be applied to the detection of marine targets for navigation radar.Its performance was tested with datasets from different observation conditions(sea states,radar parameters,and different targets).
Key words:Marine target detection;Navigation radar;Plane position indicator (PPI) images;Convolutional neural network(CNN);Faster R-CNN(region convolutional neural network)method
Marine target detection is an important research branch in the field of target detection,as well as an important part of radar signal processing.It is of great significance for maritime transportation,marine envi‐ronment monitoring,and small target searching and rescue at sea (Daniels,2010;Yavari et al.,2016).As one of the main methods of target detection,radar has advantages of working continuously in all weathers,and plays a vital role in marine target detection along‐side infrared detection and visible light detection(Guan et al.,2012;Chen et al.,2014).However,there are still many technical difficulties in detecting marine targets,such as the complicated marine envi‐ronment,low echo signal-to-noise/clutter ratio (SNR/SCR),and complex non-uniform backgrounds (Ward et al.,2007;Chen et al.,2015).On one hand,small reflection coefficients of a marine target may result in weak target returns with a relative low SCR,which easily cause missed detection (Trunk and George,1970).On the other hand,sea spikes generated under high sea states are a kind of strong sea clutter.The in‐tensity of sea spikes is quite similar to that of a target echo,making it easy to register false alarms.More‐over,the motion of the sea surface and the wind cause further interference to the detection of marine targets(Ward et al.,2007).
The main traditional radar marine target detec‐tion methods are constant false alarm rate (CFAR)detection and coherent integration (Jalil et al.,2016;Maresca et al.,2019).The detection performance of CFAR is limited by the background distribution model.The generalization ability to the environment is too weak to ensure the stability and high accuracy of detection.The coherent integration cannot match accurately the complex target characteristics and is therefore not versatile.Moreover,traditional radar target detection and classification processes are com‐plex,requiring detection before classification,which is time-consuming.Given that the existing target detection algorithms are not capable of adapting to complex background environments and various target characteristics,new research methods need to be de‐veloped to resolve this problem (Tian et al.,2016;Yu et al.,2016).In recent years,with the improvement of information technology,the convolutional neural network (CNN) has been well developed and applied in the field of image detection and recognition.This should provide novel research ideas for rapid,stable,and high-precision radar target detection,especially for marine target detection (Mou et al.,2019a;Wang et al.,2019).
CNN,as the most important theory of deep learning,was proposed by LeCun et al.(1998),and then developed into an important branch of ma‐chine learning.It can automatically learn and extract the features of a target,and realize tasks such as intelligent recognition and image segmentation.Since 2012,CNNs have developed rapidly and new models have been proposed,e.g.,AlexNet,VGGNet (visual geometry group net),ResNet (He et al.,2016),and DenseNet.CNN-based object detection algorithms include two-stage detection algorithms such as RCNN(region convolutional neural network),Fast R-CNN,Faster R-CNN (Ren et al.,2017),and Mask R-CNN(He et al.,2020),and single-stage detection algo‐rithms such as YOLO (you only look once),SSD(single shot multibox detector),YOLOv2,and YOLOv3.Among these,Faster R-CNN,which has high detection accuracy and good applicability,has been widely used in pedestrian detection and radar image detection (Liu et al.,2020).The most widely used radar fields are synthetic aperture radar (SAR)and inverse SAR (ISAR) image detection,especially for ship detection (Dong et al.,2019).However,SAR and ISAR images are high-resolution radar images and marine targets have significant contour shape in‐formation,which enables a high detection probability to be achieved using deep learning methods.While most radars use narrowband signals,the resolution of radar plane position indicator (PPI) images is low,coupled with complex background (e.g.,sea clutter)and target characteristics,which greatly increases the difficulty of target detection.Moreover,a marine target in a radar PPI image is a small-scale target compared to the overall PPI image.When applied to this field,Faster R-CNN has the problems of a high false alarm rate and low accuracy and thus needs to be optimized to achieve better performance.While some prelimi‐nary optimization work on Faster R-CNN has been done (Mou et al.,2019b),its improvement was limited by a high false alarm rate,and its detection probability needs further improvement.Therefore,more research is needed.
In this paper,taking navigation radar PPI images as an example,a marine target detection method based on an improved Faster R-CNN algorithm,i.e.,Marine-Faster R-CNN,is proposed to extract and identify targets in radar PPI images.First,the model is developed to improve the accuracy of detecting marine targets and reduce their false alarm rate,and five aspects of Faster R-CNN are optimized to meet the detection requirements of radar marine targets:a newly designed backbone network named feature fusion network (FFNet),anchor point size,multiple target detection,data sample balance,and scale normaliza‐tion.Based on this,the Marine-Faster R-CNN based marine target detection algorithm is introduced.Then,JRC (Japan Radio Co.,Ltd.) navigation radar is used to collect radar echo data for various observation con‐ditions,and data preprocessing and image labeling are performed to build marine target detection datasets.The results of detection experiments show that the proposed method has better detection accuracy and generalization ability than the traditional Faster R-CNN and CFAR methods.The main contribution of this study includes two aspects.First,we propose a CNNbased method,named Marine-Faster R-CNN,for ma‐rine target image detection of low and medium reso‐lution radars.The method includes the FFNet for im‐proving the feature extraction ability and reducing the false alarm rate,small-scale anchors for improving the ability to detect small targets,and the power nonmaximum suppression (P-NMS) method for improv‐ing the ability to detect multiple targets.In particular,FFNet achieves target enhancement and sea clutter suppression through feature fusion when extracting image features.Second,we have constructed relatively complete marine target radar image datasets,to test the performance of the new method under different observation conditions.
Faster R-CNN is a popular deep learning target detection algorithm,which has three parts:a shared CNN,a region proposal network(RPN),and a classi‐fication and regression network.The shared CNN can extract target features and input the extracted fea‐ture maps into the RPN and ROI (region of interest)pooling.RPN implements candidate box generation tasks,which are combined with ROI pooling for fea‐ture mapping.The classification and regression net‐work completes final detection and classification.
Although the Faster R-CNN algorithm has achieved good results in conventional target detection,prob‐lems of high false alarm rates and low accuracy are encountered when applying it to the field of radar marine target detection.The most serious challenges include the detection of marine targets with complex target characteristics in complex sea backgrounds,missed detection and false detection.To improve the accuracy of marine target detection and reduce the false alarm rate,in this study we propose FFNet as the backbone network of Faster R-CNN.We also im‐prove the four other aspects of Faster R-CNN and op‐timize the RPN and classification and regression net‐work,to obtain the proposed Marine-Faster R-CNN model.The structure of Marine-Faster R-CNN is shown in Fig.1,and the backbone network FFNet in Fig.2.
Fig.1 The structure of Marine-Faster R-CNN
Fig.2 The structure of FFNet
The specific optimization method of the model is as follows:
1.Improving the accuracy of detection and re‐ducing false alarms:FFNet is proposed as the back‐bone network of Faster R-CNN,which improves the accuracy of feature extraction and enhances the ability to extract and transfer small target features.This part is identified by ①in Fig.1.
The image feature extraction process of FFNet is as follows:first,convolve the input image using the convolution module (CM);then perform the first downsampling followed by feature fusion;after that,perform the second CM;next,perform the second downsampling followed by feature fusion;and then perform the third CM;perform the first upsampling and fusion with the features after the first downsampling;finally,perform feature extraction using four dense convolutional modules(DCMs).
The feature fusion comprises four main aspects:(1) The feature map of the CM output is added and fused after downsampling,average pooling,and max‐imum pooling;(2) The outputs after the first downs‐ampling and after the first upsampling are added and fused;(3)The outputs of the front layer and the back layer in the DCM are added and fused;(4)The dense convolution network and the sparse convolution net‐work in the DCM are fused.
2.Optimizing the RPN to improve the accuracy of detecting marine targets and reduce the false alarm rate:small-scale anchors are adopted to reduce the problems of missed detection and false alarms caused by small ships in large-scale radar PPI images during the actual detection.In the detection of marine tar‐gets in PPI images,the radar image is very large,while a ship is very small,so the target occupies a small proportion of the image size.Thus,the detec‐tion of the ship in the image involves detection of a small target.However,the classic Faster R-CNN mod‐el is designed for ordinary targets,and its anchor frame size is not suitable for small target detection in large radar images.Therefore,we conduct research and cluster analysis on the anchor and redesign it to meet the detection needs for marine radar image targets.The optimization is identified by ②in Fig.1.
The clustering using representative (CURE) al‐gorithm is used to cluster labeled truth boxes.The CURE algorithm is an agglomerative clustering method with excellent clustering performance.It has low sensitivity to anomalous data and data input order,and good anti-interference performance.The formula for measuring the distance between truth boxes in the algorithm is as follows:
whereuandvare the two closest categories,that is,the two categories to be aggregated,dist(u,v) repre‐sents the distance betweenuandv,is an arbitrary truth box from the truth box collection {c1,c2,…,cn},and the total number of truth boxes isn.The two parametersofciare the area and the ratio of the length to width of the truth box,re‐spectively.dist(p,q) represents the distance betweenpandq;here we use the Euclidean distance.
After clustering,the center and representative points of the new class are obtained:
whereis the center of the new class,|u|and |v|rep‐resent the numbers of truth boxes in classesuandvrespectively,andrepresent the mean of the ground-truth boxes inuandvrespectively,wi(i=1,2,…,n) is the new representative point of the original classpiafter clustering,andαis the clustering regulator.
In the initial state,each truth box is a class,and then the two classes with the smallest distance among all classes are condensed into a new class.is the center of the condensed class,andwiis the new representative point.According to the above method,clustering continues until the final clusters are formed.
At last,nine categories are obtained after clus‐tering,so there are still nine setting anchors.The scales of the anchors are modified to (73,42),(64,61),(55,40),(37,27),(32,32),(28,36),(42,53),(47,48),and(56,72).
3.Improving the accuracy of detection of ma‐rine targets:the P-NMS algorithm is proposed to re‐solve the problem of missed detection in multiple tar‐gets areas.The optimized part is identified by ③in Fig.1.In actual sea surface target detection scenes,multiple targets are often encountered.Dense multi‐ple targets are more likely to cause missed detection,which greatly limits the generalization ability of the algorithm and the further improvement of detection accuracy.P-NMS is based on improvement of the NMS algorithm.The core idea of NMS is to remove all candidate frames that overlap with the optimal candidate frame.The formula of NMS is as follows:
However,in the actual detection of multiple ma‐rine targets,the candidate frames of multiple targets may overlap,and the NMS algorithm suppresses all candidate frames whose overlap ratios are higher than a certain threshold.This may cause individual targets to miss detection.Considering that ships are different from other types of targets in that they should not get too close when sailing or berthing,there is no high overlap of candidate frames.Based on these considerations,we design a power function method called P-NMS,which is suitable for the detec‐tion of multiple targets with a moderate overlap in candidate frames.The formula is as follows:
wheresiis the score of theithcandidate frame,Mdenotes the current highest scoring frame,biis the frame to be processed,Ntis the detection threshold,and IOU is the ratio of the intersection to the union between the prediction frame and the labeled frame.
4.Optimizing the classification and regression networks to improve the accuracy of detecting ma‐rine targets and reduce the false alarm rate:focal loss is used as the classification loss function of the RCNN.This solves the imbalance problem on posi‐tive and negative samples,simple and difficult samples,and improves the training results.Deep learning tar‐get detection models are known to have the problem of sample imbalance.In terms of the imbalance of positive and negative samples,there will be far fewer positive samples containing marine targets than nega‐tive samples containing clutter.When the clutter in‐tensity is strong,it will greatly aggravate the proba‐bility that some negative samples containing sea clutter are falsely detected as targets,resulting in false alarms.In terms of the imbalance of simple and dif‐ficult samples,if there are too many simple sam‐ples which are easily detected,the model will be overfitted,which reduces the generalization ability and accuracy of the model for marine target detec‐tion.Therefore,we optimize the sample imbalance problem to reduce the false alarm rate of marine target detection,enhance the model’s generalization ability and accuracy for marine target detection,and im‐prove the detection performance of the model.The optimization is identified by ④in Fig.1.Its formula is as follows:
whereαtis the balance factor,γis the modulation co‐efficient,andpis the probability of the training re‐sults being correctly classified,i.e.,
In the above,y∈{±1}specifies the ground-truth class,which indicates whether it is in this class.
Focal loss is based on the cross entropy func‐tion.The formula of the cross entropy function is
Focal loss adds modulation coefficientsγ,so that the loss value of easy-to-separate samples in the total classification loss is reduced (too many easy-to-separate counter examples will lead to invalid learning and detection of only the background),while increasing the loss of difficult-to-separate sam‐ples over the proportion of the total classification loss value.This ensures the useful training of the dif‐ficult samples on the model.A balance factorαtis added to resolve the problem of the uneven propor‐tion of positive and negative samples.
5.Optimizing classification and regression net‐works to improve the accuracy of detection of marine targets:precise ROI pooling is used as the scale nor‐malization function to improve the accuracy of the quantization operation,reduce the accuracy loss of the scale normalization process,and improve the integrity and accuracy of the features input to the classification and regression network.In marine target detection based on navigation radar PPI images,marine targets are small targets relative to the entire image.Achieving accurate detection requires a particularly high accuracy of feature extraction.ROI pooling involves normalizing the ROI of the RPN and the feature extraction network,and its accuracy directly affects the classification and regression network.Therefore,we optimize ROI pool‐ing to improve its accuracy.This will improve the de‐tection accuracy of the entire model for small marine target detection.The optimization is identified by ⑤in Fig.1.The formula is shown as follows:
where (x1,y1) and (x2,y2) are the upper-left corner and lower-right corner of one bin area,respectively(a bin is the area divided before pooling).The IC function calculates the product of each successivexandy-direction offset,and the deviation is within one pixel,using pixel (i,j) as representation.Then multi‐plying the product of the offset by the pixel valuewi,jof(i,j) obtainsf(x,y).Perform integrationf(x,y) from(x1,y1)to(x2,y2)to obtain the sum of pixels in the en‐tire bin areas,and take the average of the integral re‐sults to obtain the output of the bin area.Finally,each bin area outputs a value to form a feature map matrix.Precise ROI pooling makes the pixel values have a gra‐dient transfer,which improves the accuracy of pooling.
A flowchart of the radar image marine target de‐tection algorithm based on Marine-Faster R-CNN is shown in Fig.3.The flowchart includes two stages:the offline training stage and the actual detection stage.The offline training procedure is as follows:
1.Use navigation radar to transmit long and short pulses with different sampling frequencies to collect radar echo data under different conditions (different weather conditions and sea states) to ensure the diversity of the collected data samples.At the same time,collect complex samples (samples that are not easily detected) to improve the generalization perfor‐mance of the dataset in different situations.
2.Convert the range-azimuth raw echo data ob‐tained by the radar scan into PPI images.
3.Pre-process the images,e.g.,crop and label the images to build a marine target dataset.
Fig.3 Flowchart of the marine target detection algorithm
4.The Faster R-CNN model is optimized in five respects to be more suitable for radar detection,and the Marine-Faster R-CNN model is obtained.
5.Input the dataset into the Marine-Faster R-CNN model for training and optimization.
6.Adjust the initial training parameters of the model and perform iterative optimization,and select the model with the best training result as the final marine target detection model for detection and clas‐sification tasks.
The actual detection stage is as follows:
1.Collect real-time navigation radar data.
2.Convert the original data into PPI images.
3.Input the PPI images to the Marine-Faster R-CNN,and finally the detection results are obtained.
X-band non-coherent JRC-5312 navigation radar was used to transmit electromagnetic signals,and data acquisition experiments were conducted near Yantai Port,China,to obtain radar returns of ships under different sea states and weather conditions.Radar returns were stored as datasets in DAT format.The marine environment of the Yantai Port area is shown in Fig.4.The area is close to the main international waterway and is located in the core area of the North‐east Asia International Economic Circle.
Fig.4 Yantai Port sea area
Not only are there a lot of passenger and cargo ships entering and leaving the port every day,but also fish breeding areas around and fishing boats,all of which can provide great convenience for the collec‐tion of marine targets.The data collection process is shown in Fig.5.After the electromagnetic waves are emitted by the radar antenna,the echoes of ships are collected by the radar signal acquisition equipment,and then output to the terminal display to obtain the PPI images after signal processing.The position in‐formation of the ships obtained by the automatic identification system (AIS) device can be displayed in the PPI images as verification of the target detec‐tion results.The acquisition software interface and real-time PPI are shown in Fig.6,in which the land clutter,sea clutter,and ship’s echo are displayed.The parameters of the JRC navigation radar are shown in Table 1.
Radar datasets were collected under various ob‐servation conditions including different radar param‐eters,weather conditions,and sea states,and differ‐ent types of maritime targets,to ensure the diversity of the collected data samples (Table 2).To improve the generalization ability of the dataset to complex envi‐ronments,and ameliorate the imbalance between the numbers of simple and difficult samples,it is neces‐sary to collect difficult samples.Difficult samples are samples that cannot be easily distinguished by classifiers.Such samples help improve the recogni‐tion ability and detection performance of the training model.Therefore,the collection of target samples in complex weather such as rain and snow,high wind and wave weather,and small-scale target samples,can not only further increase the diversity of data samples,but also resolve the problem of imbalance between the numbers of simple and difficult samples.After the data collection was completed,the collected content was recorded(Fig.7).
Fig.5 Data collection diagram
Fig.6 Real-time radar PPI images
Table 1 JRC navigation radar parameters
Table 2 Specific implementation of sample diversity
Fig.7 Content of data acquisition records
Matlab was used to convert the DAT format raw data collected by the PCI (peripheral component in‐terconnect) card to MAT format,and to read the format data to generate PPI images.Samples of ship targets under various background environments were collected through various collection methods,and the diversity of the datasets was guaranteed.A typical background environment image and a part of the da‐tasets for different pulses and frequencies,numbers of targets,sea states,and complex samples are shown in Figs.8 and 9.
Fig.8 Normal sample PPI(clear weather)(a)and difficult sample PPI(complex weather background)(b)
Because the PPI image is large (greater than 4000×4000 pixels) and there are no targets in most areas,training with the whole PPI image may take up too much memory and too much time,so the original PPI images need to be cropped.Fig.6 shows that the marine targets are located mainly in the upper right part of the PPI image,that is,in the sea area east of the Yantai Port.Therefore,this part was cropped and the training and testing datasets were constructed based on this PPI image.An example of the cropped area images is shown in Fig.10.
Fig.9 Comparison of different background environments
Fig.10 An example of the cropped area image
We used Pascal VOC (visual object classes) for‐mat to make the dataset.First,use the software to an‐notate the image and generate an annotation file in XML format;then,set 80%of the images in the data‐set for training,20% for testing and verification.The four text files,trainval.txt,train.txt,val.txt,and test.txt,serve as an index directory for the training valida‐tion,training,validation,and test sets.The training dataset includes 2000 images and the test dataset 500 images.A diagram showing the use of annotation software to annotate the cropped sub-image is shown in Fig.11,in which the five rectangular boxes are the truth boxes after labeling.
Fig.11 Diagram of image annotation
In this section,the training results of the Marine-Faster R-CNN model trained on the surface target dataset are shown first.Then,five evaluation indexes were selected based on the evaluation criteria of radar target detection and computer image detection.These indexes were used to analyze and evaluate the experi‐mental results.Finally,detection results are compared with the classic Faster R-CNN and CFAR.
The computer configuration was as follows:Intel?CoreTMi7-8700 K processor,16 GB memory,GeForce GTX 1080 Ti graphics card with 11 GB memory.The system environment was Ubuntu 18.04 and the frame‐work was Pytorch.The experimental parameters were set as follows:the gradient descent algorithm was the Adam,the initial learning rate was 0.001,and the num‐ber of iterations was set to 20 000.
The shared CNN (backbone network) model is essentially an extractor of target features.We proposed FFNet as the backbone network for the Marine-Faster R-CNN model.Fig.12 shows the loss value of the training,which consists of the classification loss(rpncls-loss) and regression loss (rpn-bbox-loss) of the RPN,and the classification loss (rcnn-cls-loss) and regression loss (rcnn-bbox-loss) of the RCNN.It can be seen that the training loss value stabilized at about 0.1 as the number of iterations increased.
Fig.12 Training loss curves of FFNet
Further analysis and evaluation of the training and detection results were carried out.Combining the evaluation parameters of deep learning target detection and traditional radar target detection,three evaluation parameters including the recall rate,accu‐racy rate,and false alarm rate were employed,and the pros and cons of the model and detection results were compared accordingly.The three evaluation pa‐rameters are as follows:
whereRstands for recall,which indicates the recall rate,Pdenotes precision,which indicates the accuracy rate,and FA is the false alarm rate.False positive(FP)indicates a negative sample predicted to be posi‐tive,false negative (FN) indicates a positive sample predicted to be negative,and true positive (TP) indi‐cates a positive sample predicted to be positive.
Fig.13 Targets in real scenes (a),targets in PPI images(b),and detection result images(c)
An example of target detection results is shown in Fig.13.In this image,there were five ships near an island in the low sea state in the Yantai Port.The proposed model could recognize all five ships and achieve good detection performance.
1.The effect of the focal loss function.An ex‐periment was performed to verify how the focal loss function improves the target detection results of the original model.The Faster R-CNN model,using FFNet as the backbone network,was compared with another sample balance optimization method,the on‐line hard example mining (OHEM) algorithm.The results are shown in Table 3,where the frame rate is the number of images(frames)recognized in 1 s.The OHEM algorithm is also a method for optimizing sample imbalances.The core principle is similar to that of the boosting algorithm of machine learning.Samples that are easily misclassified by the classifier are repeatedly trained to improve the model.The ra‐tio of positive to negative samples was 1:3.This fixed ratio led to poor generalization ability,which limits the improvement of detection accuracy.
Table 3 The results of different sample balance optimizations
The OHEM algorithm achieved an improve‐ment in the probability of detection,but caused a de‐crease in the detection speed (Table 3).Our method with focal loss takes both the probability of detection and speed into account:the probability of detection was increased by 0.62%,and the detection speed for each picture decreased by 0.07 frames/s,thereby achieving improvements in both aspects.
2.The effect of P-NMS.Table 3 shows that P-NMS improved the probability of detection more than the NMS algorithm.The 0.56% improvement achieved by P-NMS was higher than that achieved by Soft-NMS.The P-NMS algorithm is therefore more suitable for the detection of dense multi-targets on the sea surface.
3.The effect of precise ROI pooling.Table 3 shows that with the refinement of the normalization of the ROI region scale,the probability of detection of the model was gradually improved.For example,compared with the ROI pooling algorithm,the proba‐bility of detection of precise ROI pooling was in‐creased by 0.51%.However,the increased complexi‐ty of the algorithm affects the speed of training.
4.The effect of small anchors.Table 3 shows that small-scale anchors increased the detection prob‐ability by 0.67% compared with the original anchors.This illustrates the importance of anchor point scale adaptation to the detection target scale,and proves the effectiveness of the optimization.
To sum up,Table 3 shows that small-scale an‐chors made the largest contribution to the optimiza‐tion of the Faster R-CNN algorithm,increasing the de‐tection probability by 0.67%,followed by focal loss with a contribution of 0.62%,and then P-NMS,0.56%,and finally precise ROI pooling,0.51%.The total con‐tribution was 2.36%.However,the overall detection performance improvement was 2.31%,which shows that the improvements by the four optimization methods are not a simple summation of the individual’s.In gen‐eral,the changes had a good effect on the optimization of Faster R-CNN.
To verify the performance of the model pro‐posed in this paper,we compared it with the classic Faster R-CNN algorithm.In the experiment,the training set of the marine target dataset was used as the input data for training,and the test set was used for testing and verification.The test results are shown in Table 4,and the verification results in Figs.14?18.The experiments showed that the perfor‐mance of the Marine-Faster R-CNN algorithm was better than that of the classic Faster R-CNN algo‐rithm,with higher accuracy and a lower false alarm rate,and was suitable for marine target detection in complex backgrounds.
Fig.14 shows that Faster R-CNN(VGG16),Faster R-CNN (Resnet101),and Marine-Faster R-CNN allhad good detection results for sparse targets in a sim‐ple background.Fig.15 shows that Faster R-CNN(VGG16) missed detections while Faster R-CNN(Resnet101) gave false alarms for multi-target detec‐tion under the level-2 sea state.Figs.16 and 17 show that Faster R-CNN (VGG16) and Faster R-CNN(Resnet101) were more likely to cause false alarms than Marine-Faster R-CNN for multi-target detection under the level-3 sea state,and Faster R-CNN(VGG16)was more likely to have missed detection.Fig.18 shows that Faster R-CNN (VGG16) missed detection for target detection under the level-4 sea state,while
Faster R-CNN (Resnet101) and Marine-Faster R-CNN had good detection performance.
Table 4 Comparison of test results
Fig.14 Results under the level-2 sea state for small targets:(a) Faster R-CNN (VGG16);(b) Faster R-CNN (Resnet101);(c)Marine-Faster R-CNN
Fig.15 Results under the level-2 sea state for large and medium multiple targets:(a) Faster R-CNN (VGG16);(b)Faster R-CNN(Resnet101);(c)Marine-Faster R-CNN
Fig.16 Results under the level-3 sea state for large multiple targets:(a) Faster R-CNN (VGG16);(b) Faster R-CNN(Resnet101);(c)Marine-Faster R-CNN
Fig.17 Results under the level-3 sea state for medium targets:(a) Faster R-CNN (VGG16);(b) Faster R-CNN(Resnet101);(c)Marine-Faster R-CNN
Fig.18 Results under the level-4 sea state for large and small targets:(a) Faster R-CNN (VGG16);(b) Faster R-CNN(Resnet101);(c)Marine-Faster R-CNN
In summary,through the proposal of a new back‐bone network named FFNet and the adaptation of small-scale anchor points,the improved balance of samples,the refined scale normalization,and the new algorithm for NMS,Marine-Faster R-CNN showed a great improvement in performance com‐pared with the classic Faster R-CNN.The Marine-Faster R-CNN method has a higher detection proba‐bility and a lower false alarm rate.The new method has good generalization ability for different environ‐ments and is better than the classic Faster R-CNN.
To further verify the effectiveness of the pro‐posed model,we compared it with traditional target detection methods,including the dual-parameter CFAR detection method and the classic two-dimensional cell averaging constant false alarm rate (CA-CFAR)methods widely used in radar image detection.The core idea of CFAR detection is to judge whether there is a target in the detection unit by use of a decision threshold,which is estimated by the power level of the clutter background.A block diagram is shown in Fig.19.Taking CA-CFAR as an example,selectNref‐erence unitsx1,x2,…,xN,and obtain the total clutter background power levelZ.Then multiplyZby the background threshold decision factorTto determine the presence or absence of the target.The judgment criterion formula is as follows:
Fig.19 The diagram of two-dimensional cell averaging constant false alarm rate(CA-CFAR)
References to color refer to the online version of the figure whereDis the detection unit,H1indicates that a target exists in the detection unit,H0indicates that there is no target in the detection unit,andTis a threshold de‐cision factor.
The core idea of dual-parameter CFAR is to use the mean value and variance of the reference unit in the rectangular reference window to estimate the clutter power level,and to obtain the decision threshold to detect the detection unit.The figure of the refer‐ence window of the dual-parameter CFAR is shown in Fig.20.
Fig.20 Dual-parameter constant false alarm rate (CFAR)reference window
Select the target window,protection window,and background window.mAandmBare the mean values of all pixels in the target window and the background window,respectively,andσ2Bis the variance of the background unit in the background window.The deci‐sion criterion formula is as follows:
whereKis the threshold decision factor.
Assume that the clutter distribution type is a Ray‐leigh distribution.In the two-dimensional CA-CFAR,the number of reference units for the upper,lower,left,and right of the detection unit was set to 20,and the number of protection units for the upper,lower,left,and right of the detection unit was set to 2.In the dual-parameter CFAR,the length and width of the target window was 25,the protection window was 10,and the reference window was 50.The detection results of dual-parameter CFAR and CA-CFAR with the probabilities of false alarm of 10?4,10?3,and 10?2were selected for comparison with the Marine-Faster R-CNN algorithm.The test results are shown in Table 5.
Table 5 Comparison with CFAR
Table 5 shows that the detection performance of the CFAR algorithm was unstable and greatly affected by the setting of the probability of false alarm (Pfa).The reason is that it is limited by the clutter distribu‐tion model of the background environment,and a sin‐gle clutter distribution model cannot adapt to diverse background environments.The model is susceptible to interference by ground clutter (island,land) and sea clutter when detecting the target,and the false alarm rate was high.The Marine-Faster R-CNN method proposed in this paper had a higher detection probability under the same false alarm rate,and the detection probability was less affected by the false alarm rate compared with the CFAR algorithms.The target detection results of the three methods under the level-4 sea state are shown in Fig.21.
Fig.21 The detection results under the level-4 sea state:(a) original image;(b) two-dimensional CA-CFAR;(c)dual-parameter CFAR;(d)Marine-Faster R-CNN
In summary,the method proposed in this paper is more accurate and robust.Compared with the clas‐sic Faster R-CNN detection method,it can reduce missed detection and false alarms with more robust‐ness and generalization ability,and can be applied to the marine target detection of navigation radar.Compared with the traditional constant false alarm detection method,it can overcome the limitations of a low false alarm rate and low detection probability in complex background environments,greatly im‐prove the detection probability under the same false alarm rate,and is more robust.
Statistical model based detection methods can achieve their best performance only under the assumed statistical model.However,it is difficult to adapt to diverse and complex backgrounds and multi-type target scenarios.The classic Faster R-CNN target de‐tection algorithm has problems when dealing with low-resolution radar PPI images.Such problems in‐clude missed detection of multiple targets and rough scale normalization.A new marine radar target detec‐tion method based on the Marine-Faster R-CNN was proposed to alleviate the problem that the existing algorithms cannot adapt to complex background environments and various target characteristics.The measured data verified the validity of the new model.The main work can be summarized as follows:
1.The Faster R-CNN algorithm was optimized in five respects:a new backbone network,anchor point scale,data balance,scale normalization,and dense target detection,and a Marine-Faster R-CNN model was established.
2.By collecting data based on different radar pa‐rameters and under different weather conditions,a dataset of radar images for marine target detection was constructed.
3.Experiments with measured data proved that the Marine-Faster R-CNN has higher detection accu‐racy and better generalization ability than the classic Faster R-CNN and CFAR,and can achieve better ma‐rine target detection.
The proposed method should provide support for widespread applications in radar image detection.In the future,different CNN models will be used to further improve the detection performance.
Contributors
Xiaolong CHEN and Xiaoqian MU designed the re‐search.Ningbo LIU and Wei ZHOU provided the radar data.Xiaoqian MU processed the data.Xiaolong CHEN drafted the paper.Xiaolong CHEN,Xiaoqian MU,and Jian GUAN revised and finalized the paper.
Compliance with ethics guidelines
Xiaolong CHEN,Xiaoqian MU,Jian GUAN,Ningbo LIU,and Wei ZHOU declare that they have no conflict of interest.
Frontiers of Information Technology & Electronic Engineering2022年4期