Underwater target detection with an attention mechanism and improved scale
Author(s): Wei, XY (Wei, Xiangyu); Yu, L (Yu, Long); Tian, SW (Tian, Shengwei); Feng, PC (Feng, Pengcheng); Ning, X (Ning, Xin)
Source: MULTIMEDIA TOOLS AND APPLICATIONS DOI: 10.1007/s11042-021-11230-2 Early Access Date: AUG 2021
Abstract: The light problem and the complicated environment of underwater images make target detection difficult. These images are usually blurry because tiny inorganic and organic particles in the water have a great impact on light. To solve this problem, we add squeeze and excitation modules after the deep convolution layers of the YOLOv3 model to learn the relationship between channels and enhance the semantic information of deep features. In addition, many small targets will lose too much information after five downsamples. This is not conducive to detection. By expanding the detection scale, we combine the deep semantic information with the location information of the shallower layer to improve the detection performance of small targets. The experimental results show that the YOLOv3-brackish model greatly improved the detection of small fish, crabs, shrimp and starfish. In addition, there were minor improvements in the detection of big fish and jellyfish. The mean average precision increased by 4.43%.
Accession Number: WOS:000688418200003
ISSN: 1380-7501
eISSN: 1573-7721
Full Text: https://link.springer.com/article/10.1007%2Fs11042-021-11230-2