Automatic target detection of remote sensing images (RSI) plays an important role in military reconnaissance, disaster monitoring, and target rescue. The core task of remote sensing target detection is to judge the target categories and complete precise location. However, the existing target detection algorithms have limited accuracy and weak generalization capability for remote sensing images with complex backgrounds. To achieve accurate detection of different categories targets in remote sensing images, this study presents a novel feature enhancement single shot multibox detector (FESSD) algorithm for remote sensing target detection. The FESSD introduces feature enhancement module and attention mechanism into the convolution neural networks (CNN) model, which can effectively enhance the feature extraction ability and nonlinear relationship between different convolution features. Specifically, the feature enhancement module is used to extract the multi-scale feature information, and enhance the model nonlinear learning ability; the self-learning attention mechanism (SAM) is used to expand the convolution kernel local receptive field, which makes the model extract more valuable features. In addition, the nonlinear relationship between different convolution features is enhanced using the feature pyramid attention mechanism (PAM). The advantage of FESSD over other state-of-the-art target detection methods is validated by experiments on the presented seven-class target detection dataset (SD-RSI) and the public DIOR dataset.