Feature expression is a crucial part of the target tracking process. The artificial feature is relatively simple and has strong real-time performance, but there is a problem of insufficient representation ability. It is prone to drift when dealing with problems such as rapid change and target occlusion. With the strong feature expression ability of deep neural network features in target detection and recognition tasks, deep neural network features are gradually used as feature extraction tools, but how to use and integrate these features is still worth studying. In this paper, the Residual Neural Network(ResNet) is the main researched object, and the influence of each layer on the target tracking performance is analyzed in detail. The feature fusion strategy of the convolutional layer and the addition layer is finally determined. We train a classifier separately for these layers. Then we search the multi-layer response maps to infer the target location in a coarse-to-fine fashion. The algorithm of this paper is verified on the OTB-50 dataset. The one-pass evalution(OPE) value can reach 0.612, which is better than the same type of algorithms.