نبذة مختصرة : An in-depth learning-based approach is designed to develop the ability to recognize human behavior on the move. We introduce 3D residual structures and create 3D residual models. In order to get the most out of the data relationship of several consecutive frames, this study introduces 3D techniques for assigning different values to the existing frames. Experiments show that both structures improve recognition performance. For the 3D residual model, 3D attention model, and 3D attention residual model, this study proposes two model fusion strategies: average and weighted. Among them, the weighted fusion is to give a higher fusion proportion to the high accuracy model by using the model weight calculation method designed in this study. The experimental results show that the additive fusion strategy based on feature contribution has an obvious improvement effect on the test results of the two benchmark datasets, with an increase of more than 2% points, including an increase of 2.69% on HMDB51. The effect of splicing and fusion strategy has also increased by more than 1% point, including 1.34% on UCF101 dataset and about 1.9% on HMDB51. It is proven that deep learning can effectively recognize human behavior in sports.
No Comments.