CTNet adopted the CNN-Transformer structure, which combined the advantages of convolutional neural networks (CNNs) and Transformers to extract potential discriminative features. The CNN unit and the transformer unit are responsible for extracting local and global temporal features, respec...