U-shaped Vision Transformer and Its Application in Gear Pitting Measurement

Sijun Wang; Yi Qin; Dejun Xi; Chen Liang

doi:10.37965/jdmd.2022.130

Authors

Sijun Wang College of Mechanical and Vehicle Engineering, Chongqing University, China https://orcid.org/0000-0001-7968-6938
Yi Qin College of Mechanical and Vehicle Engineering, Chongqing University, China https://orcid.org/0000-0002-2160-4300
Dejun Xi College of Mechanical and Vehicle Engineering, Chongqing University, China
Chen Liang College of Mechanical and Vehicle Engineering, Chongqing University, China

Keywords:

Vision transformer; residual connection; dilation rate; information interaction; pitting measurement

Abstract

Although convolutional neural networks (CNNs) have become the mainstream segmentation models, the locality of convolution prevents them from learning global and long-range semantic information well. To further improve segmentation performance, we propose the U-shaped vision Transformer (UsViT), a model based on both Transformer and convolution. Specifically, residual Transformer blocks are designed in the encoder of UsViT, which take advantage of the residual network and the Transformer backbone at the same time. Moreover, transpositions in each Transformer layer achieve information interaction between spatial locations and feature channels, enhancing the capability of feature learning. In the decoder, different dilation rates are introduced to each convolutional layer to enlarge the receptive field. In addition, residual connections are applied to make information propagation smoother when training the model. We first verify the superiority of UsViT on the public Automatic Portrait Matting dataset, on which it achieves 90.43% Acc, 95.56% DSC and 94.66% IoU with relatively few parameters. Finally, UsViT is applied to gear pitting measurement in a gear contact fatigue test, and the comparative results indicate that UsViT can improve the accuracy of pitting detection.
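
The abstract reports three segmentation metrics (Acc, DSC and IoU) without defining them. As a minimal sketch, assuming the standard pixel-wise definitions for binary masks (the paper may compute or average them differently), they can be derived from the confusion counts as follows. The function name `segmentation_metrics` and the toy masks are hypothetical and serve only as illustration.

```python
def segmentation_metrics(pred, target):
    """Return (accuracy, Dice coefficient, IoU) for two flat binary masks.

    Assumption: pred and target are equal-length sequences of 0/1 labels,
    with 1 marking the foreground (e.g. a pitting pixel).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    if not pred:
        raise ValueError("masks must not be empty")

    # Confusion counts over all pixels.
    tp = sum(1 for p, t in zip(pred, target) if p == 1 and t == 1)
    tn = sum(1 for p, t in zip(pred, target) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, target) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, target) if p == 0 and t == 1)

    acc = (tp + tn) / len(pred)

    # Convention chosen here: two empty foregrounds count as a perfect match.
    union = tp + fp + fn
    dsc = 2 * tp / (2 * tp + fp + fn) if union else 1.0
    iou = tp / union if union else 1.0
    return acc, dsc, iou


# Toy example: 6 pixels, 2 true positives, 2 true negatives, 1 FP, 1 FN.
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
acc, dsc, iou = segmentation_metrics(pred, target)
print(f"Acc={acc:.4f} DSC={dsc:.4f} IoU={iou:.4f}")
```

For a single pair of masks, DSC is never smaller than IoU, since DSC = 2·IoU / (1 + IoU). Values averaged over a dataset need not satisfy that identity exactly.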

Conflict of Interest Statement
The authors declare no conflicts of interest.