In this work, we utilize a Transformer-based network for precise anatomical landmark detection in chest X-ray images. By combining the strengths of Transformers and UNet architecture, our proposed model achieves robust landmark localization by effectively capturing global context and spatial dependencies. Notably, our method surpasses the current state-of-the-art approaches, exhibiting a significant reduction in Mean Radial Error and a notable improvement in the rate of accurate landmark detection. Each of the landmark points in the labels is presented as a Gaussian heatmap for training the network, using a hybrid loss function, incorporating binary cross-entropy and Dice loss functions, allowing for pixel-wise classification of the heatmaps and segmentation-based training to accurately localize the landmark heatmaps. The promising results obtained highlight the underexplored potential of Transformers in anatomical landmark detection and offer a compelling solution for accurate anatomical landmark detection in chest X-rays. Our work demonstrates the viability of Transformer-based models in addressing the challenges of landmark detection in medical imaging.
|