Preprint, Article, Version 1. This version is not peer-reviewed.

EfficientUNetViT: Efficient Breast Tumor Segmentation utilizing U-Net Architecture and Pretrained Vision Transformer

Version 1 : Received: 13 August 2024 / Approved: 14 August 2024 / Online: 14 August 2024 (08:20:54 CEST)

How to cite: Anari, S.; Oliveira, G. G. D.; Ranjbarzadeh, R.; Alves, A. M.; Vaz, G. C.; Bendechache, M. EfficientUNetViT: Efficient Breast Tumor Segmentation utilizing U-Net Architecture and Pretrained Vision Transformer. Preprints 2024, 2024081015. https://doi.org/10.20944/preprints202408.1015.v1

Abstract

This study introduces a neural network architecture for breast tumor segmentation that combines a pretrained Vision Transformer (ViT) with a U-Net framework. The U-Net architecture, widely used for biomedical image segmentation, is augmented with depthwise separable convolutional blocks, which reduce computational complexity and parameter count and thereby improve efficiency and reduce overfitting. The Vision Transformer, whose self-attention mechanism provides robust feature extraction, captures global context within images more effectively than conventional convolutional networks. By using a pretrained ViT as the encoder of our U-Net model, we exploit the rich feature representations it has learned from large datasets, which substantially improves the model’s ability to generalize and to train efficiently. The proposed model performs strongly in segmenting breast tumors from medical images, highlighting the advantages of integrating transformer-based encoders with efficient U-Net designs. This hybrid approach underscores the potential of transformers in medical image processing and sets a new standard for accuracy and efficiency in tumor segmentation tasks.
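
To illustrate the parameter-count claim above, the following minimal sketch compares a standard convolution with a depthwise separable one (a per-channel k x k depthwise convolution followed by a 1 x 1 pointwise convolution). The layer sizes (64 input channels, 128 output channels, 3 x 3 kernel) and the function names are hypothetical choices for illustration only; they are not taken from the paper and do not describe the authors' actual blocks.

```python
def standard_conv_params(c_in, c_out, k, bias=True):
    """Parameter count of a standard k x k convolution."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def depthwise_separable_params(c_in, c_out, k, bias=True):
    """Parameter count of a depthwise k x k convolution (one filter per
    input channel) followed by a 1 x 1 pointwise convolution."""
    depthwise = c_in * k * k + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
c_in, c_out, k = 64, 128, 3

standard = standard_conv_params(c_in, c_out, k)
separable = depthwise_separable_params(c_in, c_out, k)
ratio = separable / standard

print(f"standard conv:            {standard} parameters")
print(f"depthwise separable conv: {separable} parameters")
print(f"ratio:                    {ratio:.3f}")
```

Ignoring biases, the ratio is 1/c_out + 1/k^2, so for a 3 x 3 kernel the separable layer needs roughly one ninth of the weights once the output channel count is large.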

Keywords

Breast cancer; U-Net; Vision Transformer; Depthwise separable convolution

Subject

Public Health and Healthcare, Other
