Enhancing brain tumor detection in MRI with a rotation invariant Vision TransformerShow others and affiliations
2024 (English)In: Frontiers in Neuroinformatics, E-ISSN 1662-5196, Vol. 18, article id 1414925
Article in journal (Refereed) Published
Abstract [en]
Background: The Rotation Invariant Vision Transformer (RViT) is a novel deep learning model tailored for brain tumor classification using MRI scans.
Methods: RViT incorporates rotated patch embeddings to enhance the accuracy of brain tumor identification.
Results: Evaluation on the Brain Tumor MRI Dataset from Kaggle demonstrates RViT's superior performance with sensitivity (1.0), specificity (0.975), F1-score (0.984), Matthew's Correlation Coefficient (MCC) (0.972), and an overall accuracy of 0.986.
Conclusion: RViT outperforms the standard Vision Transformer model and several existing techniques, highlighting its efficacy in medical imaging. The study confirms that integrating rotational patch embeddings improves the model's capability to handle diverse orientations, a common challenge in tumor imaging. The specialized architecture and rotational invariance approach of RViT have the potential to enhance current methodologies for brain tumor detection and extend to other complex imaging tasks.
Place, publisher, year, edition, pages
Frontiers , 2024. Vol. 18, article id 1414925
Keywords [en]
brain tumor classification, Vision Transformers, rotational invariance, MRI, deep learning, rotated patch embeddings
National Category
Medical Engineering
Identifiers
URN: urn:nbn:se:hig:diva-45176DOI: 10.3389/fninf.2024.1414925ISI: 001259439600001PubMedID: 38957549Scopus ID: 2-s2.0-85197264623OAI: oai:DiVA.org:hig-45176DiVA, id: diva2:1882141
2024-07-042024-07-042025-01-20Bibliographically approved