Preprint Article Version 1 This version is not peer-reviewed

Using association rules to explore occult patterns in Breast Cancer Mitochondrial Genomes

Version 1 : Received: 30 July 2024 / Approved: 30 July 2024 / Online: 30 July 2024 (15:56:24 CEST)

How to cite: Casillas Godínez, C. K.; Orozco Luna, F. D. J.; González Santiago, A. E.; Sánchez Parada, M. G.; Mercado Sesma, A. R.; Jiménez Meza, A. R.; Baptista Rosas, R. C. Using association rules to explore occult patterns in Breast Cancer Mitochondrial Genomes. Preprints 2024, 2024072461. https://doi.org/10.20944/preprints202407.2461.v1 Casillas Godínez, C. K.; Orozco Luna, F. D. J.; González Santiago, A. E.; Sánchez Parada, M. G.; Mercado Sesma, A. R.; Jiménez Meza, A. R.; Baptista Rosas, R. C. Using association rules to explore occult patterns in Breast Cancer Mitochondrial Genomes. Preprints 2024, 2024072461. https://doi.org/10.20944/preprints202407.2461.v1

Abstract

Single nucleotide polymorphisms in the mitochondrial genome have been investigated in relation to breast cancer. Certain variants could be associated with an increased risk of this malignant disease. These associations underline the importance of mitochondrial function in cancer biology and highlight the potential for mtDNA-based biomarkers and future therapies. The general objective of this research was to develop a model which is capable of identifying nonlinear patterns in breast cancer mitochondrial sequences. Association rules were used to explore mtDNA variant positions. The analysis included 41 breast cancer patients and 28 control samples. An a priori algorithm identified significant association rules with lift > 1 and confidence > 0.5. A total of 5562 rules for cancer and 157,140 for control sequences were refined after redundancy removal. A total of 150 associations were identified, of which only 78 showed significant support and confidence, while 315 and 1438 positions showed strong associations with breast cancer. Association rules analysis revealed significant patterns, especially in sequences associated with the control region and a specific locus around genes coding for tRNAs and NADH dehydrogenase subunits. However, further research is necessary to establish causality, clinical relevance, and to confirm these findings.

Keywords

Breast cancer; Bioinformatics; Data Science; Pattern Search; Association Rules

Subject

Medicine and Pharmacology, Oncology and Oncogenics

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.