Version 1
: Received: 1 August 2024 / Approved: 2 August 2024 / Online: 2 August 2024 (16:48:02 CEST)
How to cite:
Bellingeri, M.; Mancabelli, L.; Milani, C.; Lugli, G. A.; Alfieri, R.; Turchetto, M.; Ventura, M.; Cassi, D. Machine Learning Monte Carlo Approaches and Statistical Physics Notions to Characterize Bacterial Species in Human Microbiota. Preprints2024, 2024080200. https://doi.org/10.20944/preprints202408.0200.v1
Bellingeri, M.; Mancabelli, L.; Milani, C.; Lugli, G. A.; Alfieri, R.; Turchetto, M.; Ventura, M.; Cassi, D. Machine Learning Monte Carlo Approaches and Statistical Physics Notions to Characterize Bacterial Species in Human Microbiota. Preprints 2024, 2024080200. https://doi.org/10.20944/preprints202408.0200.v1
Bellingeri, M.; Mancabelli, L.; Milani, C.; Lugli, G. A.; Alfieri, R.; Turchetto, M.; Ventura, M.; Cassi, D. Machine Learning Monte Carlo Approaches and Statistical Physics Notions to Characterize Bacterial Species in Human Microbiota. Preprints2024, 2024080200. https://doi.org/10.20944/preprints202408.0200.v1
APA Style
Bellingeri, M., Mancabelli, L., Milani, C., Lugli, G. A., Alfieri, R., Turchetto, M., Ventura, M., & Cassi, D. (2024). Machine Learning Monte Carlo Approaches and Statistical Physics Notions to Characterize Bacterial Species in Human Microbiota. Preprints. https://doi.org/10.20944/preprints202408.0200.v1
Chicago/Turabian Style
Bellingeri, M., Marco Ventura and Davide Cassi. 2024 "Machine Learning Monte Carlo Approaches and Statistical Physics Notions to Characterize Bacterial Species in Human Microbiota" Preprints. https://doi.org/10.20944/preprints202408.0200.v1
Abstract
Recent studies have shown correlations between the microbiota's composition and various health conditions. Machine learning (ML) techniques are essential for analyzing complex biological data, particularly in microbiome research. ML methods help analyze large datasets to uncover microbiota patterns and understand how these patterns affect human health. This study introduces a novel approach combining statistical physics with the Monte Carlo (MC) methods to characterize bacterial species in the human microbiota. We assess the significance of bacterial species in different age groups by using notions of statistical distances to evaluate species prevalence and abundance across age groups and employing MC simulations based on statistical mechanics principles. Our findings show that the microbiota composition experiences a significant transition from early childhood to adulthood. Species such as Bifidobacterium breve and Veillonella parvula decrease with age, while others like Agathobaculum butyriciproducens and Eubacterium rectale increase. Additionally, low-prevalence species may hold significant importance in characterizing age groups. Finally, we propose an overall species ranking by integrating the methods proposed here in a multicriteria classification strategy. Our research provides a comprehensive tool for microbiota analysis using statistical notions, ML techniques, and MC simulations.
Keywords
Monte Carlo simulation; Machine Learning; Human Microbiota; Statistical Physics; Microcanonical ensemble; Canonical ensemble; Database learning
Subject
Computer Science and Mathematics, Artificial Intelligence and Machine Learning
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.