Preprint Article Version 1 This version is not peer-reviewed

Research on Genome Data Recognition and Analysis based on Louvain Algorithm

Version 1 : Received: 16 July 2024 / Approved: 16 July 2024 / Online: 16 July 2024 (16:47:15 CEST)

How to cite: Huang, D.; Xu, L.; Tao, W.; Li, Y. Research on Genome Data Recognition and Analysis based on Louvain Algorithm. Preprints 2024, 2024071344. https://doi.org/10.20944/preprints202407.1344.v1 Huang, D.; Xu, L.; Tao, W.; Li, Y. Research on Genome Data Recognition and Analysis based on Louvain Algorithm. Preprints 2024, 2024071344. https://doi.org/10.20944/preprints202407.1344.v1

Abstract

In genomics analysis, with the development of sequencing technology and the rapid growth of data volume, how to effectively identify and analyze important information in genomic data from massive data has become an important challenge. Identify key patterns and features in genomic data through advanced community testing methods. First, genomic data often contains a lot of noise and redundant information that needs to be processed through steps such as normalization, filtering, and dimensionality reduction. Normalization makes data at different scales comparable, filtering out data points with low quality or missing values, and dimensionality reduction reduces data dimensions through principal component analysis (PCA) and retains the main information. The gene co-expression network establishes the connection relationship between genes by calculating the expression similarity matrix between genes, and applies the Louvain algorithm to maximize the modularity of the network, aggregate nodes layer by layer, form a community structure, and identify the complex relationship and potential structure between genes. Finally, we utilize the cancer dataset to evaluate the proposed model. From our extensive experimental results, we can observe that Louvain's algorithm indicates outperformances and accuracy in the recognition and analysis of genomic data.

Keywords

Genome data recognition, Principal component analysis, Louvain algorithm, Cancer dataset.

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.