Partition Quantitative Assessment (Pqa): A Quantitative Methodology to Assess the Embedded Noise in Clustered Omics and Systems Biology Data

Diego A. Camacho-Hernández; Victor E. Nieto-Caballero; José E. León-Burguete; Julio A. Freyre-González

doi:10.20944/preprints202012.0728.v1

Submitted:

28 December 2020

Posted:

29 December 2020

You are already at the latest version

Abstract

Identifying groups that share common features among datasets through clustering analysis is a typical problem in many fields of science, particularly in post-omics and systems biology research. In respect of this, quantifying how a measure can cluster or organize intrinsic groups is important since currently there is no statistical evaluation of how ordered is, or how much noise is embedded in the resulting clustered vector. Many of the literature focuses on how well the clustering algorithm orders the data, with several measures regarding external and internal statistical measures; but none measure has been developed to statistically quantify the noise in an arranged vector posterior a clustering algorithm, i.e., how much of the clustering is due to randomness. Here, we present a quantitative methodology, based on autocorrelation, to assess this problem.

Keywords:

omics data

;

hierarchical clustering

;

noise quantification

Subject:

Biology and Life Sciences - Biochemistry and Molecular Biology

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Partition Quantitative Assessment (Pqa): A Quantitative Methodology to Assess the Embedded Noise in Clustered Omics and Systems Biology Data

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe