Preprint Article Version 1 This version is not peer-reviewed

A Strategy of Weak-Connected Grid Search for Noise Filtering and Density Grid-Based Data Clustering

Version 1 : Received: 28 August 2024 / Approved: 28 August 2024 / Online: 28 August 2024 (09:06:01 CEST)

How to cite: Truong, N. T.; Nguyen, S. D.; Choi, S.-B. A Strategy of Weak-Connected Grid Search for Noise Filtering and Density Grid-Based Data Clustering. Preprints 2024, 2024082023. https://doi.org/10.20944/preprints202408.2023.v1 Truong, N. T.; Nguyen, S. D.; Choi, S.-B. A Strategy of Weak-Connected Grid Search for Noise Filtering and Density Grid-Based Data Clustering. Preprints 2024, 2024082023. https://doi.org/10.20944/preprints202408.2023.v1

Abstract

One of the efficient data mining tools is density-based clustering, including the density grid-based clustering. However, a common drawback always existing in clusters made by the density grid-based method is the existence of weakly connected grids deriving mainly from noise. Appearing such an unwanted connection with a high frequency reduces the accuracy of the obtained cluster data space (CDS) and its application efficiency. Here, we present an essential improvement to overcome this problem. First, we describe a concept of the weak-connected grid cell (WCG) and present a fuzzy-type approximation to depict the density-based distribution of data points at grid nodes. Then, we propose a strategy of searching WCG for density grid-based clustering (SWCG-DGB) to set up a CDS, filter noise, and tune the created CDS. A buffer is deployed during this phase to collect border points and filter noise, which improves the computational time significantly, especially for noisy datasets. Results from numerical surveys reflected the compared efficiency of this method in clustering validity, including the accuracy of the number of clusters.

Keywords

Clustering; Density-based clustering; Density grid-based clustering; Fuzzy approximation

Subject

Engineering, Mechanical Engineering

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.