New internal clustering validation measure for contiguous arbitrary‐shape clusters

Impacto

Downloads

Downloads per month over past year

Rojas Thomas, Juan Carlos and Santos Peñas, Matilde (2021) New internal clustering validation measure for contiguous arbitrary‐shape clusters. International Journal of Intelligent Systems, 36 (10). pp. 5506-5529. ISSN 0884-8173

[thumbnail of Int J of Intelligent Sys - 2021 - Rojas‐Thomas - New internal clustering validation measure for contiguous arbitrary‐shape.pdf]
Preview
PDF
Creative Commons Attribution Non-commercial.

1MB

Official URL: https://doi.org/10.1002/int.22521




Abstract

In this study a new internal clustering validation index is proposed. It is based on a measure of the uniformity of the data in clusters. It uses the local density of each cluster, in particular, the normalized variability of the density within the clusters to find the ideal partition. The new validity measure allows it to capture the spatial pattern of the data and obtain the right number of clusters in an automatic way. This new approach, unlike the traditional one that usually identifies well-separated compact clouds, works with arbitrary-shape clusters that may be contiguous or even overlapped. The proposed clustering measure has been evaluated on nine artificial data sets, with different cluster distributions and an increasing number of classes, on three highly nonlinear data sets, and on 17 real data sets. It has been compared with nine well-known clustering validation indices with very satisfactory results. This proves that including density in the definition of clustering validation indices may be useful to identify the right partition of arbitrary-shape and different-size clusters.


Item Type:Article
Additional Information:

CRUE-CSIC (Acuerdos Transformativos 2021)

Uncontrolled Keywords:arbitrary‐shape clusters, clustering, density, internal validation index, real data sets, uniformity
Subjects:Sciences > Computer science > Databases
Sciences > Computer science > Artificial intelligence
ID Code:70441
Deposited On:15 Feb 2022 17:23
Last Modified:18 Feb 2022 09:55

Origin of downloads

Repository Staff Only: item control page