Detail

Publication date: 1 de June, 2021

Data mining and knowledge discovery missing topic: anomalous cluster clustering

I consider first a rather simple intuitive criterion of individual cluster analysis, the product of average within-cluster similarity and the number of elements in it to be maximized, and bring forth its mathematical properties relating the criterion with high density subgraphs and spectral clustering approach. Then I present a simple approximation anomalous cluster model leading to the criterion and families of very effective ADDI crisp clustering methods (Mirkin, 1987) and FADDIS fuzzy clustering methods (Mirkin, Nascimento, 2012); the latter leading to misteries in the popular Laplace data normalization. Then I show that the celebrated square-error k-means clustering criterion can be equivalently reformulated as of finding a partition consisting of anomalous clusters. I will finish with a problem of consensus clustering to show that it is equivalent to anomalous similarity clustering and present experimental results of the superiority of this approach over competition.

Presenter

Boris Mirkin,

Date 25/03/2015
State Concluded