The Fuzzy Clustering and Data Analysis Toolbox is a collection of Matlab
functions. Its purpose is to divide a given data set into subsets (called
clusters). In hard partitioning the transitions between the subsets are
crisp; in fuzzy partitioning they are gradual.
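
The difference between a crisp and a gradual transition can be illustrated with a small Python sketch. This is not code from the toolbox: the function names are hypothetical, and the fuzzy variant assumes the standard fuzzy c-means membership formula with fuzzifier m, which the description above does not specify.

```python
import math


def hard_memberships(point, centers):
    """Crisp partition: the point belongs entirely to its nearest center."""
    dists = [math.dist(point, c) for c in centers]
    nearest = dists.index(min(dists))
    return [1.0 if i == nearest else 0.0 for i in range(len(centers))]


def fuzzy_memberships(point, centers, m=2.0):
    """Fuzzy partition: membership degrees in [0, 1] that sum to 1.

    Assumes the fuzzy c-means formula
    u_i = 1 / sum_j (d_i / d_j) ** (2 / (m - 1)), with m > 1.
    """
    dists = [math.dist(point, c) for c in centers]
    # A point lying exactly on a center belongs fully to that center;
    # this also avoids a division by zero below.
    if min(dists) == 0.0:
        return hard_memberships(point, centers)
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_j) ** exponent for d_j in dists)
        for d_i in dists
    ]
```

For centers at (0, 0) and (4, 0), the point (1, 0) gets hard memberships [1.0, 0.0] but fuzzy memberships of roughly 0.9 and 0.1 with m = 2, so the assignment degrades gradually as the point moves toward the second center.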
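CURE (Clustering Using Representatives) is an efficient clustering algorithm for large databases. Traditional partition-based clustering algorithms produce spherical clusters of similar size and are sensitive to outliers. CURE represents each cluster by multiple points, which lets it handle both problems better. To stay efficient on large volumes of data, it also applies random sampling and partitioning.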
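
The idea of representing a cluster by several points can be sketched in Python as follows. This is a minimal illustration, not a full CURE implementation: the function names and default parameters are hypothetical, the shrinking of representatives toward the centroid follows the usual description of CURE rather than anything stated above, and the partitioning and merging steps are omitted.

```python
import math
import random


def centroid(points):
    """Mean of a non-empty list of equal-length coordinate tuples."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(len(points[0])))


def representatives(points, num_rep=4, shrink=0.5):
    """Pick up to num_rep well-scattered points, then shrink them toward the centroid.

    num_rep and shrink are illustrative defaults, not values from the source.
    """
    center = centroid(points)
    chosen = []
    for _ in range(min(num_rep, len(points))):
        if not chosen:
            # First pick: the point farthest from the centroid.
            best = max(points, key=lambda p: math.dist(p, center))
        else:
            # Later picks: the point farthest from its nearest chosen point.
            best = max(points, key=lambda p: min(math.dist(p, q) for q in chosen))
        chosen.append(best)
    # Shrinking pulls the representatives inward, which damps the effect of outliers.
    return [tuple(x + shrink * (c - x) for x, c in zip(p, center)) for p in chosen]


def cluster_distance(reps_a, reps_b):
    """Distance between two clusters: the closest pair of representatives."""
    return min(math.dist(a, b) for a in reps_a for b in reps_b)


def draw_sample(data, sample_size, seed=0):
    """Random sample that would be clustered in place of the full data set."""
    rng = random.Random(seed)
    return rng.sample(data, min(sample_size, len(data)))
```

Because the distance between clusters is measured between representatives rather than centroids, elongated or unevenly sized clusters can still be merged correctly, while sampling keeps the number of points that must be compared small.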