Explore, analyze, and visualize your data with 14 state-of-the-art clustering algorithms
14
Algorithms Available
N-D
Dimensional Support
Auto-K
Cluster Detection
CSV / XLSX
Upload Your Data
Upload a CSV or Excel (.xlsx) file with your numerical features. Headers are automatically detected.
Set the number of clusters, preprocessing options, and algorithm-specific parameters.
Explore interactive visualizations, metrics, and detailed cluster analysis.
Choose an algorithm based on your data characteristics and goals
The most widely used clustering algorithm. It partitions data into k clusters by iteratively assigning points to the nearest centroid and updating centroids until convergence.
Key Features
Best for: Large datasets with spherical clusters
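The loop described above (assign to nearest centroid, recompute centroids, repeat) can be sketched with scikit-learn — assuming a scikit-learn backend; the blob data and parameters below are illustrative, not taken from the app:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Three well-separated synthetic blobs (illustrative data only)
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=42)

# n_init restarts guard against a poor random centroid initialization
km = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = km.fit_predict(X)
```

`km.cluster_centers_` holds the final centroids; note they are means, not actual data points.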
Similar to K-Means but uses actual data points (medoids) as cluster centers instead of means. More robust to outliers and noise in the data.
Key Features
Best for: Data with outliers or categorical features
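scikit-learn itself ships no K-Medoids estimator (the separate scikit-learn-extra package does), so here is a minimal NumPy sketch of the alternating variant — assign points to the nearest medoid, then make each cluster's most central point the new medoid. This is a simplified illustration, not the full PAM swap phase:

```python
import numpy as np
from sklearn.datasets import make_blobs

def k_medoids(X, k, n_iter=50, seed=0):
    """Minimal alternating K-Medoids sketch (illustrative, not full PAM)."""
    rng = np.random.default_rng(seed)
    # Precompute all pairwise squared Euclidean distances
    d = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    medoids = rng.choice(len(X), size=k, replace=False)
    for _ in range(n_iter):
        labels = d[:, medoids].argmin(axis=1)   # nearest-medoid assignment
        new_medoids = medoids.copy()
        for j in range(k):
            idx = np.where(labels == j)[0]
            if len(idx):
                # the point minimizing total distance to its cluster
                new_medoids[j] = idx[d[np.ix_(idx, idx)].sum(axis=0).argmin()]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    return medoids, labels

X, _ = make_blobs(n_samples=200, centers=3, cluster_std=0.7, random_state=0)
medoids, labels = k_medoids(X, k=3)
```

Because medoids are indices of real data points, the cluster centers remain valid observations — the property that makes the method robust to outliers.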
A non-parametric algorithm that doesn't require specifying the number of clusters. It finds clusters by iteratively shifting points toward the nearest mode (local density peak) of the data.
Key Features
Best for: Unknown number of clusters, image segmentation
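Since the cluster count falls out of the mode-seeking procedure rather than being preset, the only real knob is the kernel bandwidth. A hedged scikit-learn sketch (synthetic data, illustrative parameters):

```python
from sklearn.cluster import MeanShift, estimate_bandwidth
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.5, random_state=7)

# Bandwidth (kernel radius) is the key parameter; estimate it from the data
bandwidth = estimate_bandwidth(X, quantile=0.2, random_state=7)
ms = MeanShift(bandwidth=bandwidth, bin_seeding=True)
labels = ms.fit_predict(X)
n_found = len(ms.cluster_centers_)   # number of clusters discovered, not preset
```

A larger `quantile` yields a wider bandwidth and fewer, coarser clusters.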
Unlike hard clustering, FCM allows data points to belong to multiple clusters with varying degrees of membership. Perfect for overlapping or ambiguous cluster boundaries.
Key Features
Best for: Overlapping clusters, uncertainty quantification
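The membership idea can be made concrete with a small NumPy sketch of the standard FCM update (alternate between recomputing weighted centers and renormalized memberships); this is an illustrative implementation, not the app's code:

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, n_iter=100, seed=0):
    """Minimal Fuzzy C-Means sketch; m > 1 controls how fuzzy memberships are."""
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), c))
    U /= U.sum(axis=1, keepdims=True)           # memberships sum to 1 per point
    for _ in range(n_iter):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-10
        U = dist ** (-2.0 / (m - 1.0))          # closer center => higher weight
        U /= U.sum(axis=1, keepdims=True)
    return centers, U

rng = np.random.default_rng(3)
X = np.vstack([rng.normal(0.0, 0.4, (100, 2)), rng.normal(2.5, 0.4, (100, 2))])
centers, U = fuzzy_c_means(X, c=2)
hard_labels = U.argmax(axis=1)                  # harden memberships if needed
```

Each row of `U` is a probability-like membership vector, which is exactly the uncertainty quantification the blurb mentions.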
Discovers clusters of arbitrary shape based on density. Can identify noise points and doesn't require specifying the number of clusters beforehand.
Key Features
Best for: Geographic data, anomaly detection
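Arbitrary-shaped clusters and explicit noise labels are easy to see on the classic two-moons data — a hedged scikit-learn sketch with illustrative parameters:

```python
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

# Two interleaving half-moons: shapes a centroid-based method would split badly
X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

# eps: neighborhood radius; min_samples: density threshold for a core point
db = DBSCAN(eps=0.2, min_samples=5)
labels = db.fit_predict(X)
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)  # -1 marks noise
```

Points labeled `-1` are noise/outliers, which is why DBSCAN doubles as a simple anomaly detector.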
Hierarchical extension of DBSCAN that handles clusters of varying density. Widely regarded as the strongest general-purpose density-based method.
Key Features
Best for: Complex real-world datasets
Creates an ordering of the data points that captures their density-based clustering structure. Produces reachability plots for visual cluster analysis.
Key Features
Best for: Exploratory analysis, varying density clusters
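The reachability plot comes straight from two fitted attributes — a hedged scikit-learn sketch (synthetic data, illustrative `min_samples`):

```python
from sklearn.cluster import OPTICS
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.5, random_state=1)

opt = OPTICS(min_samples=10)
opt.fit(X)

# Reachability distances in processing order: valleys correspond to clusters
reach = opt.reachability_[opt.ordering_]
labels = opt.labels_
```

Plotting `reach` as a bar chart gives the reachability plot; each valley is a cluster, and valley depth reflects its density.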
Builds a hierarchy of clusters by progressively merging the closest pairs. Produces a dendrogram that can be cut at any level.
Key Features
Best for: Taxonomy creation, gene expression analysis
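The "cut the dendrogram at any level" step maps directly onto SciPy's hierarchy API — a sketch under the assumption of a SciPy/scikit-learn backend, with illustrative data:

```python
from scipy.cluster.hierarchy import linkage, fcluster
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=150, centers=3, cluster_std=0.6, random_state=2)

# Ward linkage merges the pair of clusters with the smallest variance increase
Z = linkage(X, method="ward")

# "Cut" the dendrogram to obtain exactly 3 flat clusters
labels = fcluster(Z, t=3, criterion="maxclust")
```

The same `Z` matrix can be re-cut at any other level (or rendered with `scipy.cluster.hierarchy.dendrogram`) without recomputing the merges.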
Balanced Iterative Reducing and Clustering using Hierarchies. Designed for very large datasets with limited memory.
Key Features
Best for: Very large datasets, streaming data
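BIRCH builds a compact CF-tree in one pass, then clusters the subclusters — which is also why it supports incremental `partial_fit` for streams. A hedged scikit-learn sketch (illustrative data and threshold):

```python
from sklearn.cluster import Birch
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=1000, centers=4, cluster_std=0.5, random_state=5)

# threshold bounds the radius of each CF subcluster; n_clusters sets the
# final grouping applied on top of the subclusters
birch = Birch(threshold=0.5, n_clusters=4)
labels = birch.fit_predict(X)
```

For streaming data, feed chunks via `birch.partial_fit(chunk)` instead of a single `fit`.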
A divisive (top-down) hierarchical clustering algorithm that repeatedly bisects clusters using K-Means until the desired number of clusters is reached.
Key Features
Best for: Document clustering, when hierarchy matters
Models data as a mixture of Gaussian distributions. Provides soft cluster assignments and can model elliptical clusters of different sizes.
Key Features
Best for: Statistical modeling, density estimation
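The soft assignments are the per-component posterior probabilities — a hedged scikit-learn sketch with illustrative elliptical blobs:

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=[(0, 0), (5, 0), (0, 5)],
                  cluster_std=[0.4, 0.9, 0.6], random_state=4)

# covariance_type="full" lets each component be an arbitrary ellipse
gmm = GaussianMixture(n_components=3, covariance_type="full", random_state=4)
gmm.fit(X)
proba = gmm.predict_proba(X)   # soft assignment: one probability per component
labels = proba.argmax(axis=1)  # harden if a single label is needed
```

`gmm.score_samples(X)` additionally gives log-densities, i.e. the density-estimation use the blurb mentions.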
Uses the eigenvectors of a similarity matrix to embed the data in a lower-dimensional space before clustering. Excellent for non-convex cluster shapes.
Key Features
Best for: Image segmentation, community detection
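The non-convex strength shows clearly on two-moons data, which defeats centroid methods — a hedged scikit-learn sketch with illustrative parameters:

```python
from sklearn.cluster import SpectralClustering
from sklearn.datasets import make_moons

# Non-convex shapes where centroid-based methods fail
X, _ = make_moons(n_samples=300, noise=0.05, random_state=6)

# A nearest-neighbor graph supplies the similarity matrix whose spectral
# embedding is then clustered with K-Means
sc = SpectralClustering(n_clusters=2, affinity="nearest_neighbors",
                        n_neighbors=10, random_state=6)
labels = sc.fit_predict(X)
```

The graph construction (`affinity`, `n_neighbors` or an RBF `gamma`) matters far more here than the final K-Means step.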
Identifies exemplars among data points by exchanging messages between pairs of points. Automatically determines the number of clusters.
Key Features
Best for: Finding representative examples, face clustering
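Because exemplars are real data points, the fitted model exposes their indices directly — a hedged scikit-learn sketch (illustrative data; `damping` raised for stable convergence):

```python
import numpy as np
from sklearn.cluster import AffinityPropagation
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=200, centers=3, cluster_std=0.5, random_state=8)

# preference (default: median similarity) steers how many exemplars emerge;
# higher damping smooths the message updates
ap = AffinityPropagation(damping=0.9, random_state=8)
labels = ap.fit_predict(X)
exemplars = ap.cluster_centers_indices_   # indices of actual data points
```

Lowering `preference` yields fewer exemplars; raising it yields more.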
A neural network approach that grows nodes to represent data topology. Adapts structure during training to match data distribution.
Key Features
Best for: Topology preservation, continuous learning
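The description matches the Growing Neural Gas algorithm (Fritzke, 1995). scikit-learn has no GNG estimator, so — assuming that is the algorithm meant — here is a simplified NumPy sketch with illustrative hyperparameters; pruning of isolated nodes is omitted for brevity:

```python
import numpy as np

def growing_neural_gas(X, max_nodes=30, n_passes=5, eps_b=0.2, eps_n=0.006,
                       a_max=50, lam=100, alpha=0.5, decay=0.995, seed=0):
    """Simplified GNG sketch: grow a graph of nodes that tracks the data."""
    rng = np.random.default_rng(seed)
    nodes = [X[rng.integers(len(X))].copy(), X[rng.integers(len(X))].copy()]
    errors = [0.0, 0.0]
    edges = {}            # (i, j) with i < j -> edge age
    step = 0
    for _ in range(n_passes):
        for x in X[rng.permutation(len(X))]:
            step += 1
            dists = [float(np.sum((x - w) ** 2)) for w in nodes]
            s1, s2 = (int(i) for i in np.argsort(dists)[:2])
            errors[s1] += dists[s1]
            nodes[s1] += eps_b * (x - nodes[s1])      # move winner toward x
            for (i, j) in list(edges):                # age winner's edges and
                if s1 in (i, j):                      # nudge its neighbours
                    edges[(i, j)] += 1
                    other = j if i == s1 else i
                    nodes[other] += eps_n * (x - nodes[other])
            edges[(min(s1, s2), max(s1, s2))] = 0     # refresh/create edge
            edges = {e: a for e, a in edges.items() if a <= a_max}
            # periodically grow a node where the model fits worst
            if step % lam == 0 and len(nodes) < max_nodes:
                q = int(np.argmax(errors))
                nbrs = [j if i == q else i for (i, j) in edges if q in (i, j)]
                if nbrs:
                    f = max(nbrs, key=lambda n: errors[n])
                    nodes.append(0.5 * (nodes[q] + nodes[f]))
                    errors[q] *= alpha
                    errors[f] *= alpha
                    errors.append(errors[q])
                    r = len(nodes) - 1
                    edges.pop((min(q, f), max(q, f)), None)
                    edges[(min(q, r), max(q, r))] = 0
                    edges[(min(f, r), max(f, r))] = 0
            errors = [e * decay for e in errors]
    return np.array(nodes), edges

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.3, (200, 2)), rng.normal(3.0, 0.3, (200, 2))])
nodes, edges = growing_neural_gas(X)
```

The returned edge graph is what "adapts structure during training" refers to: connected components of `edges` approximate the data topology.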
Start with K-Means for a quick baseline, then try Fuzzy C-Means if you suspect overlapping clusters.