K-Means Clustering
K-Means Clustering is an unsupervised learning algorithm used to partition a dataset into K distinct clusters based on feature similarity. The algorithm aims to minimize the variance within each cluster while maximizing the distance between clusters.
The algorithm works in the following steps:
- Initialization: Randomly select K centroids.
- Assignment: Assign each data point to the nearest centroid.
- Update: Recalculate the centroids based on current assignments.
- Repeat: Continue until convergence.
K-Means is widely used for customer segmentation, image compression, and anomaly detection.