K-Means-AS node

Last updated: Feb 11, 2025


K-Means is one of the most commonly used clustering algorithms. It partitions data points into a predefined number of clusters, k, assigning each point to the cluster whose center is nearest. The K-Means-AS node in SPSS Modeler is implemented in Spark.
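
To illustrate the general idea, the following is a minimal sketch of the standard k-means iteration (Lloyd's algorithm) in plain Python. It is not the node's Spark implementation; the function name kmeans, its parameters, and the sample data are assumptions made for this example only.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length numeric tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest_center(point, centers):
    """Index of the center closest to the given point."""
    return min(range(len(centers)), key=lambda i: squared_distance(point, centers[i]))


def kmeans(points, k, iterations=20, seed=0):
    """Illustrative k-means: returns (centers, labels) for a list of numeric tuples."""
    rng = random.Random(seed)
    # Use k randomly chosen input points as the initial centers.
    centers = rng.sample(points, k)
    for _ in range(iterations):
        # Assignment step: group each point with its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest_center(p, centers)].append(p)
        # Update step: move each center to the mean of its assigned points.
        new_centers = []
        for i, members in enumerate(clusters):
            if members:
                new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                # Keep the old center if no points were assigned to it.
                new_centers.append(centers[i])
        if new_centers == centers:
            break  # Centers stopped moving, so the assignment is stable.
        centers = new_centers
    labels = [nearest_center(p, centers) for p in points]
    return centers, labels


# Hypothetical sample data: two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers, labels = kmeans(points, k=2)
```

Here k is fixed in advance, which corresponds to the "predefined number of clusters" described above. The cluster labels themselves are arbitrary; only the grouping is meaningful.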

For more information about k-means algorithms, see Clustering.¹

Note: The K-Means-AS node automatically applies one-hot encoding to categorical variables.
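
As a rough illustration of what one-hot encoding means, the sketch below turns a categorical column into one 0/1 indicator column per category, which lets a distance-based algorithm such as k-means work with it. The function name one_hot_encode and the sample values are hypothetical; how the node orders categories or names the resulting fields is not stated here and is not implied by this example.

```python
def one_hot_encode(values):
    """Return (categories, rows), where each row has a single 1 marking its category."""
    # Sorting the categories is an assumption made for reproducibility in this sketch.
    categories = sorted(set(values))
    index = {category: i for i, category in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1
        rows.append(row)
    return categories, rows


# Hypothetical categorical field with three distinct values.
colors = ["red", "green", "red", "blue"]
categories, encoded = one_hot_encode(colors)
```

Each encoded row contains exactly one 1, so two records with the same category are at distance zero on these columns, and records with different categories are equally far apart.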

¹ "Clustering - RDD-based API." Apache Spark. MLlib: Main Guide. Aug 2024.
