
K-Means Cluster Evaluator Mod for Spotfire® Data Science - Team Studio Release 1.0.0



This offering is a Team Studio Mod (a custom operator for Spotfire® Data Science - Team Studio) that calculates metrics to help users choose the optimal number of clusters (the K value) when applying K-Means to a given input dataset.



The K-Means Cluster Evaluator operator calculates metrics for a range of K values, as specified in the input. These metrics are the Dunn Index and the Within Set Sum of Squared Errors (WSSE). A dataset containing the calculated metrics for all K values in the requested range is returned.
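To illustrate the kind of output described above, here is a minimal pure-Python sketch that computes WSSE and a Dunn Index for each K in a range. It is not the operator's implementation (which runs on Spark ML); the function names `kmeans`, `wsse`, `dunn_index`, and `evaluate_k_range` are hypothetical, and the Dunn Index variant used here (smallest distance between points in different clusters divided by the largest distance between points in the same cluster) is an assumption, since the page does not state which definition the operator uses.

```python
import math
import random


def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points]


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm (illustrative only); returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    for _ in range(iters):
        labels = assign(points, centroids)
        updated = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                updated.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                updated.append(centroids[c])  # keep the centroid of an empty cluster
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(points, centroids)


def wsse(points, centroids, labels):
    """Within Set Sum of Squared Errors: squared distance of each point to its centroid."""
    return sum(math.dist(p, centroids[l]) ** 2 for p, l in zip(points, labels))


def dunn_index(points, labels):
    """Assumed variant: min inter-cluster point distance / max intra-cluster diameter."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    groups = list(clusters.values())
    if len(groups) < 2:
        return float("nan")  # undefined for a single cluster
    max_diameter = max(math.dist(a, b) for g in groups for a in g for b in g)
    min_separation = min(math.dist(a, b)
                         for i, g in enumerate(groups)
                         for h in groups[i + 1:]
                         for a in g for b in h)
    return min_separation / max_diameter if max_diameter > 0 else float("inf")


def evaluate_k_range(points, k_min, k_max, seed=0):
    """Return one row of metrics per K, mirroring the dataset the operator returns."""
    rows = []
    for k in range(k_min, k_max + 1):
        centroids, labels = kmeans(points, k, seed=seed)
        rows.append({"k": k,
                     "wsse": wsse(points, centroids, labels),
                     "dunn": dunn_index(points, labels)})
    return rows
```

Calling `evaluate_k_range(points, 2, 10)` on a list of numeric tuples yields one row per K. A common way to read such a table is to look for the K where WSSE stops dropping sharply (the "elbow") and where the Dunn Index is relatively high.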


The operator is designed to run on a Hadoop cluster using Spark ML. More information about how to evaluate the optimal K for K-Means can be found in this 


To start using this Mod (custom operator) in the Team Studio environment, please follow this Knowledge Base article for installation guidelines.



Release 1.0.0

Published: September 2020

Initial release includes:

  • Custom operator jar file
  • Documentation for the operator
  • License file
