Ho Yeung Lee <jobmattcon at gmail.com> writes:

> actually i am using python's kmeans library. it is relevant in python

I agree that it was not bad to ask the question here.

However, the provided answer (apart from the "off topic" -- i.e. the
reference to
"https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set")
was excellent. It tells you that the cluster number is a general
parameter for the "kmeans" algorithm (independent of any implementation)
and how you can get some approximations.
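
To illustrate what such an approximation can look like, here is a minimal
sketch of the "elbow" heuristic, one of the approaches described in that
Wikipedia article. It does not use the library from the original question:
the helpers make_blobs, kmeans and elbow_k are hypothetical names written for
this example, the data is synthetic, and the k-means is a plain Lloyd's
iteration with a deterministic farthest-point start (an assumption made to
keep the example reproducible). Picking the largest second difference of the
inertia is only one simple way to locate the bend.

```python
import random


def make_blobs(centers=((0.0, 0.0), (10.0, 0.0), (5.0, 9.0)),
               n=30, spread=0.5, seed=42):
    """Synthetic 2-D data: n Gaussian points around each (hypothetical) center."""
    rng = random.Random(seed)
    return [(rng.gauss(cx, spread), rng.gauss(cy, spread))
            for cx, cy in centers for _ in range(n)]


def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(col) / len(cluster) for col in zip(*cluster))


def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm; returns (centroids, inertia).

    Initialisation is farthest-point: start from the first point, then
    repeatedly add the point farthest from all centroids chosen so far.
    """
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(dist2(p, c) for c in centroids)))

    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist2(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        updated = [mean(c) if c else centroids[i]
                   for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated

    # Inertia: sum of squared distances to the nearest centroid.
    inertia = sum(min(dist2(p, c) for c in centroids) for p in points)
    return centroids, inertia


def elbow_k(inertias):
    """Return the k with the sharpest bend in the inertia curve.

    inertias maps consecutive k values to inertia; the bend is measured
    as the second difference I(k-1) - 2*I(k) + I(k+1).
    """
    ks = sorted(inertias)
    best_k, best_bend = None, float("-inf")
    for prev, k, nxt in zip(ks, ks[1:], ks[2:]):
        bend = inertias[prev] - 2 * inertias[k] + inertias[nxt]
        if bend > best_bend:
            best_k, best_bend = k, bend
    return best_k


points = make_blobs()
inertias = {k: kmeans(points, k)[1] for k in range(1, 7)}
for k in sorted(inertias):
    print(k, round(inertias[k], 1))
print("suggested number of clusters:", elbow_k(inertias))
```

On this toy data, built from three well separated blobs, the bend should
fall at k = 3. On real data the curve is often much smoother, so the result
is better read as a hint than as the answer.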