I have dataset with two label class (good and bad), I want to apply K Means on my dataset using python, should I use that label dataset or I have to delete the label class column ?
Asked
Active
Viewed 72 times
0
1 Answers
1
Delete the label column.
Assuming that you want to compare the clusters to the labels later, then the labels must not be part of the data passed to k-means.
And k-means only works well on continuous variables anyway.

Has QUIT--Anony-Mousse
- 7,999
- 1
- 14
- 31
binary feature
. – Louis T Feb 09 '19 at 09:49