0

I have dataset with two label class (good and bad), I want to apply K Means on my dataset using python, should I use that label dataset or I have to delete the label class column ?

Has QUIT--Anony-Mousse
  • 7,999
  • 1
  • 14
  • 31
lona
  • 119
  • 3

1 Answers1

1

Delete the label column.

Assuming that you want to compare the clusters to the labels later, then the labels must not be part of the data passed to k-means.

And k-means only works well on continuous variables anyway.

Has QUIT--Anony-Mousse
  • 7,999
  • 1
  • 14
  • 31