Chapter 27: Problem 22
The K-means algorithm uses a similarity metric of distance between a record and a cluster centroid. If the attributes of the records are not quantitative but categorical in nature, such as Income Level with values \\{low, medium, hight or Married with values \\{Yes, Nof or State of Residence with values \\{Alabama, Alaska, \(\ldots,\) Wyoming then the distance metric is not meaningful. Define a more suitable similarity metric that can be used for clustering data records that contain categorical data.
Short Answer
Step by step solution
Key Concepts
These are the key concepts you need to understand to accurately answer the question.