{
       "Question number": "1",
       "Sub-Question number": "null",
       "Question": "Imagine that you have N data and you wish to find K clusters using K-Means++. Assuming that N > K, can the K-Means++ algorithm choose the same datum twice to become a cluster center? Why or why not?",
       "Solution": "The K-Means++ algorithm will never choose the same datum twice to become a center. This is because, after the first center is picked, each new center is sampled from a distribution over the data in which a datum's probability is proportional to its squared distance to the closest already-chosen center. A datum that is already a cluster center has distance zero to itself, so this distribution assigns it zero probability."
}