{
       "Question number": "4",
       "Sub-Question number": "null",
       "Question": "Suppose that we have a data set and we train two support vector machines as follows. We train the first SVM on a random subset of the data. Then we add the remainder of the data and train another SVM on the complete data set. How might the size of the optimal margin change from the first to the second SVM? Would you expect it to increase, decrease, stay the same, or do something else? Provide an explanation and/or diagrams to make your case.",
       "Solution": "The margin for the full dataset will decrease or stay the same. Specifically, if the subset contains the support vectors from the full data set, the margin for the full data set stays the same; otherwise, the margin for the full data set decreses.\nMore data points means more contraints. The margin found on the full data set satis- fies all the classfication contraints in the subset problem, but the solution may not be optimized for the subset. One can also illustrate this with diagrams."
}