Abstract: In many practical domains, applying machine learning is challenging not due to the lack of available data, but because labeled samples are in short supply. A common approach for obtaining additional labeled samples is co-training, a semi-supervised learning setting where two learners (agents), trained on different perspectives of the data, iteratively label additional samples. The rationale of this approach is that the different learner perspectives will produce a more diverse labeled set, resulting in more effective classifiers. While co-training proved effective in multiple cases, the labeling mechanisms used by existing approaches are heuristic and error-prone. We propose CoMet, a meta learning-based co-training algorithm. CoMet utilizes meta-models trained on previously-analyzed datasets to select the samples to be labeled for the current dataset. Our experiments, conducted on 35 datasets, show that CoMet significantly outperforms the standard co-training approach.
Loading