Learning and Classifying Under Hard Budgets

Aloak Kapoor, Russell Greiner

2005 (modified: 03 Nov 2022)ECML 2005Readers: Everyone

Abstract: Since resources for data acquisition are seldom infinite, both learners and classifiers must act intelligently under hard budgets. In this paper, we consider problems in which feature values are unknown to both the learner and classifier, but can be acquired at a cost. Our goal is a learner that spends its fixed learning budget b L acquiring training data, to produce the most accurate “active classifier” that spends at most b C per instance. To produce this fixed-budget classifier, the fixed-budget learner must sequentially decide which feature values to collect to learn the relevant information about the distribution. We explore several approaches the learner can take, including the standard “round robin” policy (purchasing every feature of every instance until the b L budget is exhausted). We demonstrate empirically that round robin is problematic (especially for small b L ), and provide alternate learning strategies that achieve superior performance on a variety of datasets.

0 Replies