We're Not in Kansas Anymore: Detecting Domain Changes in Streams

Mark Dredze, Tim Oates, Christine D. Piatko

2010 (modified: 10 Nov 2022)EMNLP 2010Readers: Everyone

Abstract: Domain adaptation, the problem of adapting a natural language processing system trained in one domain to perform well in a different domain, has received significant attention. This paper addresses an important problem for deployed systems that has received little attention - detecting when such adaptation is needed by a system operating in the wild, i.e., performing classification over a stream of unlabeled examples. Our method uses A-distance, a metric for detecting shifts in data streams, combined with classification margins to detect domain shifts. We empirically show effective domain shift detection on a variety of data sets and shift conditions.

0 Replies