Concept drift detection and adaptation with hierarchical hypothesis testing |
| |
Authors: | Shujian Yu, Zubin Abraham, Heng Wang, Mohak Shah, Yantao Wei, José C. Príncipe |
| |
Affiliation: | 1. Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611, USA; 2. Robert Bosch LLC Research and Technology Center, Sunnyvale, CA 94085, USA; 3. MZ Inc. (formerly Machine Zone) Research, Palo Alto, CA 94304, USA; 4. University of Illinois at Chicago, Chicago, IL 60637, USA; 5. School of Educational Information Technology, Central China Normal University, Wuhan 430079, China |
| |
Abstract: | A fundamental issue for statistical classification models in a streaming environment is that the joint distribution of predictor and response variables changes over time (a phenomenon known as concept drift), which can cause classification performance to deteriorate dramatically. In this paper, we first present a hierarchical hypothesis testing (HHT) framework that can detect and adapt to various types of concept drift (e.g., recurrent or irregular, gradual or abrupt), even in the presence of imbalanced data labels. We then implement a novel concept drift detector, Hierarchical Linear Four Rates (HLFR), under the HHT framework. By replacing a widely acknowledged retraining scheme with an adaptive training strategy, we further demonstrate that the concept drift adaptation capability of HLFR can be significantly boosted. A theoretical analysis of the Type-I and Type-II errors of HLFR is also provided. Experiments on both simulated and real-world datasets show that our methods outperform state-of-the-art methods in terms of detection precision, detection delay, and adaptability across different concept drift types. |
| |
This article is indexed in ScienceDirect and other databases. |
|
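The abstract does not spell out how the layers of the hierarchical test are built, so the following is only a minimal sketch of one possible two-stage drift check and not the authors' HLFR. It assumes a stream of 0/1 prediction errors, a cheap first-stage two-proportion z-test on the error rate of two adjacent windows, and a second-stage permutation test that confirms or rejects each suspected drift. All names and parameters (`detect_drift`, `window`, `z_threshold`, `alpha`, `n_perm`) are hypothetical choices made for illustration.

```python
import math
import random


def two_proportion_z(errors_ref, errors_new):
    """Stage 1 statistic: z-score for the difference between two error rates."""
    n1, n2 = len(errors_ref), len(errors_new)
    p1, p2 = sum(errors_ref) / n1, sum(errors_new) / n2
    pooled = (sum(errors_ref) + sum(errors_new)) / (n1 + n2)
    var = pooled * (1 - pooled) * (1 / n1 + 1 / n2)
    if var == 0:
        return 0.0  # both windows are all-correct or all-wrong: no evidence
    return (p2 - p1) / math.sqrt(var)


def permutation_p_value(errors_ref, errors_new, n_perm=1000, rng=None):
    """Stage 2: permutation test on the absolute difference in mean error."""
    rng = rng or random.Random(0)  # fixed seed keeps the sketch reproducible
    n_ref = len(errors_ref)
    observed = abs(sum(errors_new) / len(errors_new) - sum(errors_ref) / n_ref)
    pooled = list(errors_ref) + list(errors_new)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        ref, new = pooled[:n_ref], pooled[n_ref:]
        if abs(sum(new) / len(new) - sum(ref) / len(ref)) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (n_perm + 1)


def detect_drift(errors, window=50, z_threshold=3.0, alpha=0.01):
    """Return the number of samples seen when a drift is first confirmed, or None."""
    for t in range(2 * window, len(errors) + 1):
        ref = errors[t - 2 * window:t - window]  # older reference window
        new = errors[t - window:t]               # most recent window
        # Stage 1: cheap screening test raises a *suspected* drift.
        if abs(two_proportion_z(ref, new)) > z_threshold:
            # Stage 2: costlier test confirms or rejects the suspicion.
            if permutation_p_value(ref, new) < alpha:
                return t
    return None


# Toy streams: a steady 10% error rate, then a jump to 50% after sample 200.
stable = [1 if i % 10 == 0 else 0 for i in range(300)]
drifting = stable[:200] + [1 if i % 2 == 0 else 0 for i in range(100)]
print(detect_drift(stable), detect_drift(drifting))
```

Running the expensive second test only on suspected points is what keeps such a two-stage scheme practical on a stream: the first stage can afford a loose threshold because its false alarms are filtered before any adaptation is triggered.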