Change detection in streaming multivariate data using likelihood detectors

Research output: Contribution to journalArticlepeer-review

Standard Standard

Change detection in streaming multivariate data using likelihood detectors. / Kuncheva, L.I.
In: IEEE Transactions on Knowledge and Data Engineering, Vol. 25, No. 5, 01.05.2013, p. 1175-1180.

Research output: Contribution to journalArticlepeer-review

HarvardHarvard

Kuncheva, LI 2013, 'Change detection in streaming multivariate data using likelihood detectors', IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 5, pp. 1175-1180. https://doi.org/10.1109/TKDE.2011.226

APA

CBE

MLA

VancouverVancouver

Kuncheva LI. Change detection in streaming multivariate data using likelihood detectors. IEEE Transactions on Knowledge and Data Engineering. 2013 May 1;25(5):1175-1180. doi: 10.1109/TKDE.2011.226

Author

Kuncheva, L.I. / Change detection in streaming multivariate data using likelihood detectors. In: IEEE Transactions on Knowledge and Data Engineering. 2013 ; Vol. 25, No. 5. pp. 1175-1180.

RIS

TY - JOUR

T1 - Change detection in streaming multivariate data using likelihood detectors

AU - Kuncheva, L.I.

PY - 2013/5/1

Y1 - 2013/5/1

N2 - Change detection in streaming data relies on a fast estimation of the probability that the data in two consecutive windows come from different distributions. Choosing the criterion is one of the multitude of questions that need to be addressed when designing a change detection procedure. This paper gives a log-likelihood justification for two well-known criteria for detecting change in streaming multidimensional data: Kullback-Leibler (K-L) distance and Hotelling's T-square test for equal means (H). We propose a semiparametric log-likelihood criterion (SPLL) for change detection. Compared to the existing log-likelihood change detectors, SPLL trades some theoretical rigor for computation simplicity. We examine SPLL together with K-L and H on detecting induced change on 30 real data sets. The criteria were compared using the area under the respective Receiver Operating Characteristic (ROC) curve (AUC). SPLL was found to be on the par with H and better than K-L for the nonnormalized data, and better than both on the normalized data.

AB - Change detection in streaming data relies on a fast estimation of the probability that the data in two consecutive windows come from different distributions. Choosing the criterion is one of the multitude of questions that need to be addressed when designing a change detection procedure. This paper gives a log-likelihood justification for two well-known criteria for detecting change in streaming multidimensional data: Kullback-Leibler (K-L) distance and Hotelling's T-square test for equal means (H). We propose a semiparametric log-likelihood criterion (SPLL) for change detection. Compared to the existing log-likelihood change detectors, SPLL trades some theoretical rigor for computation simplicity. We examine SPLL together with K-L and H on detecting induced change on 30 real data sets. The criteria were compared using the area under the respective Receiver Operating Characteristic (ROC) curve (AUC). SPLL was found to be on the par with H and better than K-L for the nonnormalized data, and better than both on the normalized data.

U2 - 10.1109/TKDE.2011.226

DO - 10.1109/TKDE.2011.226

M3 - Article

VL - 25

SP - 1175

EP - 1180

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 5

ER -