Diversity techniques improve the performance of the best imbalance learning ensembles

Research output: Contribution to journalArticlepeer-review

Standard Standard

Diversity techniques improve the performance of the best imbalance learning ensembles. / Diez-Pastor, J.F.; Rodriguez, J.J.; Garcia-Osorio, C.I. et al.
In: Information Sciences, Vol. 325, 11.07.2015, p. 98-117.

Research output: Contribution to journalArticlepeer-review

HarvardHarvard

Diez-Pastor, JF, Rodriguez, JJ, Garcia-Osorio, CI & Kuncheva, LI 2015, 'Diversity techniques improve the performance of the best imbalance learning ensembles', Information Sciences, vol. 325, pp. 98-117. https://doi.org/10.1016/j.ins.2015.07.025

APA

Diez-Pastor, J. F., Rodriguez, J. J., Garcia-Osorio, C. I., & Kuncheva, L. I. (2015). Diversity techniques improve the performance of the best imbalance learning ensembles. Information Sciences, 325, 98-117. https://doi.org/10.1016/j.ins.2015.07.025

CBE

MLA

VancouverVancouver

Diez-Pastor JF, Rodriguez JJ, Garcia-Osorio CI, Kuncheva LI. Diversity techniques improve the performance of the best imbalance learning ensembles. Information Sciences. 2015 Jul 11;325:98-117. doi: 10.1016/j.ins.2015.07.025

Author

Diez-Pastor, J.F. ; Rodriguez, J.J. ; Garcia-Osorio, C.I. et al. / Diversity techniques improve the performance of the best imbalance learning ensembles. In: Information Sciences. 2015 ; Vol. 325. pp. 98-117.

RIS

TY - JOUR

T1 - Diversity techniques improve the performance of the best imbalance learning ensembles

AU - Diez-Pastor, J.F.

AU - Rodriguez, J.J.

AU - Garcia-Osorio, C.I.

AU - Kuncheva, L.I.

PY - 2015/7/11

Y1 - 2015/7/11

N2 - Many real-life problems can be described as unbalanced, where the number of instances belonging to one of the classes is much larger than the numbers in other classes. Examples are spam detection, credit card fraud detection or medical diagnosis. Ensembles of classifiers have acquired popularity in this kind of problems for their ability to obtain better results than individual classifiers. The most commonly used techniques by those ensembles especially designed to deal with imbalanced problems are for example Re-weighting, Oversampling and Undersampling. Other techniques, originally intended to increase the ensemble diversity, have not been systematically studied for their effect on imbalanced problems. Among these are Random Oracles, Disturbing Neighbors, Random Feature Weights or Rotation Forest. This paper presents an overview and an experimental study of various ensemble-based methods for imbalanced problems, the methods have been tested in its original form and in conjunction with several diversity-increasing techniques, using 84 imbalanced data sets from two well known repositories. This paper shows that these diversity-increasing techniques significantly improve the performance of ensemble methods for imbalanced problems and provides some ideas about when it is more convenient to use these diversifying techniques

AB - Many real-life problems can be described as unbalanced, where the number of instances belonging to one of the classes is much larger than the numbers in other classes. Examples are spam detection, credit card fraud detection or medical diagnosis. Ensembles of classifiers have acquired popularity in this kind of problems for their ability to obtain better results than individual classifiers. The most commonly used techniques by those ensembles especially designed to deal with imbalanced problems are for example Re-weighting, Oversampling and Undersampling. Other techniques, originally intended to increase the ensemble diversity, have not been systematically studied for their effect on imbalanced problems. Among these are Random Oracles, Disturbing Neighbors, Random Feature Weights or Rotation Forest. This paper presents an overview and an experimental study of various ensemble-based methods for imbalanced problems, the methods have been tested in its original form and in conjunction with several diversity-increasing techniques, using 84 imbalanced data sets from two well known repositories. This paper shows that these diversity-increasing techniques significantly improve the performance of ensemble methods for imbalanced problems and provides some ideas about when it is more convenient to use these diversifying techniques

U2 - 10.1016/j.ins.2015.07.025

DO - 10.1016/j.ins.2015.07.025

M3 - Article

VL - 325

SP - 98

EP - 117

JO - Information Sciences

JF - Information Sciences

SN - 0020-0255

ER -