Open Access
2014 A Hybrid Sampling SVM Approach to Imbalanced Data Classification
Qiang Wang
Abstr. Appl. Anal. 2014(SI11): 1-7 (2014). DOI: 10.1155/2014/972786


Imbalanced datasets are frequently found in many real applications. Resampling is one of the effective solutions due to generating a relatively balanced class distribution. In this paper, a hybrid sampling SVM approach is proposed combining an oversampling technique and an undersampling technique for addressing the imbalanced data classification problem. The proposed approach first uses an undersampling technique to delete some samples of the majority class with less classification information and then applies an oversampling technique to gradually create some new positive samples. Thus, a balanced training dataset is generated to replace the original imbalanced training dataset. Finally, through experimental results on the real-world datasets, our proposed approach has the ability to identify informative samples and deal with the imbalanced data classification problem.


Download Citation

Qiang Wang. "A Hybrid Sampling SVM Approach to Imbalanced Data Classification." Abstr. Appl. Anal. 2014 (SI11) 1 - 7, 2014.


Published: 2014
First available in Project Euclid: 6 October 2014

zbMATH: 07023430
Digital Object Identifier: 10.1155/2014/972786

Rights: Copyright © 2014 Hindawi

Vol.2014 • No. SI11 • 2014
Back to Top