Abstract
Feature selection is one of the key problems in machine learning and data mining. It aims to identify a subset of the most useful features that yields results comparable to those obtained with the original full feature set. Feature selection can reduce the dimensionality of the data, speed up the learning process, and produce comprehensible models with good generalization performance. Recently, ensemble techniques have been applied to feature selection by integrating multiple base feature selectors into a single ensemble model. In this paper, to improve the efficiency of feature selection on large-scale, high-dimensional, and imbalanced problems, we propose Min-Max Ensemble Feature Selection (M2-EFS), which is based on balanced data partitioning and a min-max ensemble strategy. Experimental results demonstrate that M2-EFS outperforms other classical ensemble methods in most cases, especially on large-scale, high-dimensional, and imbalanced data.
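The general scheme named in the abstract can be illustrated with a minimal sketch. The code below is an assumption-laden illustration, not the paper's algorithm (which is behind the access wall): it partitions the majority class into minority-sized chunks to form balanced subsets, scores features on each subset with a hypothetical base selector (absolute class-mean difference), and combines the per-partition scores with a conservative minimum, one plausible reading of a "min-max" style combination; the function names `balanced_partitions`, `score_features`, and `m2_efs_sketch` are all invented for illustration.

```python
import numpy as np


def balanced_partitions(X, y, rng):
    """Yield balanced subsets: the full minority class paired with
    successive minority-sized chunks of the majority class."""
    pos = np.where(y == 1)[0]
    neg = np.where(y == 0)[0]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    majority = majority.copy()
    rng.shuffle(majority)
    k = max(1, len(majority) // len(minority))
    for chunk in np.array_split(majority, k):
        idx = np.concatenate([minority, chunk])
        yield X[idx], y[idx]


def score_features(X, y):
    """Hypothetical base selector: absolute difference of per-class
    feature means (larger = more discriminative)."""
    return np.abs(X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0))


def m2_efs_sketch(X, y, top_k, seed=0):
    """Sketch of a min-style ensemble: a feature's combined score is its
    worst (minimum) score across the balanced partitions, so a selected
    feature must be useful on every partition."""
    rng = np.random.default_rng(seed)
    scores = [score_features(Xi, yi)
              for Xi, yi in balanced_partitions(X, y, rng)]
    combined = np.min(np.vstack(scores), axis=0)
    return np.argsort(combined)[::-1][:top_k]
```

The min-combination here covers only half of a true min-max rule; the paper's exact combination of base selectors is not reproduced.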
