Feature Selection Method on Imbalanced Text
-
Graphical Abstract
-
Abstract
After analyzing the four basic information elements of traditional feature selection methods, a new measurement of strong class information is introduced and a new feature selection method is proposed for imbalanced text classification. The strong class information and the frequency of terms are used to improve the classification performance of minority classes and majority classes respectively. The experiments on reuter-21578 dataset show that the proposed method is better than IG and CHI. Both Micro F1 and Macro F1 are improved to some degree.
-
-