Paper
15 November 2011 Feature selection combined category concentration degree with minimal set covering
Hao-dong Zhu, Hong-chan Li
Author Affiliations +
Proceedings Volume 8335, 2012 International Workshop on Image Processing and Optical Engineering; 83350K (2011) https://doi.org/10.1117/12.917518
Event: 2012 International Workshop on Image Processing and Optical Engineering, 2012, Harbin, China
Abstract
Feature selection is the core research topic in text categorization. Selected feature subset directly influences results of text categorization. Firstly, word frequency and document frequency were analyzed. And then, the category concentration degree based on word frequency and document frequency was proposed. Next, set covering was introduced into rough sets and an attribute reduction algorithm based on minimal set covering was provided. Finally, a new feature selection method combined the proposed category concentration degree with the provided attribute reduction algorithm was presented. The presented feature selection method firstly uses the proposed category concentration degree to select features and filter out some terms to reduce the sparsity of feature spaces, and then employs the provided attribute reduction algorithm to eliminate redundancy, so that the more representative feature subset was acquired. The experimental results show that presented feature selection method is better than the three classical feature selection methods: information gain (IG), x2 statistics (CHI), mutual information (MI) in time performance, macro-average F1 and micro-average F1.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hao-dong Zhu and Hong-chan Li "Feature selection combined category concentration degree with minimal set covering", Proc. SPIE 8335, 2012 International Workshop on Image Processing and Optical Engineering, 83350K (15 November 2011); https://doi.org/10.1117/12.917518
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature selection

Algorithms

Detection and tracking algorithms

Analytical research

Classification systems

Pattern recognition

Data modeling

Back to Top