• Journal of Internet Computing and Services
    ISSN 2287 - 1136(Online) / ISSN 1598 - 0170 (Print)
    http://jics.or.kr/

Combined Feature Set and Hybrid Feature Selection Method for Effective Document Classification


Joo-Ho In, Jung-Ho Kim, Soo-Hoan Chae, Journal of Internet Computing and Services, Vol. 14, No. 5, pp. 49-58, Oct. 2013
10.7472/jksii.2013.14.5.49, Full Text:
Keywords: Document Classification, Feature selection, mixed feature set, LSA, hybrid feature selection

Abstract

A novel approach for the feature selection is proposed, which is the important preprocessing task of on-line document classification. In previous researches, the features based on information from their single population for feature selection task have been selected. In this paper, a mixed feature set is constructed by selecting features from multi-population as well as single population based on various information. The mixed feature set consists of two feature sets: the original feature set that is made up of words on documents and the transformed feature set that is made up of features generated by LSA. The hybrid feature selection method using both filter and wrapper method is used to obtain optimal features set from the mixed feature set. We performed classification experiments using the obtained optimal feature sets. As a result of the experiments, our expectation that our approach makes better performance of classification is verified, which is over 90% accuracy. In particular, it is confirmed that our approach has over 90% recall and precision that have a low deviation between categories.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Joo-Ho In, Jung-Ho Kim, & Soo-Hoan Chae (2013). Combined Feature Set and Hybrid Feature Selection Method for Effective Document Classification. Journal of Internet Computing and Services, 14(5), 49-58. DOI: 10.7472/jksii.2013.14.5.49.

[IEEE Style]
J. In, J. Kim and S. Chae, "Combined Feature Set and Hybrid Feature Selection Method for Effective Document Classification," Journal of Internet Computing and Services, vol. 14, no. 5, pp. 49-58, 2013. DOI: 10.7472/jksii.2013.14.5.49.

[ACM Style]
Joo-Ho In, Jung-Ho Kim, and Soo-Hoan Chae. 2013. Combined Feature Set and Hybrid Feature Selection Method for Effective Document Classification. Journal of Internet Computing and Services, 14, 5, (2013), 49-58. DOI: 10.7472/jksii.2013.14.5.49.