Paper
2 June 2012 An experimental study for Arabic text classification techniques
Bassam Al-Shargabi, Fekry Olayah
Author Affiliations +
Proceedings Volume 8334, Fourth International Conference on Digital Image Processing (ICDIP 2012); 83340H (2012) https://doi.org/10.1117/12.946039
Event: Fourth International Conference on Digital Image Processing (ICDIP 2012), 2012, Kuala Lumpur, Malaysia
Abstract
Several algorithms have been implemented to resolve the problem of text categorization. Most of the work in this area geared for English text, whereas few researches have been conducted on Arabic text. However, the nature of Arabic text is different than English text; pre-processing of Arabic text are more challenging. In this paper an experimental study was conducted on three techniques for Arabic text classification; these techniques, Discriminative Multinominal Naive Bayes (DMNB), Naïve Bayesian (NB) and IBK Algorithms, The paper aimed to assess the accuracy for each classifier and to determine which classifier is more accurate for Arabic text classification based on stop words elimination. The accuracy for each classifier is measured by Percentage split method (holdout), and K-fold cross validation methods, along with the time needed to classify Arabic text.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Bassam Al-Shargabi and Fekry Olayah "An experimental study for Arabic text classification techniques", Proc. SPIE 8334, Fourth International Conference on Digital Image Processing (ICDIP 2012), 83340H (2 June 2012); https://doi.org/10.1117/12.946039
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Classification systems

Data modeling

Data mining

Information technology

Machine learning

Precision measurement

Medicine

Back to Top