|
International Journal of Information and Communication Technology Research (IJICT، جلد ۲، شماره ۴، صفحات ۱۹-۳۱
|
|
|
عنوان فارسی |
|
|
چکیده فارسی مقاله |
|
|
کلیدواژههای فارسی مقاله |
|
|
عنوان انگلیسی |
Document Classification Using Novel Competitive Neural Text Classifier |
|
چکیده انگلیسی مقاله |
Text categorization is one of the well studied problems in data mining and information retrieval. Given a large quantity of documents in a data set where each document is associated with its corresponding category. This research proposes a novel approach for English and Persian documents classification with using novel method that combined competitive neural text categorizer with new vectors that we called, string vectors. Traditional approaches to text categorization require encoding documents into numerical vectors which leads to the two main problems: huge dimensionality and sparse distribution. Although many various feature selection methods are developed to address the first problem, the reduced dimension remains still large. If the dimension is reduced excessively by a feature selection method, robustness of document categorization is degraded. The idea of this research as the solution to the problems is to encode the documents into string vectors and apply it to the novel competitive neural text categorizer as a string vector. Extensive experiments based on several benchmarks are conducted. The results indicated that this method can significantly improve the performance of documents classification up to 13.8% in comparison to best traditional algorithm on standard Reuter 21578 dataset. |
|
کلیدواژههای انگلیسی مقاله |
|
|
نویسندگان مقاله |
| Seyyed Mohammad Reza Farshchi Artificial Intelligence Department, and Advance Research Center (ARC) Islamic Azad University Mashhad Branch, Iran
| Mohammad Bagher Naghibi Sistani Electrical Engineering Department Ferdowsi University of Mashhad Mashhad, Iran
|
|
نشانی اینترنتی |
http://ijict.itrc.ac.ir/browse.php?a_code=A-10-27-217&slc_lang=en&sid=1 |
فایل مقاله |
اشکال در دسترسی به فایل - ./files/site1/rds_journals/417/article-417-1212505.pdf |
کد مقاله (doi) |
|
زبان مقاله منتشر شده |
en |
موضوعات مقاله منتشر شده |
فناوری اطلاعات |
نوع مقاله منتشر شده |
پژوهشی |
|
|
برگشت به:
صفحه اول پایگاه |
نسخه مرتبط |
نشریه مرتبط |
فهرست نشریات
|