جامعة السويس

كلية الحاسبات والمعلومات

Machine learning for text document classification-efficient classification approach

تاريخ النشر

05/01/2024 12:00:00 ص

المؤلفون

Sura I. Mohammed Ali, Marwah Nihad, Hussien Mohamed Sharaf, Haitham Farouk

الوصف

[1] S. I. Mohammed Ali, M. Nihad, H. Mohamed Sharaf, and H. Farouk, “Machine learning for text document classification-efficient classification approach,” IJ-AI, vol. 13, no. 1, pp. 703–710, Mar. 2024, doi: 10.11591/ijai.v13.i1.pp703-710.

URL

DOI

الملخص

Numerous alternative methods for text classification have been created because of the increase in the amount of online text information available. The cosine similarity classifier is the most extensively utilized simple and efficient approach. It improves text classification performance. It is combined with estimated values provided by conventional classifiers such as Multinomial Naive Bayesian (MNB). Consequently, combining the similarity between a test document and a category with the estimated value for the category enhances the performance of the classifier. This approach provides a text document categorization method that is both efficient and effective. In addition, methods for determining the proper relationship between a set of words in a document and its document categorization is also obtained.

الكلمات الدالة

Cosine similarity Information retrieval Machine learning Multinomial naïve bayesian Text documents classifiers