CIVILICA We Respect the Science
(Specialized publisher of the country's conferences / Publication license number from the Ministry of Culture and Islamic Guidance: 8971)

Text Classification: process and Algorithms

Paper title: Text Classification: process and Algorithms
National paper ID: RSTCONF03_190
Published in the 3rd International Conference on Research in Engineering, Science and Technology, in the year 1395 (Solar Hijri calendar)
Authors:

Shahnaz Baghbani - ACECR Institute of Higher Education [Isfahan Branch], Isfahan

Abstract:
As the volume of information available on the Internet and in corporate environments increases, there is growing interest in developing tools to help people better find, filter, and manage these electronic resources. The aim of text classification is to build systems that can automatically classify documents into categories. Text is cheap, but information in the form of knowing which classes a text belongs to is expensive. Automatic classification of text can provide this information at low cost. Proper classification of e-documents, online news, emails, and digital libraries requires text mining, machine learning, and natural language processing techniques to extract meaningful knowledge. This paper reviews the text classification process, including document collection, pre-processing, indexing, feature selection, and classification. Moreover, it surveys the main algorithms in text classification, such as the Bayesian classifier, Decision Tree, Decision Rule, k-nearest neighbor (KNN), Support Vector Machines (SVMs), Neural Networks, Rocchio’s Algorithm, Fuzzy Correlation, and Genetic Algorithms.
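
The stages named in the abstract (pre-processing, indexing, feature selection, classification) can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes a toy four-document corpus, a tiny illustrative stopword list, document-frequency ranking as a stand-in for feature selection, and a multinomial naive Bayes classifier with add-one smoothing as one example of the Bayesian classifier family. All function names here are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"the", "a", "an", "is", "to", "of", "and", "in", "for", "on"}


def preprocess(text):
    """Pre-processing: lowercase, tokenize on letters, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def select_features(docs, k):
    """Feature selection: keep the k terms with the highest document frequency."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    # Sort by descending frequency, then alphabetically, so ties are deterministic.
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    return {term for term, _ in ranked[:k]}


def train_naive_bayes(docs, labels, vocab):
    """Indexing and training: per-class term counts with add-one smoothing."""
    class_docs = Counter(labels)
    term_counts = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        term_counts[label].update(t for t in tokens if t in vocab)
    model = {}
    for label, n_docs in class_docs.items():
        total = sum(term_counts[label].values())
        log_prior = math.log(n_docs / len(labels))
        log_like = {
            t: math.log((term_counts[label][t] + 1) / (total + len(vocab)))
            for t in vocab
        }
        model[label] = (log_prior, log_like)
    return model


def classify(model, vocab, text):
    """Classification: pick the class with the highest log posterior score."""
    tokens = [t for t in preprocess(text) if t in vocab]
    scores = {
        label: log_prior + sum(log_like[t] for t in tokens)
        for label, (log_prior, log_like) in model.items()
    }
    return max(scores, key=scores.get)


# Document collection: a hypothetical labeled corpus.
corpus = [
    ("cheap pills offer buy now", "spam"),
    ("win money offer now", "spam"),
    ("meeting agenda for project review", "ham"),
    ("project report and meeting notes", "ham"),
]
docs = [preprocess(text) for text, _ in corpus]
labels = [label for _, label in corpus]
vocab = select_features(docs, k=10)
model = train_naive_bayes(docs, labels, vocab)

print(classify(model, vocab, "buy cheap offer"))
print(classify(model, vocab, "project meeting notes"))
```

Document frequency is used here only because it is simple to show. Other selection criteria and any of the other classifiers listed in the abstract could be substituted at the corresponding stage.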

Keywords:
Text classification, Algorithm, Text mining

Paper page and full-text download: https://civilica.com/doc/557521/