An overview of data mining algorithms and information extraction methods
- سال انتشار: 1401
- محل انتشار: پانزدهمین کنفرانس بین المللی فناوری اطلاعات،کامپیوتر و مخابرات
- کد COI اختصاصی: ITCT15_048
- زبان مقاله: انگلیسی
- تعداد مشاهده: 248
نویسندگان
M.Sc. in Computer Engineering, Software Orientation, Islamic Azad University, Mahallat Branch, Markazi, Iran
چکیده
Data mining means trying to find a specific pattern among the data with the help of algorithms. There are some widely used algorithms in data mining that are approved by more experts. Data mining is a set of operations performed by a computer on a large amount of data to find a specific pattern among the scattered data. This action causes the hidden order to be found in the information that does not seem to have order, and the result is displayed in a way that is understandable to humans so that it can be used for decision making and planning. There are various methods for data mining, such as clustering, classification, and so on, which are used for each of the specific algorithms. Among the data is very useful information to improve the quality of various parts of human life. From detecting the possibility of illness in individuals, to finding better sales patterns in Internet business, and even identifying the face of the offender through CCTV, there are all different aspects that data mining can help a person. Classification is used to find out in which group each data instance is related within a given dataset. It is used for classifying data into different classes according to some constrains. Several major kinds of classification algorithms including C۴.۵, ID۳, k-nearest neighbor classifier, Naive Bayes, SVM, and ANN are used for classification. Generally a classification technique follows three approaches Statistical, Machine Learning and Neural Network for classification. While considering these approaches this paper provides an inclusive survey of different classification algorithms and their features and limitations. In data mining, an algorithm is a set of commands that are defined in computer languages and can be executed by a computer. In data mining, there are many algorithms that analyze large data and extract meaningful patterns from them. Here are some of the most common ones.کلیدواژه ها
data mining algorithms, information extraction, Naive Bayes, SVM, k-nearest neighbourمقالات مرتبط جدید
- تحلیل چالشها و راهکارهای تقویت ارتباط دانشگاه و صنعت: با تمرکز بر حلقههای مفقوده
- بازخوانی نقش دانشگاه و صنعت در توسعه ملی: از موانع تا راهکارها
- نشانگر تشخیصی جدید در ژن C-myc به عنوان کیت غیر تهاجمی تشخیص سرطان دهان
- برنامه ریزی منابع تجدید پذیر با درنظر گرفتن برنامه ریزی توسعه انتقال و تولید منابع توان راکتیو
- برنامه ریزی همزمان توسعه انتقال و منابع تولید توان راکتیو با استفاده از یک الگوریتم تکاملی بهبود یافته
اطلاعات بیشتر در مورد COI
COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.
کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.