Step by Step from Machine Learning Algorithm to Making QSAR model: Present a Combined Feature Selection with a Case Study

Publication year: 1396 (Solar Hijri)
Document type: Conference paper
Language: English

The full text of this paper has not been provided and is not available.

National scientific document ID:

IBIS07_178

Indexing date: 29 Farvardin 1397 (Solar Hijri)

Abstract:

Because of the large volume of data in bioinformatics fields such as drug design, gene selection, and QSAR, machine learning techniques such as feature selection have become essential for model construction. Feature selection is one of the most important steps in pattern recognition, machine learning, and data mining. Its purpose is to select the best subset of features from the full space of original features so that, while the dimensionality is reduced, the descriptive accuracy of the machine learning technique can still be achieved. Optimization algorithms based on random search, evolution, and similar ideas are new and effective methods for finding optimal solutions in feature selection. The randomness of these algorithms helps keep them from becoming trapped in local optima. The use of feature selection methods in the various bioinformatics fields is therefore inevitable. In this paper, we use a combination of classic and random methods to select important descriptors in a QSAR data set from drug design. Specifically, we combine Grey number theory with the firefly algorithm: we first use the Grey number theory algorithm to rank the features in our data set. Then, from the features with the highest priority and rank, we use the random firefly algorithm to choose the best features among them. The combination of these methods gives acceptable results and can be used for feature selection in bioinformatics.
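
The two-stage procedure described in the abstract (rank the descriptors, then search among the top-ranked ones with a firefly algorithm) could be sketched roughly as follows. This is only an illustration under several assumptions, not the authors' implementation: the abstract does not give the ranking formula, so the grey relational grade is used here as a stand-in for the "Grey number theory" ranking; fireflies are binarized with a simple 0.5 threshold; the fitness is the validation RMSE of a least-squares linear model plus a small size penalty; and all function names, parameter values, and the synthetic data are hypothetical.

```python
import numpy as np


def grey_relational_rank(X, y, rho=0.5):
    """Rank features by grey relational grade against y (assumed stand-in ranking)."""
    def norm(a):
        span = a.max(axis=0) - a.min(axis=0)
        return (a - a.min(axis=0)) / np.where(span == 0, 1.0, span)

    Xn, yn = norm(X), norm(y)                    # min-max scale to [0, 1]
    delta = np.abs(Xn - yn[:, None])             # distance to the reference series
    dmin, dmax = delta.min(), delta.max()
    if dmax == 0:                                # degenerate case: every column equals y
        return np.arange(X.shape[1]), np.ones(X.shape[1])
    coeff = (dmin + rho * dmax) / (delta + rho * dmax)
    grades = coeff.mean(axis=0)                  # one grade per feature
    return np.argsort(-grades), grades           # best-ranked feature first


def fitness(mask, X, y, train, valid, penalty=0.01):
    """Validation RMSE of a linear model on the masked columns (lower is better)."""
    if not mask.any():
        return np.inf
    A = np.column_stack([X[:, mask], np.ones(len(y))])
    w, *_ = np.linalg.lstsq(A[train], y[train], rcond=None)
    rmse = np.sqrt(np.mean((A[valid] @ w - y[valid]) ** 2))
    return rmse + penalty * mask.sum()


def to_mask(position):
    """Binarize a continuous firefly position (assumed threshold rule)."""
    return position > 0.5


def binary_firefly(X, y, n_fireflies=15, n_iter=40,
                   beta0=1.0, gamma=1.0, alpha=0.3, seed=0):
    """Search for a feature subset with a thresholded firefly algorithm."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    idx = rng.permutation(n)
    train, valid = idx[: n // 2], idx[n // 2:]
    pos = rng.random((n_fireflies, d))           # continuous positions in [0, 1]
    cost = np.array([fitness(to_mask(p), X, y, train, valid) for p in pos])
    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if cost[j] < cost[i]:            # j is brighter, so i moves toward j
                    r2 = np.sum((pos[i] - pos[j]) ** 2)
                    beta = beta0 * np.exp(-gamma * r2)
                    step = alpha * (rng.random(d) - 0.5)
                    pos[i] = np.clip(pos[i] + beta * (pos[j] - pos[i]) + step, 0, 1)
                    cost[i] = fitness(to_mask(pos[i]), X, y, train, valid)
    best = np.argmin(cost)
    return to_mask(pos[best]), cost[best]


def select_features(X, y, top_k=10, **fa_kwargs):
    """Stage 1: keep the top_k ranked features. Stage 2: firefly search among them."""
    order, _ = grey_relational_rank(X, y)
    shortlist = order[:top_k]
    mask, cost = binary_firefly(X[:, shortlist], y, **fa_kwargs)
    return np.sort(shortlist[mask]), cost


# Usage on synthetic data (not the paper's QSAR data set).
rng = np.random.default_rng(42)
X_demo = rng.normal(size=(120, 30))
y_demo = 2.0 * X_demo[:, 3] - 1.5 * X_demo[:, 17] + 0.1 * rng.normal(size=120)
selected, cost = select_features(X_demo, y_demo, top_k=10)
print("selected descriptor indices:", selected, "fitness:", cost)
```

Ranking first shrinks the search space, so the stochastic stage only has to explore subsets of the `top_k` shortlisted descriptors rather than of the full descriptor set. In practice the fitness function would normally be replaced by the cross-validated error of whichever QSAR model is actually being built.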

Authors

Mazaher Maghsoudloo

University of Tehran, Kish International Campus, Iran

Masoud Arabfard

University of Tehran, Kish International Campus, Iran

Sajjad Gharaghani

Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran

Kaveh Kavousi

Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran