Propensity based classification: Dehalogenase and non-dehalogenase enzymes

سال انتشار: 1394
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 285

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JADM-3-2_010

تاریخ نمایه سازی: 19 تیر 1398

چکیده مقاله:

The present work was designed to classify and differentiate between the dehalogenase enzyme to non–dehalogenases (other hydrolases) by taking the amino acid propensity at the core, surface and both the parts. The data sets were made on an individual basis by selecting the 3D structures of protein available in the PDB (Protein Data Bank). The prediction of the core amino acid were predicted by IPFP tool and their structural propensity calculation was performed by an in-house built software, Propensity Calculator which is available online. All datasets were finally grouped into two categories namely, dehalogenase and non-dehalogenase using Naïve Bayes, J-48, Random forest, K-means clustering and SMO classification algorithm. By making the comparison of various classification methods, the proposed tree method (Random forest) performs well with a classification accuracy of 98.88 % (maximum) for the core propensity data set. Therefore we proposed that, the core amino acid propensity could be approved as a novel potential descriptor for the classification of enzymes.

کلیدواژه ها:

Core Propensity ، Classification Algorithm ، Random Forest ، Protein Data Bank ، Dehalogenase and Non- dehalogenases

نویسندگان

R. Satpathy

School of Life Sciences, Sambalpur University, Burla, Sambalpur, India.

V. B. Konkimalla

Department of Biological Sciences, National Institute of Science Education & Research (NISER), Bhubaneswar, India.

J. Ratha

School of Life Sciences, Sambalpur University, Burla, Sambalpur, India.