Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

Year of publication: 1394 (Solar Hijri)
Published in: Journal of Computer and Robotics (biannual), Volume 8, Issue 1
COI code: JR_JCR-8-1_002
Article language: English

Authors

Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran

Department of Computer Engineering, Iran University of Science and Technology, Tehran, Iran

Abstract

A DNA sequence, although it encodes all genetic traits, is not itself a functional entity. Instead, its information is converted into protein sequences through transcription and translation. The protein sequence then takes on a 3D structure, which is the functional unit and manages biological interactions using the information encoded in DNA. Every life process one can imagine is carried out by proteins with specific functional roles. Consequently, protein function prediction is an important task in bioinformatics. A protein's function can be inferred from its structure. Protein secondary structure prediction has attracted great attention because it serves as an input feature for many bioinformatics problems. A wide variety of computational methods has been proposed for protein secondary structure prediction. Nevertheless, their success was limited by obstacles such as obscure patterns in protein data, noise, class imbalance, and the high dimensionality of the encoding schemes used for amino acid sequences. With the advent of machine learning, and later of ensemble approaches, considerable improvement was achieved. To reach meaningful conclusions about the strengths, bottlenecks, and limitations of what has been done in this research area, a review of the literature is of great benefit. Such a review is useful not only for summarizing what has been accomplished so far but also for informing future decisions about potential and as yet unexplored solutions in this area. This paper therefore reviews different computational approaches to protein secondary structure prediction, with a focus on machine learning methods, addressing different parts of the problem area.

Keywords

Protein secondary structure prediction, Machine learning, Neural Networks, Support Vector Machines, Ensemble methods
