PSSMFG: a bioinformatics tool for generating descriptors based on PSSM profiles
- سال انتشار: 1396
- محل انتشار: هفتمین همایش بیوانفورماتیک ایران
- کد COI اختصاصی: IBIS07_111
- زبان مقاله: انگلیسی
- تعداد مشاهده: 599
نویسندگان
Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran
Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran
Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran
چکیده
One of fundamental step in the construction of machine learning based models is feature extraction. This step is important to determine the effectiveness of trained models in bioinformatics [1]. In the previous two decades, the variety of feature encoding schemes have been proposed in order to exploit suitable patterns from protein sequences. Most of the schemes are based on sequence information or representation of physicochemical properties. Although direct features derived from sequences themselves are regarded as essential for training models, an increasing number of studies have shown that evolutionary information in the form of PSSM profiles is much more informative than sequence information alone [2]. PSSM-based feature descriptors have been widely used to improve the performance of many types of research such as prediction of protein function, membrane proteins types, proteins structural class [3], protein binding site protein [4], secondary structure [5], disordered regions as well as sub-cellular localization of proteins [6], nevertheless, there is no universal tool for generating this wide variety of descriptors. In this study, PSSMFG (Position-Specific Scoring matrix-based feature generator), a tool that can generate a number of types of PSSM-based feature descriptors, is proposed. PSSMFG performances numerous algorithms available in the literature and provides an easy-to-use interfaceکلیدواژه ها
Feature encoding; Evolutionary information; PSSMاطلاعات بیشتر در مورد COI
COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.
کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.