ResidualConv۱D: A Deep Learning Approach for Enhancing Splice Site Prediction across Genomic Contexts

سال انتشار: 1402
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 175

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICAIFT01_003

تاریخ نمایه سازی: 16 بهمن 1402

چکیده مقاله:

This study addresses the challenge of accurately predicting splice sites, a crucial element in understanding gene expression and protein synthesis. We assume that conventional prediction methods may lack the specificity and adaptability required for diverse genomic contexts. To improve this, we present a novel method that integrates two-Gram features and One-Hot encoding with a Deep Convolutional Neural Network (ResidualConv۱D) model. Our approach begins with using the two-Gram technique to capture nucleotide dependencies at splice sites. These sequences are then enriched with two-Gram features using one-hot encoding. The core of our methodology is the ResidualConv۱D model, which employs convolutional blocks with residual connections to detect complex sequence patterns effectively. Our results indicate a significant advancement in splice site prediction accuracy. The model particularly excels in the HS۳D acceptor and Arabidopsis thaliana donor datasets, outperforming the established Ensemble Splice algorithm. In the HS۳D acceptor dataset, the model achieved an accuracy of ۹۴.۱۸% and an F۱-score of ۹۴.۲۴%, demonstrating its effectiveness. Additionally, it shows competitive performance in a range of metrics across various datasets, highlighting its robustness in different genomic environments. In conclusion, our innovative combination of two-Gram features, one-hot encoding, and the ResidualConv۱D model substantially improves the accuracy of splice site prediction across diverse species. This improvement in prediction capability could be pivotal in advancing the understanding of gene splicing mechanisms.

نویسندگان

Mohammad Reza Rezvan

Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

Ali Ghanbari Sorkhi

Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

Jamshid Pirgaz

Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

Mohammad Mehdi Pourhashem Kallehbasti

Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran