Designing Guide RNAs Considering Essential Genes for Genome Editing of Yarrowia lipolytica Using Deep Learning in the CRISPR System
سال انتشار: 1403
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 77
متن کامل این مقاله منتشر نشده است و فقط به صورت چکیده یا چکیده مبسوط در پایگاه موجود می باشد.
توضیح: معمولا کلیه مقالاتی که کمتر از ۵ صفحه باشند در پایگاه سیویلیکا اصل مقاله (فول تکست) محسوب نمی شوند و فقط کاربران عضو بدون کسر اعتبار می توانند فایل آنها را دریافت نمایند.
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
IBIS13_024
تاریخ نمایه سازی: 10 اردیبهشت 1404
چکیده مقاله:
The CRISPR/Cas system is used to precisely remove and add one or more genes to the genome. In this system, a protein called Cas is combined with a short RNA called sgRNA, which makes a double-strand break precisely at the desired location in the genome. The designed sgRNA should be designed in a way that not only accurately targets the desired location without off-target effects but also avoids affecting vital genes. The yeast Yarrowia lipolytica can produce valuable natural and recombinant compounds with commercial, industrial, and therapeutic significance. Given the importance and application of this yeast, designing appropriate sgRNAs can yield optimal efficiency for genome editing and the production of economically valuable products. The cutting score (CS) and fitness score (FS), which indicate changes in gene activity following sgRNA deletion, were obtained in the laboratory for each sgRNA sequence in the Cas۱۲a protein by Ramesh et al. In current study, we used these values and deep learning based on a convolutional neural network (CNN), unsupervised learning was first performed with a convolutional autoencoder (CAE) to extract sgRNA features in the Y. lipolytica genome. Then, supervised learning by the CNN yielded the FS value for each sgRNA in the Cas۱۲a dataset, resulting in Spearman values of ۰.۷۰% and Pearson values of ۰.۷۲%. The FS results for each sgRNA sequence were fed into a neural network to predict the CS, which indicates sgRNA effectiveness. Finally, the model’s predictions achieved Spearman values of ۰.۹۶% and Pearson values of ۰.۹۵% for predicting the sgRNA with the highest efficacy, outperforming existing algorithms for Y. lipolytica.
کلیدواژه ها:
نویسندگان
Z. Vahdani
Department of Computer Science, Faculty of Mathematical Sciences, Alzahra University, Tehran, Iran
F. Darvishi
Department of Microbiology, Faculty of Biological Sciences, Alzahra University, Tehran, Iran
F. Zare-Mirakabad
Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran