Enhancing Predictive Accuracy of Biomolecules Partition Coefficients in Aqueous Two-Phase Systems Using Machine Learning
سال انتشار: 1402
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 157
نسخه کامل این مقاله ارائه نشده است و در دسترس نمی باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
این مقاله در بخشهای موضوعی زیر دسته بندی شده است:
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
IBIS12_135
تاریخ نمایه سازی: 12 آبان 1403
چکیده مقاله:
Information about the pharmaceutical and biotech industries has experienced remarkablegrowth in recent years. Biological processes consist of two main stages: upstream and downstreamprocesses. Downstream processes aim to recover and purify chemical products, constituting ۵۰ to ۸۰percent of the total production cost [۱]. Extraction, a stage within downstream processes, employsmethods like liquid-liquid extraction systems. One type of liquid-liquid extraction system is the aqueoustwo-phase system(ATPS)[۲]. As a significant portion of the aqueous two-phase system is water, it isused to purify biological molecules such as drugs, proteins, etc. This study uses machine learningmethods to predict the partition coefficient of drugs in aqueous two-phase system[۳]. The databaseutilized in this study includes data collected from previous articles that experimentally calculatedinformation related to the components of the aqueous two-phase system, and details about the chemicalstructure and physical properties of drugs. This study aims to investigate how the properties of drugsaffect their distribution coefficient in the aqueous two-phase system. In the investigation of the chemicalstructure, binary Morgan fingerprints, count-based Morgan fingerprints and Graph convolution wereutilized. Additionally, physical properties such as melting point, density, log P, etc were considered. Topredict the partition coefficient of drugs in the aqueous two-phase system, various machine-learningmodels were employed, including Random Forests, ANN, ensemble methods, etc. Results show thesignificant influence of drug properties on partition coefficient prediction. The best performance relatesto combining the physical and chemical properties of drugs using count_based Morgan fingerprintsrepresentation. On the other hand, the performance of the model using the ensemble method is betterthan the other models. This model achieves an MSE of ۰.۰۰۷۹, MAE of ۰.۰۵۷, RMSD of ۰.۰۸۸۸, andan R۲ value of ۰.۸۴ for test data.
کلیدواژه ها:
Aqueous Two-Phase Systems (ATPS) ، Biomolecules ، Machine learning ، partition Coefficients ، Morgan fingerprint
نویسندگان
Fateme Bahgeri
Department of Chemical Engineering, AmirKabir University of Technology, Tehran, Iran
Cholamreza Pazuki
Department of Chemical Engineering, AmirKabir University of Technology, Tehran, Iran
Fatemeh Zare-Mirakabad
Department of Mathematics and Computer Science, AmirKabir University of Technology, Tehran, Iran
Mahsa Sadat
Department of Mathematics and Computer Science, AmirKabir University of Technology, Tehran, Iran
Zahra Ghorbanali
Department of Mathematics and Computer Science, AmirKabir University of Technology, Tehran, Iran