Integrating Biological Networks and Deep Learning for Microbe -Disease Prediction

سال انتشار: 1403
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 92

متن کامل این مقاله منتشر نشده است و فقط به صورت چکیده یا چکیده مبسوط در پایگاه موجود می باشد.
توضیح: معمولا کلیه مقالاتی که کمتر از ۵ صفحه باشند در پایگاه سیویلیکا اصل مقاله (فول تکست) محسوب نمی شوند و فقط کاربران عضو بدون کسر اعتبار می توانند فایل آنها را دریافت نمایند.

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

IBIS13_151

تاریخ نمایه سازی: 10 اردیبهشت 1404

چکیده مقاله:

The human microbiome, consisting of diverse microbial communities, plays a crucial role in health and disease. Identifying associations between microbes and diseases holds the potential to revolutionize diagnosis, prevention, and treatment strategies. Despite the high accuracy reported by many predictive models in microbe-disease association tasks, their evaluation frameworks often overlook biases and distributional challenges inherent in the datasets, resulting in unreliable performance metrics. To overcome this limitation, we propose a novel evaluation framework that systematically adjusts for node degree distribution, ensuring a more rigorous and reliable assessment of model performance. Furthermore, we present a predictive model that leverages deep artificial neural networks and graph learning algorithms, designed to be improved under these new evaluation criteria for microbe-disease association prediction. A comparative analysis with six state-of-the-art models shows that our approach consistently outperforms existing methods across both traditional and newly proposed evaluation frameworks. Our proposed evaluation framework introduces a sampling algorithm to construct test sets with nearly balanced numbers of positive and negative associations for each node, offering a more unbiased and comprehensive evaluation. Our new predictive model integrates heterogeneous biological network data through the MDKG knowledge graph, which encompasses nine biological entities, including microbes and diseases, connected by ۳۹ distinct edge types. We employ the Node۲Vec algorithm to extract meaningful features from this knowledge graph, generating rich feature representations for microbes and diseases. These features are then processed by a deep artificial neural network specifically designed for microbe-disease association prediction. In our study, we used the HMDAD dataset, a widely utilized resource that compiles known microbe-disease associations from previous research articles. This dataset is the most commonly used benchmark in this problem domain, ensuring comparability and relevance. Our model achieved an average AUC of ۰.۹۱ in the traditional evaluation framework, outperforming or matching existing models. Under the proposed evaluation framework, it achieved an average AUC of ۰.۷۳, surpassing the performance of competing models, which ranged between ۰.۴۱ and ۰.۶۷. Using the Wilcoxon signed-rank test, we demonstrated that our model significantly outperforms all other models in the new evaluation framework and outperforms five models in the traditional framework, achieving equal performance with the sixth.

نویسندگان

Sobhan Ahmadian Moghadama

Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran

Zahra Ghorbanali

Department of Bioinformatics, Institute Biochemistry and Biophysics, University of Tehran, Tehran, Iran

Hesam Montazeri

Department of Bioinformatics, Institute Biochemistry and Biophysics, University of Tehran, Tehran, Iran

Fatemeh Zare -Mirakabad

Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran