Safe Reinforcement Learning for Personalized Digital Interventions in Obsessive-Compulsive Disorder (OCD)

سال انتشار: 1404
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 118

فایل این مقاله در 14 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

AAIEH01_055

تاریخ نمایه سازی: 22 شهریور 1404

چکیده مقاله:

Obsessive-Compulsive Disorder (OCD) is a prevalent and disabling psychiatric condition characterized by intrusive thoughts (obsessions) and repetitive behaviors (compulsions). While standard therapies like Exposure and Response Prevention (ERP) and Cognitive Behavioral Therapy (CBT) are effective, their utility is often limited by a lack of real-time personalization, high patient dropout rates, and rigid, one-size-fits-all protocols. To address these critical limitations, this paper introduces a novel framework for Safe Reinforcement Learning (Safe RL) applied to digital psychiatric interventions for OCD. We formally model the therapy process as a Partially Observable Markov Decision Process (POMDP), capturing a patient's latent psychological state from a rich, multimodal data stream, including self-reports (SUDS scores, behavioral logs) and physiological markers (heart rate variability from wearables). An intelligent agent, using a Bi-LSTM network with an attention mechanism, dynamically selects from a range of micro-interventions to optimize for long-term therapeutic outcomes. We employ a Safe Proximal Policy Optimization (Safe-PPO) algorithm to ensure strict adherence to clinically derived safety constraints, such as limiting the intensity of ERP exposures and preventing rapid anxiety escalation. We evaluate our framework in a ۴-week cross-over pilot study with N=۱۵ participants, comparing our Safe RL agent to a static, rule-based system. Results demonstrate a clinically meaningful improvement in symptom reduction (AY-BOCS of -۵.۳ vs -۲.۱), a significant increase in patient adherence (+۱۵%), and a near-zero rate of safety violations. This work not only provides the first empirical evidence for the feasibility of Safe RL in OCD psychotherapy but also offers a robust, scalable, and clinically validated blueprint for a new generation of adaptive AI-driven interventions in digital psychiatry.

نویسندگان

Danial Eskandari Faruji

Faculty of Electrical and Computer Engineering, Hakim Sabzevari University, Sabzevar, Iran

Amir Akhavan Saffar

Faculty of Computer Engineering and Information Technology, Sadjad University, Mashhad, Iran

Javad Hamidzadeh

Faculty of Computer Engineering and Information Technology, Sadjad University, Mashhad, Iran