Dynamic Retrieval-Based Prompting for Cross-Lingual Dialogue Understanding in Persian

سال انتشار: 1405
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 24

فایل این مقاله در 15 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JADM-14-2_005

تاریخ نمایه سازی: 26 فروردین 1405

چکیده مقاله:

Dialogue understanding for low-resource languages like Persian remains challenging due to limited annotated data, which constrains supervised training at scale. We propose a simple yet effective training-free method that combines machine translation, retrieval-based example selection, and prompting with a large language model (GPT-۴o) to improve zero-shot cross-lingual performance. Given a Persian utterance translated into English, our method retrieves semantically and lexically similar English examples using a hybrid similarity function, translates them back into Persian, and constructs a few-shot prompt tailored to the input. This input-sensitive strategy enhances the quality of the examples, helping the model align more effectively with each instance. Experimental results on the Persian-ATIS dataset show that our approach improves intent detection and achieves competitive slot filling performance, outperforming state-of-the-art baselines without requiring any supervision in the target language. The modular pipeline is easy to reproduce and, in future work, can be extended to other low-resource languages, tasks, or retrieval configurations. The repository of our work is available at https://anonymous.۴open.science/r/Persian_Language_Understanding-FDF۴.

نویسندگان

Saedeh Tahery

Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran

Saeed Farzi

Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • Y. Wang, J. Zhang, T. Shi, D. Deng, Y. Tian, ...
  • H. Zhang, P. S. Yu, and J. Zhang, "A systematic ...
  • A. Algherairy and M. Ahmed, "Prompting large language models for ...
  • V. Pratap, A. Tjandra, B. Shi, P. Tomasello, A. Babu, ...
  • P. Pakray, A. Gelbukh, and S. Bandyopadhyay, "Natural language processing ...
  • M. Firdaus, H. Golchha, A. Ekbal, and P. Bhattacharyya, "A ...
  • A. Algherairy and M. Ahmed, "A review of dialogue systems: ...
  • Z. Zhang, R. Takanobu, Q. Zhu, M. Huang, and X. ...
  • C.-W. Goo, G. Gao, Y.-K. Hsu, C.-L. Huo, T.-C. Chen, ...
  • M. A. Hedderich, L. Lange, H. Adel, J. Strötgen, and ...
  • T. Adimulam, S. Chinta, and S. K. Pattanayak, "Transfer learning ...
  • Y. Fu, N. Lin, B. Chen, Z. Yang, and S. ...
  • P. Sahoo, A. K. Singh, S. Saha, V. Jain, S. ...
  • S. Tahery and S. Farzi, "An Adapted Few-Shot Prompting Technique ...
  • P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, ...
  • A. Conneau and G. Lample, "Cross-lingual language model pretraining," Advances ...
  • A. Siddhant, M. Johnson, H. Tsai, N. Ari, J. Riesa, ...
  • C. D. Manning, P. Raghavan, and H. Schütze, An introduction ...
  • M. J. Sabet, P. Dufter, F. Yvon, and H. Schütze, ...
  • M. Akbari, A. H. Karimi, T. Saeedi, Z. Saeidi, K. ...
  • E. Razumovskaia, G. Glavaš, O. Majewska, A. Korhonen, and I. ...
  • K. Yu, H. Li, and B. Oguz, "Multilingual seq۲seq training ...
  • B. McCann, J. Bradbury, C. Xiong, and R. Socher, "Learned ...
  • S. Schuster, S. Gupta, R. Shah, and M. Lewis, "Cross-lingual ...
  • T. Pires, E. Schlinger, and D. Garrette, "How multilingual is ...
  • A. Conneau, K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, ...
  • R. Zadkamali, S. Momtazi, and H. Zeinali, "Intent detection and ...
  • Z. Li, C. Hu, J. Chen, Z. Chen, X. Guo, ...
  • L. Qin, Q. Chen, T. Xie, Q. Li, J.-G. Lou, ...
  • L. Qin, M. Ni, Y. Zhang, and W. Che, "Cosda-ml: ...
  • Z. Liu, G. I. Winata, Z. Lin, P. Xu, and ...
  • P. Safari and M. Shamsfard, "Data augmentation and preparation process ...
  • S. Tahery, S. Kianian, and S. Farzi, "Cross-Lingual NLU: Mitigating ...
  • W. U. Ahmad, Z. Zhang, X. Ma, K.-W. Chang, and ...
  • I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, ...
  • S. Tahery and S. Farzi, "An Invasive Embedding Model in ...
  • J. Pei, G. Yan, M. De Rijke, and P. Ren, ...
  • T. Labruna, S. Brenna, and B. Magnini, "Dynamic Task-Oriented Dialogue: ...
  • A. R. Ghasemi and J. Salimi Sartakhti, "Multilingual Language Models ...
  • W. Pan, Q. Chen, X. Xu, W. Che, and L. ...
  • Z. Zhu, X. Cheng, H. An, Z. Wang, D. Chen, ...
  • Z. Borhanifard, H. Basafa, S. Z. Razavi, and H. Faili, ...
  • M. Akbari, A. Mohades, and M. H. Shirali-Shahreza, "A hybrid ...
  • E. A. Abyaneh, R. Zolfaghari, and A. A. Abyaneh, "User ...
  • A. Singhal, "Modern information retrieval: A brief overview," IEEE Data ...
  • D. Kamali, B. Janfada, M. E. Shenasa, and B. Minaei-Bidgoli, ...
  • نمایش کامل مراجع