Data-Efficient Transformer Architectures for Image-Level Facial Forgery Detection: A Comparative Evaluation of ViT and DeiT

G; M; Ramanaiah; Doss; S; Rangasamy

Data-Efficient Transformer Architectures for Image-Level Facial Forgery Detection: A Comparative Evaluation of ViT and DeiT

محل انتشار: فصلنامه مدیریت فناوری اطلاعات، دوره: 18، شماره: 1

سال انتشار: 1405

نوع سند: مقاله ژورنالی

زبان: انگلیسی

مشاهده: 16

فایل این مقاله در 17 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/2571294

شناسه ملی سند علمی:

JR_JITM-18-1_002

تاریخ نمایه سازی: 17 اسفند 1404

چکیده مقاله:

The rapid development of deepfake technologies has increased the demand for a credible and inter-pretable system for facial forgery detection. This study compares two transformer-based architec-tures—Vision Transformer (ViT) and Distilled Data-Efficient Image Transformer (DeiT)—for de-tecting real and manipulated facial images. The study aims to measure performance in terms of de-tection as well as interpretability and to address the weaknesses of traditional convolutional models. Data augmentation was applied, and a balanced dataset containing ۸,۰۰۰ real and fake images was constructed; both models were then fine-tuned under the same training environment. The explanatory ability of the models was incorporated using LIME. Experimental findings indicate that both models perform well, with DeiT being slightly more accurate at ۹۴.۶۲% than ViT at ۹۳.۶%, alongside faster convergence rates and less overfitting. Visualization of the focus on important facial areas confirms that the models reliably register synthetic artifacts. Although promising, generalization across dif-ferent datasets and enhancement of real-time performance remain challenges. Overall, the results validate transformer architectures—especially DeiT—as powerful and explainable deepfake detec-tion algorithms, valuable for ensuring safe and transparent digital media forensics.

کلیدواژه ها:

Vision Transformer ، Data-Efficient Image Transformer ، Face Forgery Detection ، Ex-plainable AI ، Transformer Models

نویسندگان

G

Assistant Professor, Department of Computer Science& Engineering, B.M.S College of Engineering, Affiliated to Visvesvaraya Technological University, Belagavi, India.

M

Associate Professor, Department of Artificial Intelligence and Machine Learning, Bangalore Institute of Technology, Visvesvaraya Technological University, Bangalore, India.

Ramanaiah

Cloud Architect, Lead of AI initiative Program, Ernst & Young LLP, New York, USA.

Doss

Department of ECE, CMR Technical Campus, Hyderabad, Telangana, India.

S

Professor, ECE Department, Sreenidhi Institute of Science and Technology, India.

Rangasamy

Assistant Professor, Department of Biotechnology, Vinayaka Mission`s Kirupananda Variyar Engineering College, Salem (Vinayaka Mission`s Research Foundation). India.

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :

Akshatha G, Kempanna, M., Ashoka, S. B., & Kunta, J. ...
Akshatha, G., & Kempanna, M. (۲۰۲۵). Review of deep learning ...
Altaei, M. S. M. (۲۰۲۲). A detection of deep fake ...
Arshed, M. A., Alwadain, A., Faizan Ali, R., Mumtaz, S., ...
Deng, L., Suo, H., & Li, D. (۲۰۲۲). Deepfake Video ...
Dolhansky, B., Howes, R., Pflaum, B., Baram, N., & Ferrer, ...
Dosovitskiy, A. (۲۰۲۰). An image is worth ۱۶x۱۶ words: Transformers ...
Gong, L. Y., & Li, X. J. (۲۰۲۴). A contemporary ...
James, U. U., Olarinoye, H. S., Uchenna, I. R., Idika, ...
Korshunov, P., & Marcel, S. (۲۰۱۸). Deepfakes: a new threat ...
Kumar, M. and Selvam, A., ۲۰۲۵. Deep Fake Face Detection ...
Lad, S. (۲۰۲۴). Applied Ethical and Explainable AI in Adversarial ...
Mansoor, N., & Iliev, A. I. (۲۰۲۵). Explainable AI for ...
Nagahisarchoghaei, M., Nur, N., Cummins, L., Nur, N., Karimi, M. ...
Nida, N., Irtaza, A., & Ilyas, N. (۲۰۲۱). Forged face ...
Omodunbi, B. A., Sobowale, A., & Soladoye, A. Detection of ...
Omotosho, L. O., Ogundoyin, I. K., Oyeniyi, J. O., & ...
Oulad-Kaddour, M., Haddadou, H., Vilda, C. C., Palacios-Alonso, D., Benatchba, ...
Pai, G., & Sharmila, K. M. (۲۰۲۳). Semi-Dense U-Net: A ...
Rahman, M. H., Jannat, M. K. A., Islam, M. S., ...
Rajagukguk, N., Kencana, I. P. E. N., & Kusuma, I. ...
Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., ...
Yasser, B., Hani, J., El-Gayar, S., Amgad, O., Ahmed, N., ...
Zhou, L., & Yu, W. (۲۰۲۲). Improved Convolutional Neural Image ...

نمایش کامل مراجع