Advanced Multi-Task Learning with Lightweight Networks and Multi-Head Attention for Efficient Facial Attribute Estimation

سال انتشار: 1404
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 92

فایل این مقاله در 14 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_IJE-38-10_005

تاریخ نمایه سازی: 21 اسفند 1403

چکیده مقاله:

The rapid advancement of computer vision algorithms demands efficient computational resource utilization for practical applications. This study proposes a novel framework that integrates multi-task learning (MTL) with MobileNetV۳-Large networks and multi-head attention (MHA) mechanisms to simultaneously estimate facial attributes, including age, gender, race, and emotions. By employing MHA, the model enhances feature extraction and representation by focusing on multiple regions of the input image, thereby reducing computational complexity while significantly improving accuracy. The Receptive Field Enhanced Multi-Task Cascaded (RFEMTC) technique is utilized for effective preprocessing of the input data. Our methodology is rigorously evaluated on the UTKFace, FairFace, and RAF-DB datasets. We introduce a weighted loss function to balance task contributions, enhancing overall performance. Through refinement of the network architecture by analyzing branching points and optimizing the balance between shared and task-specific layers, our experimental results demonstrate significant improvements: a ۷% reduction in parameters, a ۳% increase in gender detection accuracy, a ۵% improvement in race detection accuracy, and a ۶% enhancement in emotion detection accuracy compared to single-task methods. Additionally, our proposed architecture reduces age estimation error by approximately one year on the UTKFace dataset and improves age estimation accuracy on the FairFace dataset by ۵% compared to state-of-the-art approaches.

نویسندگان

M. Rohani

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

H. Farsi

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

S. Mohamadzadeh

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • Charoqdouz E, Hassanpour H. Feature extraction from several angular faces ...
  • Hamidi H, Vafaei A, Monadjemi SAH. Analysis and evaluation of ...
  • نمایش کامل مراجع