Multi-Task Learning Using Uncertainty for Realtime Multi-Person Pose Estimation
سال انتشار: 1403
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 33
فایل این مقاله در 16 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_JECEI-12-1_010
تاریخ نمایه سازی: 5 دی 1402
چکیده مقاله:
kground and Obejctives: Multi-task learning is a widespread mechanism to improve the learning of multiple objectives with a shared representation in one deep neural network. In multi-task learning, it is critical to determine how to combine the tasks loss functions. The straightforward way is to optimize the weighted linear sum of multiple objectives with equal weights. Despite some studies that have attempted to solve the realtime multi-person pose estimation problem from a ۲D image, major challenges still remain unresolved. Methods: The prevailing solutions are two-stream, learning two tasks simultaneously. They intrinsically use a multi-task learning approach for predicting the confidence maps of body parts and the part affinity fields to associate the parts to each other. They optimize the average of the two tasks loss functions, while the two tasks have different levels of difficulty and uncertainty. In this work, we overcome this problem by applying a multi-task objective that captures task-based uncertainties without any additional parameters. Since the estimated poses can be more certain, the proposed method is called “CertainPose”. Results: Experiments are carried out on the COCO keypoints data sets. The results show that capturing the task-dependent uncertainty makes the training procedure faster and causes some improvements in human pose estimation. Conclusion: The highlight advantage of our method is improving the realtime multi-person pose estimation without increasing computational complexity.
کلیدواژه ها:
Realtime Multi-Person Pose Estimation ، Multi-Task Learning ، Loss Function ، Task-Dependent Uncertainty
نویسندگان
Z. Ghasemi-Naraghi
Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.
A. Nickabadi
Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.
R. Safabakhsh
Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.
مراجع و منابع این مقاله:
لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :