Context-aware Action Quality Assessment using Latent Regression-based Progressive Sub-action Learning

سال انتشار: 1405
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 20

فایل این مقاله در 18 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_CKE-9-1_006

تاریخ نمایه سازی: 5 خرداد 1405

چکیده مقاله:

Most existing Action Quality Assessment (AQA) systems infer performance scores from holistic, video-level representations, a process that frequently obscures the contribution of individual motion segments to the final prediction. This global treatment reduces transparency and limits interpretability, particularly for complex skills composed of multiple sequential phases. The problem is further compounded by limited fine-grained supervision and the difficulty of modeling long-range temporal dependencies across an action sequence. This paper introduces a Latent Regression-based Progressive Sub-action Learning framework to address these limitations by capturing rich contextual information within the action sequence. The framework first employs a procedure-parsing module to segment each video into semantically coherent sub-actions and extract corresponding spatio-temporal features. A latent variable approach subsequently generates initial pseudo-subscores by integrating these features with overall score labels from the training set. Critically, these pseudo-subscores are progressively refined through an iterative process that leverages the cumulative contextual information from preceding sub-actions. This yields an enriched feature set that accurately encapsulates the execution history. Furthermore, a novel monotonic penalty loss is introduced to enforce a logical and consistent progression in the sub-scores, mitigating abrupt and illogical score fluctuations. The model is trained in a two-stage process. First, it generates the robust pseudo-subscores, and subsequently it predicts both the overall score and the per-substage scores. This explicit modeling of long-range dependencies, along with consistent score progression, is critical for ensuring prediction accuracy. Extensive experimental evaluation confirms that this structured modeling approach delivers consistent performance gains over contemporary AQA methods while offering substantially improved interpretability.

نویسندگان

Marjan Mazruei

Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran

Ehsan Fazl-Ersi

Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran

Abedin Vahedian

Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran

Ahad Harati

Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :