Semantic Segmentation Using an Improved ResNet Structure and Efficient Channel Attention Mechanism Applied to Atrous Spatial Pyramid Pooling in a Fully Convolutional Network
محل انتشار: ماهنامه بین المللی مهندسی، دوره: 38، شماره: 11
سال انتشار: 1404
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 20
فایل این مقاله در 16 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_IJE-38-11_004
تاریخ نمایه سازی: 26 فروردین 1404
چکیده مقاله:
In the field of computer vision, semantic segmentation became an important problem that has applications in fields such as autonomous driving and robotics. Image segmentation datasets, on the other hand, present substantial hurdles due to the high intra-class variability, which includes differences across car models or building designs, and the low inter-class variability, which makes it difficult to discern between objects such as buildings that have facades that are visually identical. A focus-enhanced ASPP module that is coupled with an upgraded backbone for semantic segmentation networks is presented in this study as a solution to the problems that have been identified. In order to augment the adaptability of extracted features, the proposed framework utilizes the capability of an attention ASPP module to implement attention processes within the multiscale module. In order to efficiently capture complex features, the encoder stage also makes use of a ResNet-۵۰ backbone that has been properly optimized. In addition, to increase the robustness of the model, data augmentation approaches are applied. mDice of ۸۷.۸۲, mIoU of ۷۹.۰۵, and mean accuracy of ۸۵.۲ on the Stanford dataset, and mDice of ۸۸.۹۱, mIoU of ۸۰.۰۳, and mean accuracy of ۸۹.۸۴ on the Cityscapes dataset, according to experimental assessments, demonstrate that the developed technique performs at an accuracy level that is believed to be modern. As a result of these findings, the possibility for greatly improving semantic segmentation performance may be highlighted by integrating attention mechanisms, ASPP modules, and upgraded ResNet structures.
کلیدواژه ها:
Semantic segmentation ، Efficient channel attention ، Atrous Spatial Pyramid Pooling ، improved ResNet ، dilation convolution
نویسندگان
E. Sahragard
Department of Electrical and Computer Engineering University of Birjand, Birjand, Iran
H. Farsi
Department of Electrical and Computer Engineering University of Birjand, Birjand, Iran
S. Mohamadzadeh
Department of Electrical and Computer Engineering University of Birjand, Birjand, Iran
مراجع و منابع این مقاله:
لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :