Paying Attention to the Features Extracted from the Image to Person Re-identification

سال انتشار: 1404
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 45

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JECEI-13-2_001

تاریخ نمایه سازی: 19 تیر 1404

چکیده مقاله:

kground and Objectives: Person re-identification is an important application in computer vision, enabling the recognition of individuals across non-overlapping camera views. However, the large number of pedestrians with varying appearances, poses, and environmental conditions makes this task particularly challenging. To address these challenges, various learning approaches have been employed. Achieving a balance between speed and accuracy is a key focus of this research. Recently introduced transformer-based models have made significant strides in machine vision, though they have limitations in terms of time and input data. This research aims to balance these models by reducing the input information, focusing attention solely on features extracted from a convolutional neural network model. Methods: This research integrates convolutional neural network (CNN) and Transformer architectures. A CNN extracts important features of a person in an image, and these features are then processed by the attention mechanism in a Transformer model. The primary objective of this work is to enhance computational speed and accuracy in Transformer architectures. Results: The results obtained demonstrate an improvement in the performance of the architectures under consistent conditions. In summary, for the Market-۱۵۰۱ dataset, the mAP metric increased from approximately ۳۰% in the downsized Transformer model to around ۷۴% after applying the desired modifications. Similarly, the Rank-۱ metric improved from ۴۸% to approximately ۸۹%.Conclusion: Indeed, although it still has limitations compared to larger Transformer models, the downsized Transformer architecture has proven to be much more computationally efficient. Applying similar modifications to larger models could also yield positive effects. Balancing computational costs while improving detection accuracy remains a relative goal, dependent on specific domains and priorities. Choosing the appropriate method may emphasize one aspect over another.

نویسندگان

S. H. Zahiri

Department of Electrical Engineering, Faculty of Engineering, University of Birjand, Birjand, Iran.

R. Iranpoor

Department of Electrical Engineering, Faculty of Engineering, University of Birjand, Birjand, Iran.

N. Mehrshad

Department of Electrical Engineering, Faculty of Engineering, University of Birjand, Birjand, Iran.

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • S. S. A. Zaidi, M. S. Ansari, A. Aslam, N. ...
  • A. Krizhevsky, I. Sutskever, G. E. Hinton, "Imagenet classification with ...
  • W. Wei, W. Yang, E. Zuo, Y. Qian, L. Wang, ...
  • M. Farenzena, L. Bazzani, A. Perina, V. Murino, M. Cristani, ...
  • W. S. Zheng, S. Gong, T. Xiang, "Person re-identification by ...
  • K. He, X. Zhang, S. Ren, J. Sun, "Deep residual ...
  • Z. Zheng, L. Zheng, Y. Yang, "A discriminatively learned cnn ...
  • H. Liu, J. Feng, M. Qi, J. Jiang, S. Yan, ...
  • L. Zheng, Y. Yang, A. G. Hauptmann, "Person re-identification: Past, ...
  • L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang, ...
  • R. Girshick, J. Donahue, T. Darrell, J. Malik, "Rich feature ...
  • H. Luo et al., "A strong baseline and batch normalization ...
  • Y. Sun, L. Zheng, Y. Yang, Q. Tian, S. Wang, ...
  • Y. Sun, L. Zheng, Y. Li, Y. Yang, Q. Tian, ...
  • Y. Sun et al., "Circle loss: A unified perspective of ...
  • G. Wang, Y. Yuan, X. Chen, J. Li, X. Zhou, ...
  • H. Luo, W. Jiang, X. Zhang, X. Fan, J. Qian, ...
  • J. Qian, W. Jiang, H. Luo, H. Yu, "Stripe-based and ...
  • A. Vaswani et al., "Attention is all you need," Adv. ...
  • K. Han et al., "A survey on visual transformer," arXiv ...
  • S. Khan, M. Naseer, M. Hayat, S. W. Zamir, F. ...
  • A. Dosovitskiy et al., "An image is worth ۱۶x۱۶ words: ...
  • H. Touvron, M. Cord, M. Douze, F. Massa, A. Sablayrolles, ...
  • S. He, H. Luo, P. Wang, F. Wang, H. Li, ...
  • D. Wu et al., "Deep learning-based methods for person re-identification: ...
  • D. Gray, H. Tao, "Viewpoint invariant pedestrian recognition with an ...
  • C. C. Loy, T. Xiang, S. Gong, "Multi-camera activity correlation ...
  • W. Li, R. Zhao, X. Wang, "Human reidentification with transferred ...
  • W. Li, R. Zhao, T. Xiao, X. Wang, "Deepreid: Deep ...
  • L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, ...
  • E. Ristani, F. Solera, R. Zou, R. Cucchiara, C. Tomasi, ...
  • L. Wei, S. Zhang, W. Gao, Q. Tian, "Person transfer ...
  • S. Ren, K. He, R. Girshick, J. Sun, "Faster r-cnn: ...
  • S. Targ, D. Almeida, K. Lyman, "Resnet in resnet: Generalizing ...
  • S. Xie, R. Girshick et al., "Aggregated residual transformations for ...
  • A. G. Howard et al., "Mobilenets: Efficient convolutional neural networks ...
  • J. Zang, L. Wang, Z. Liu, Q. Zhang, G. Hua, ...
  • F. N. Iandola, S. Han, M. W. Moskewicz et al. ...
  • F. Chollet, "Xception: Deep learning with depthwise separable convolutions," in ...
  • M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L. C. ...
  • A. Howard et al., "Searching for mobilenetv۳," in Proc. IEEE/CVF ...
  • S. Elfwing, E. Uchibe, K. Doya, "Sigmoid-weighted linear units for ...
  • Y. Guo, D. Zhou, W. Li, J. Cao, "Deep multi-scale ...
  • P. Ramachandran, B. Zoph, Q. V. Le, "Searching for activation ...
  • J. L. Ba, J. R. Kiros, G. E. Hinton, "Layer ...
  • نمایش کامل مراجع