Analyzing the Inference Process in Deep Convolutional Neural Networks using Principal Eigenfeatures, Saturation and Logistic Regression Probes

سال انتشار: 1402
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 104

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JAREE-2-1_001

تاریخ نمایه سازی: 3 دی 1402

چکیده مقاله:

The predictive performance of a neural network depends on the one hand on the difficulty of a problem, defined by the number of classes and complexity of the visual domain, and on the other hand on the capacity of the model, determined by the number of parameters and its structure. By applying layer saturation and logistic regression probes, we confirm that these factors influence the inference process in an antagonistic manner. This analysis allows the detection of over- and under-parameterization of convolutional neural networks. We show that the observed effects are independent of previously reported pathological patterns, like the “tail pattern”. In addition, we study the emergence of saturation patterns during training, showing that saturation patterns emerge early in the optimization process. This allows for quick detection of problems and potentially decreased cycle time during experiments. We also demonstrate that the emergence of tail patterns is independent of the capacity of the networks. Finally, we show that information processing within a tail of unproductive layers is different, depending on the topology of the neural network architecture.

نویسندگان

Mats Leon Richter

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Leila Malihi

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Anne-Kathrin Patricia Windler

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Ulf Krumnack

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • M. Tan et al., “MnasNet: Platform-aware neural architecture search for ...
  • I. Garg, P. Panda, and K. Roy, "A Low Effort ...
  • W. Ahmed, S. Ansari, M. Hanif, A. Khalil, “PCA driven ...
  • I. Chakraborty, D. Roy, I. Garg Constructing, A. Ankit, and ...
  • G. Alain, and Y. Bengio, “Understanding intermediate layers using linear ...
  • M. L. Richter, W. Byttner, U. Krumnack, L. Schallner, and ...
  • M. L. Richter, L. Malihi, A.-K. P. Windler, and U. ...
  • J. Shenk, M. L. Richter, A. Arpteg, and M. Huss, ...
  • J. Shenk, M. L. Richter, W. Byttner, M. Marcinkiewicz “Delve: ...
  • D. P. Kingma, and J. Ba, “Adam: A method for ...
  • L. Bossard, M. Guillaumin, and L. Van Gool, “Food-۱۰۱ – ...
  • A. Krizhevsky, “Learning multiple layers of features from tiny images,” ...
  • Y. LeCun, and C. Cortes, “MNIST hand written digit database,”, ...
  • نمایش کامل مراجع