Modeling Intra-label Dynamics and Analyzing the Role of Blank in Connectionist Temporal Classification
محل انتشار: مجله مهندسی کامپیوتر و دانش، دوره: 1، شماره: 2
سال انتشار: 1397
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 337
فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_CKE-1-2_006
تاریخ نمایه سازی: 3 اسفند 1398
چکیده مقاله:
The goal of many tasks in the realm of sequence processing is to map a sequence of input data to a sequence of output labels. Long short-term memory (LSTM), a type ofrecurrent neural network (RNN), equipped with connectionist temporal classification (CTC) has been proved to be one of the most suitable tools for such tasks. With theaid of CTC, the existence of per-frame labeled sequences are no longer necessary and it suffices to only knowing the sequence of labels. However, in CTC, only a single state isassigned to each label and consequently, LSTM would not learn the intra-label relationships. In this paper, we propose to remedy this weakness by increasing the number of states assigned to each label and actively modeling such intra-label transitions. On the other hand, the output of a CTC network usually corresponds to the set of all possible labels along with a blank. One of the uses of blank is in the recognition of multiple consecutive identical labels. Assigning more than one state to each label, we can also decode consecutive identical labels without resorting to the blank. We investigated the effect of increasing the number of sub-labels with/without blank on the recognition rate of the system. We performed experiments on two printed and handwritten Arabic datasets. Our experiments showed that while on simple tasks a model without blank may converge faster, on real-world complex datasets use of blank significantly improves the results.
کلیدواژه ها:
Connectionist Temporal Classification ، Handwriting Recognition ، Recurrent Neural Networks ، Multidimensional Long Short Term Memory ، Blank.
نویسندگان
Ashkan Sadeghi Lotfabadi
Department of Computer Engineering Ferdowsi University of Mashhad, Iran
Kamaledin Ghiasi-Shirazi
department of Computer Engineering Ferdowsi University of Mashhad, Iran
Ahad Harati
Department of Computer Engineering Ferdowsi University of Mashhad, Iran.