Cross-modal Image-Text Retrieval Using Support Vector Machine

سال انتشار: 1402
محل انتشار: اولین کنفرانس بین المللی هوش مصنوعی و خودروی هوشمند
کد COI اختصاصی: ICAISV01_009
زبان مقاله: انگلیسی
تعداد مشاهده: 292

دانلود فایل این مقاله

نویسندگان

Ali Goudarzi

Department of Computer Engineering Islamic Azad University, South Tehran Branch,Tehran, Iran

Fatemeh Taheri

Department of Computer Engineering Islamic Azad University, South Tehran Branch,Tehran, Iran

Kambiz Rahbar

Department of Computer Engineering Islamic Azad University, South Tehran Branch,Tehran, Iran

چکیده

With the increasing growth of multimodal data in the form of audio, video, image and text data, the importance of multimodal retrieval has also increased. One of the main challenges of cross-retrieval is to reduce the heterogeneity gap between different methods, such as retrieving images through texts or vice versa. Therefore, in this paper, a reciprocal retrieval method based on supervised learning is proposed. Image features including color, texture and shape are extracted using color auto-correlogram, Gabor filter and Zernike moments. Text features are also extracted using latent Dirichlet allocation method. Also, two support vector machines are trained separately to learn the features of images and side texts. Finally, mutual retrieval is done based on the classification results of the search modality and considering the smallest distance between the samples of the opposite modality.

کلیدواژه ها

Cross-modal retrieval Support vector machine, Auto correlogram, Gabor filter, Latent Dirichlet allocation

مقالات مرتبط جدید

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.