Designing Persian Language Parser Tool

  • Year of publication: 1392
  • Venue: 5th Iranian National Conference on Electrical and Electronics Engineering
  • COI code: ICEEE05_542
  • Paper language: English
  • Views: 1885

Authors

Ahmad Estiri

Ferdowsi University of Mashhad

Mohsen Kahani

Ferdowsi University of Mashhad

Hadi Ghaemi

Ferdowsi University of Mashhad

Ehsan Asgarian

Ferdowsi University of Mashhad

Abstract

Alongside theoretical developments in modern linguistics, computational methods for analyzing text and grammar have also advanced. Here, the grammar of a language refers to a set of grammatical rules expressed in a form a computer can process, with which the syntactic components of any given sentence can be analyzed accurately. Parser tools decompose a sentence into its components (noun phrases, verb phrases, adverbial phrases, etc.), and they play a vital role in the design and accuracy of other text-analysis tools. The parser presented here builds the syntactic (parse) tree by taking into account lexical morphology, the position and order of words in the sentence, and the preceding and following lexical items. In other words, parsing relies on both the lexical morphology (the study of the form of words) and the syntax of the Persian language. Consequently, the more carefully a sentence is written and the more accurate its punctuation, the more accurate and successful the parsing. In addition, tagging the different components of the sentence becomes easier.
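The idea of splitting a sentence into phrases from each word's category and its neighbours can be illustrated with a toy example. The sketch below is not the parser described in the paper: the tag set (`N`, `PRO`, `ADJ`, `DET`, `V`, `ADV`, `POSTP`), the grouping rules, and the function name `chunk` are all assumptions made for illustration, and the input is assumed to be already tokenized and tagged. It produces flat phrases only, not a full parse tree.

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (word, POS tag); this tag set is hypothetical

# Which phrase type each tag projects to (an illustrative assumption).
PHRASE_OF = {"N": "NP", "PRO": "NP", "ADJ": "NP", "DET": "NP",
             "V": "VP", "ADV": "ADVP"}
STARTS_NEW = {"PRO", "DET"}  # tags that always open a new phrase


def chunk(tokens: List[Token]) -> List[Tuple[str, List[str]]]:
    """Group tagged tokens into flat phrases, scanning left to right."""
    phrases: List[Tuple[str, List[str]]] = []
    closed = True  # True when the last phrase may not be extended
    for word, tag in tokens:
        if tag == "POSTP" and phrases and phrases[-1][0] == "NP":
            # A postposition such as the object marker "ra" attaches to
            # the preceding noun phrase and closes it.
            phrases[-1][1].append(word)
            closed = True
            continue
        label = PHRASE_OF.get(tag, "X")  # "X" marks an unknown tag
        extend = (not closed and tag not in STARTS_NEW
                  and phrases[-1][0] == label)
        if extend:
            phrases[-1][1].append(word)
        else:
            phrases.append((label, [word]))
        # In this sketch a pronoun is treated as a complete NP on its own.
        closed = (tag == "PRO")
    return phrases


# Transliterated Persian: "man diruz ketab ra khandam"
# ("I read the book yesterday"); the tags are assigned by hand.
sentence = [("man", "PRO"), ("diruz", "ADV"), ("ketab", "N"),
            ("ra", "POSTP"), ("khandam", "V")]
for label, words in chunk(sentence):
    print(label, words)
```

For the hand-tagged sentence this yields a noun phrase for the subject, an adverbial phrase, a noun phrase containing the object and its marker, and a verb phrase. A real parser would also need tokenization, morphological analysis, and a proper grammar, none of which this sketch attempts.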

Keywords

Natural Language Processing (NLP), Farsi, Parser, parse tree, morphology
