Accelerating Legislation Processes through Semantic Similarity Analysis with BERT-based Deep Learning

سال انتشار: 1403
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 32

فایل این مقاله در 9 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_IJE-37-6_001

تاریخ نمایه سازی: 28 اسفند 1402

چکیده مقاله:

Countries are managed based on accurate and precise laws. Enacting appropriate and timely laws can cause national progress. Each law is a textual term that is added to the set of existing laws after passing a process with the approval of the assembly. In the review of each new law, the relevant laws are extracted and analyzed among the set of existing laws. This paper presents a new solution for extracting the relevant rules for a term from an existing set of rules using semantic similarity and deep learning techniques based on the BERT model. The proposed method encodes sentences or paragraphs of text in a fixed-length vector (dense vector space). Thereafter, the vectors are utilized to evaluate and score the semantic similarity of the sentences with the cosine distance measurement scale. In the proposed method, the machine can understand the meaning and concept of the sentences by using the BERT model coding method. The BERT model considers the position of the entities in the sentences. Then the semantic similarities of documents, calculating the degree of similarity between their documents with a subject, and detecting their semantic similarity are done. The results obtained from the test dataset indicated the precision and accuracy of the method in detecting semantic similarities of legal documents related to the Islamic Consultative Assembly of Iran, as well as the precision and accuracy of performance above ۹۰%.

کلیدواژه ها:

Text Mining ، Neural Network ، Semantic search ، Sentence embedding in vector space ، BERT model

نویسندگان

J. Naseri

Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran

H. Hasanpour

Faculty of Computer Engineering, Shahrood University of Technology, Shahrood, Iran

A. Ghanbari Sorkhi

Faculty of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • National strategic plan for research and development of artificial intelligence ...
  • Research in artificial intelligence and legislation and review of civil ...
  • Burri T, Von Bothmer F. The new EU legislation on ...
  • Devlin J, Chang M-W, Lee K, Toutanova K. Bert: Pre-training ...
  • Reimers N, Gurevych I. Making monolingual sentence embeddings multilingual using ...
  • Leskovec J, Rajaraman A, Ullman JD. Mining of massive data ...
  • Sadjadi S, Mashayekhi H, Hassanpour H. A two-level semi-supervised clustering ...
  • Jiang Y, Zhang X, Tang Y, Nie R. Feature-based approaches ...
  • Reimers N, Gurevych I. Sentence-bert: Sentence embeddings using siamese bert-networks. ...
  • Haveliwala TH, Gionis A, Klein D, Indyk P, editors. Evaluating ...
  • Pennington J, Socher R, Manning CD, editors. Glove: Global vectors ...
  • Sutskever I, Vinyals O, Le QV. Sequence to sequence learning ...
  • Hossain MZ, Akhtar MN, Ahmad RB, Rahman M. A dynamic ...
  • نمایش کامل مراجع