An Improved Clustering Analysis Method Based on Fuzzy C-Means Algorithm by Whale Optimization Algorithm
محل انتشار: سومین کنفرانس مهندسی صنایع ،اقتصاد و مدیریت
سال انتشار: 1399
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 2,414
فایل این مقاله در 13 صفحه با فرمت PDF و WORD قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
IEEM03_003
تاریخ نمایه سازی: 18 اسفند 1399
چکیده مقاله:
The use of big data has become widespread in many areas of human knowledge, including medicine and engineering. One of the most widely used processes on various types of data, especially big data, is cluster analysis or clustering. One of the most popular clustering methods developed by the distance approach is called Fuzzy C-Means (FCM). Having a simple structure, this method has been favored by developers in many applications; however, it cannot be used in the clustering of big data. Due to the large number of objects, they are not loadable in the main memory of ordinary computer systems at run time; therefore, they are impossible to be processed at once. Moreover, FCM is sensitive to cluster center initialization, so that inappropriate initialization may lead to slow or non-optimal convergence. Optimization methods are usually used to solve the FCM convergence problem and to find more appropriate cluster centers. In this thesis, a new clustering method has been introduced in which a whale optimization algorithm is used to solve the FCM convergence problem. Furthermore, random sampling of data, application of the clustering on samples, and ultimately, extension of the clustering results to all data have been proposed as a solution for the problem of big data clustering. In order to reduce the effect of the selected samples on the performance of clustering in this solution, sampling is repeated several times, and at the end, the clustering results are combined. Results from the application of the proposed clustering method on artificial and actual databases indicate the accuracy of the proposed method compared with that of other similar methods.
کلیدواژه ها:
نویسندگان
Seyed Emadedin Hashemi
Department of Industrial Engineering, Islamic Azad University Arak Branch, Arak , Iran