Job Failure Prediction in Grid Environment Based on Workload Characteristics

سال انتشار: 1388
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 2,006

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

CSICC14_006

تاریخ نمایه سازی: 24 خرداد 1388

چکیده مقاله:

The power of grid technology in aggregating autonomous resources owned by several organizations into a single virtual system has made it popular in compute-intensive and data-intensive applications. Complex and dynamic nature of grid makes failure of users’ jobs fairly probable. Furthermore, traditional methods for job failure recovery have proven costly and thus a need to shift toward proactive and predictive management strategies is necessary in such systems. In this paper, an innovative effort is made to predict the futurity of jobs submitted to a production grid environment (AuverGrid). By analyzing grid workload traces and extracting patterns describing common failure characteristics, the success or failure status of jobs during 6 months of AuverGrid activity was predicted with around 96% accuracy. The quality of services on grid can be improved by integrating the result of this work into management services like scheduling and monitoring.

نویسندگان

Hamid Fadishei

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran

Hamid Saadatfar

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran

Hossein Deldari

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran