A new method for Resource Management System (RMS) Fault Tolerance in Grid Computing

سال انتشار: 1390
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 2,387

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

CSCCIT01_047

تاریخ نمایه سازی: 8 بهمن 1390

چکیده مقاله:

Since the grid system is implemented on a network framework with heterogeneous remote resources, it is a hazardous environment which fault and failure are familiar events. Users expect that their jobs executed reliable and fast in the grid. Thus, reliability and fault tolerance are important challenges in grid researches. The grid service reliability and fault tolerance are discussed in this paper. Resource management system is the brain of a grid and responsible for the management and execution of tasks. In this paper, we propose a new method for grid Resource Management Systems (RMS) fault-tolerance. In this method, we add a new layer tops of the RMSs site for support them when one or more RMSs are failed. This layer is composed of components that called RMSS (Resource Management System Supporter). Our goal is reliability improvement and consequently decreasing penalties that paid to the users. This method does not need redundant RMSs, which leads to decrease hardware redundancy and implementation costs. MATLAB software is used for analysis of our proposed method. Analysis of results shows that the reliability factor is improved and consequently the penalties are decreased.

نویسندگان

bahman arasteh

Islamic Azad University of Tabriz, Tabriz, Iran

faraz dastan

Islamic Azad University of Tabriz, Tabriz, Iran

saeed alavi

Islamic Azad University of Tabriz, Tabriz, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • Lyu, M. "Handbook of Software Reliability Engineering, " McGraw- Hill ...
  • Baker, M., Buyya, R., Laforenza, D., Grids and Grid technologies ...
  • Foster, I., Kesselman, C., Tuecke, S., The Anatomy of the ...
  • Levitin, G., Dai, Y.S. Service reliability and performance in grid ...
  • Dai, Y.S., Wang X.L. Optimal resource allocation on grid systems ...
  • Dai, Y.S., Levitin, G., Wang, X. Optimal task partition and ...
  • Krauter, k., Buyya, R., Maheswaran, M. A taxonomy and survey ...
  • Levitin, G., Dai, Y.S. Optimal service task partition and distribution ...
  • Avizienis, _ Laprie, J., Randle, B., Landwehr, C. Basic Concepts ...
  • Chepten, M., Claeys, A, Dhoet, B., DE Turck, F., Demeester, ...
  • IEEE Transaction on parallel and Distributed Systems, Vol. 21, no.2, ...
  • نمایش کامل مراجع