Fault Tolerance by Replication in Parallel Systems

Madhavi Vaidya

抽象的

Fault Tolerance by Replication in Parallel Systems

Madhavi Vaidya

In this paper the author has concentrated on architecture of a cluster computer and the working of them in context with parallel paradigms. Author has a keen interest on guaranteeing the working of a node efficiently and the data on it should be available at any time to run the task in parallel. The applications while running may face resource faults during execution. The application must dynamically do something to prepare for, and recover from, the expected failure. Typically, checkpointing is used to minimize the loss of computation. Checkpointing is a strategy purely local, but can be very costly. Most checkpointing techniques, however, require central storage for storing checkpoints. This results in a bottleneck and severely limits the scalability of checkpointing, while also proving to be too expensive for dedicated checkpointing networks and storage systems. The author has suggested the technique of replication implemented on it. Replication has been studied for parallel databases in general. Author has worked on parallel execution of task on a node; if it fails then self protecting feature should be turned on. Selfprotecting in this context means that computer clusters should detect and handle failures automatically with the help of replication

免责声明: 此摘要通过人工智能工具翻译，尚未经过审核或验证

期刊亮点

人工智能信息技术信息系统图形控制论数据库管理系统数据挖掘机器学习神经网络编程语言虚拟现实计算机人机交互计算机安全计算机工程计算机架构计算机科学计算理论计算生物学通讯网络

索引于

谷歌学术

学术期刊数据库

打开 J 门

学术钥匙

研究圣经

引用因子

电子期刊图书馆

参考搜索

哈姆达大学

学者指导

国际创新期刊影响因子（IIJIF）

国际组织研究所 (I2OR)

宇宙

国际期刊

制药科学医学科学工程普通科学

全球计算机科学研究杂志

抽象的

Fault Tolerance by Replication in Parallel Systems

期刊亮点

索引于

国际期刊

地址