Fault Tolerance Techniques
Rollback Recovery
Rollback Recovery reverts a system to a previously saved, known good state when an error is detected. It provides fault tolerance by discarding the effects of the error so that the system can continue operating from that state.
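As a minimal sketch of the idea, the example below saves a copy of an in-memory state before each update and restores it when an error is detected. The `RollbackStore` class, the `apply_update` helper, and the "negative values are errors" rule are all hypothetical, chosen only for illustration.

```python
import copy


class RollbackStore:
    """Hypothetical key-value store that can revert to a saved state."""

    def __init__(self):
        self.state = {}
        self._saved = {}

    def save(self):
        # Record the current state as the known good state.
        self._saved = copy.deepcopy(self.state)

    def rollback(self):
        # Discard everything changed since the last save.
        self.state = copy.deepcopy(self._saved)


def apply_update(store, key, value):
    """Apply an update; roll back and return False if an error is detected."""
    store.save()
    try:
        store.state[key] = value
        if value < 0:  # assumed error condition, for illustration only
            raise ValueError("negative value")
    except ValueError:
        store.rollback()
        return False
    return True
```

After a rejected update, `store.state` holds exactly what it held before the update began.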
Failover
Failover is the automatic switching to a standby system or component upon the failure of the currently active system. It provides fault tolerance by minimizing downtime and maintaining service availability.
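A rough sketch of failover, assuming each system is modeled as a plain Python callable that raises `ConnectionError` when it is down. The `call_with_failover`, `primary`, and `standby` names are hypothetical.

```python
def call_with_failover(services, request):
    """Try each service in priority order; return the first successful reply."""
    last_error = None
    for service in services:
        try:
            return service(request)
        except ConnectionError as err:
            # This service is down: remember why and move to the next one.
            last_error = err
    raise RuntimeError("all services failed") from last_error


# Hypothetical services: the primary is down, the standby works.
def primary(request):
    raise ConnectionError("primary unreachable")


def standby(request):
    return f"standby handled {request}"


result = call_with_failover([primary, standby], "ping")
```

The caller never sees the primary's failure; it only sees the standby's reply.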
Failback
Failback is the process of returning operations to the primary system after a failover, once the primary is back online and stable. It provides fault tolerance by restoring normal operations after an incident.
Redundancy
Redundancy means having extra hardware, software, or data that can be used as a backup. It provides fault tolerance by allowing systems to switch to redundant components in the event of a failure.
Majority Consensus
Majority Consensus is a method in which a value or decision is accepted only when more than half of the replicas agree on it. It provides fault tolerance because any two majorities share at least one node, so the system state stays consistent as long as fewer than half of the nodes fail.
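The majority rule can be sketched as a simple vote count. This assumes votes are hashable values collected from the nodes that responded; `majority_value` is a hypothetical helper, not a full consensus protocol.

```python
from collections import Counter


def majority_value(votes, total_nodes):
    """Return the value backed by more than half of all nodes, else None."""
    if not votes:
        return None
    value, count = Counter(votes).most_common(1)[0]
    # Compare against the full cluster size, not just the nodes that answered.
    return value if count > total_nodes // 2 else None
```

With 5 nodes, two matching votes are not enough, while three are, even if the other two nodes never respond.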
State Machine Replication
State Machine Replication ensures that copies of a service on different nodes execute the same deterministic operations in the same order, so they all reach the same state. It provides fault tolerance because any surviving replica holds the current state and can keep serving requests when other nodes fail.
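To illustrate why order matters, the sketch below applies one shared command log to three replicas of a toy counter. The `CounterReplica` class and the `add`/`mul` commands are hypothetical; agreeing on the log order in a real system is a separate problem not shown here.

```python
class CounterReplica:
    """Hypothetical deterministic state machine holding a single integer."""

    def __init__(self):
        self.value = 0

    def apply(self, command):
        op, amount = command
        if op == "add":
            self.value += amount
        elif op == "mul":
            self.value *= amount
        else:
            raise ValueError(f"unknown command: {op}")


# Every replica receives the same commands in the same order.
log = [("add", 2), ("mul", 5), ("add", 1)]
replicas = [CounterReplica() for _ in range(3)]
for command in log:
    for replica in replicas:
        replica.apply(command)

# A replica that applies the same commands in a different order diverges.
out_of_order = CounterReplica()
for command in reversed(log):
    out_of_order.apply(command)
```

The three in-order replicas end with the same value, while `out_of_order` ends with a different one.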
Time Redundancy
Time Redundancy involves performing operations or calculations multiple times to ensure correctness. It provides fault tolerance by re-executing tasks to recover from transient errors.
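One common form of time redundancy is retrying a failed task. The sketch below assumes transient faults surface as exceptions; `run_with_retries` and `flaky_task` are hypothetical names.

```python
def run_with_retries(task, attempts=3):
    """Run task up to `attempts` times; return the first successful result."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for _ in range(attempts):
        try:
            return task()
        except Exception as err:
            # Treat the failure as transient and execute the task again.
            last_error = err
    raise last_error


# Hypothetical flaky task: fails twice, then succeeds.
calls = {"count": 0}


def flaky_task():
    calls["count"] += 1
    if calls["count"] < 3:
        raise TimeoutError("transient fault")
    return "done"
```

Retrying only helps with transient faults; a permanent fault fails on every attempt and the last error is raised.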
Load Balancing
Load Balancing is the process of distributing workload evenly across multiple systems. It provides fault tolerance by preventing any single node from becoming a bottleneck and by supporting graceful degradation in case of node failure.
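A minimal round-robin sketch, assuming node health is reported from outside through `mark_down`. The `RoundRobinBalancer` class is hypothetical and ignores node capacity and current load.

```python
import itertools


class RoundRobinBalancer:
    """Hypothetical balancer that rotates through nodes, skipping failed ones."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.healthy = set(self.nodes)
        self._cycle = itertools.cycle(self.nodes)

    def mark_down(self, node):
        self.healthy.discard(node)

    def next_node(self):
        # Look at each node at most once per call.
        for _ in range(len(self.nodes)):
            node = next(self._cycle)
            if node in self.healthy:
                return node
        raise RuntimeError("no healthy nodes")
```

When a node is marked down, its share of requests goes to the remaining nodes, which is the graceful degradation described above.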
Heartbeat
A heartbeat is a periodic signal sent between nodes to monitor availability. It provides fault tolerance by detecting failures early: a node whose heartbeats stop arriving is presumed to have failed, so the system can react before a full outage occurs.
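A sketch of heartbeat-based failure detection, assuming timestamps are passed in explicitly so that the logic is easy to follow. The `HeartbeatMonitor` class and its `timeout` parameter are hypothetical.

```python
class HeartbeatMonitor:
    """Hypothetical monitor that tracks the last heartbeat time per node."""

    def __init__(self, timeout):
        self.timeout = timeout
        self.last_seen = {}

    def record(self, node, now):
        # Called whenever a heartbeat arrives from a node.
        self.last_seen[node] = now

    def failed_nodes(self, now):
        # A node is presumed failed if it has been silent longer than timeout.
        return sorted(
            node
            for node, seen in self.last_seen.items()
            if now - seen > self.timeout
        )
```

The timeout is a trade-off: a short one detects failures sooner, while a long one avoids flagging nodes that are merely slow.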
Replication
Replication involves creating copies of data or services across different nodes. It provides fault tolerance by ensuring system functionality even when one or more nodes fail, as other nodes with replicas can take over.
Checkpointing
Checkpointing is the process of saving the state of a system at intervals. It provides fault tolerance by enabling the system to resume from the last checkpoint in case of failure.
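As a small sketch, the function below sums a list while saving its progress to a JSON file after every item, so that a restarted run resumes from the last checkpoint. The `process_with_checkpoints` name, the file layout, and the `fail_at` parameter (which simulates a crash) are assumptions made for illustration.

```python
import json
import os


def process_with_checkpoints(items, checkpoint_path, fail_at=None):
    """Sum items, saving progress after each one so a restart can resume."""
    start, total = 0, 0
    if os.path.exists(checkpoint_path):
        # Resume from the last saved checkpoint instead of starting over.
        with open(checkpoint_path) as f:
            saved = json.load(f)
        start, total = saved["next_index"], saved["total"]
    for i in range(start, len(items)):
        if i == fail_at:
            raise RuntimeError("simulated crash")
        total += items[i]
        with open(checkpoint_path, "w") as f:
            json.dump({"next_index": i + 1, "total": total}, f)
    return total
```

If a run crashes partway through, calling the function again with the same checkpoint path continues from the first unprocessed item.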
Data Mirroring
Data Mirroring is the practice of copying data in real time to another location. It provides fault tolerance by keeping an up-to-date copy that remains available from the alternative location if the primary location fails.
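A toy sketch of synchronous mirroring, assuming both copies are in-memory dictionaries and that failure of the primary is simulated by a flag. The `MirroredStore` class is hypothetical.

```python
class MirroredStore:
    """Hypothetical store that writes every update to two locations."""

    def __init__(self):
        self.primary = {}
        self.mirror = {}
        self.primary_available = True  # set to False to simulate a failure

    def write(self, key, value):
        # Synchronous mirroring: update both copies on every write.
        self.primary[key] = value
        self.mirror[key] = value

    def read(self, key):
        # Serve reads from the mirror when the primary is unavailable.
        source = self.primary if self.primary_available else self.mirror
        return source[key]
```

Because every write reaches both copies, a read served from the mirror returns the same value the primary would have returned.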
© Hypatia.Tech. 2024 All rights reserved.