| ||||||||||||||
|
SynchronizationSlow But Sure The good news: the active RaQ3i continues to serve requests during resynchronization, and we encountered no data corruption or loss when we interrupted this process - it simply restarted again from scratch. Web transactions and mail transfers completed before failover were mirrored to the standby and available after failover. The bad news: during resynchronization, HA services are (naturally) unavailable, leaving the active vulnerable to failure. FailoverWhen, Why, and How? If the active cannot reach the standby over Network 1, HA services are unavailable but real-time synchronization continues over Network 2. When Network 1 reachability is restored, HA services resume immediately, without further resynchronization. If the active and standby cannot reach each other over Network 2, the active continues to service clients over Network 1 but drops into a non-HA state. When Network 2 reachability is restored, a full resynchronization is required before HA services are available. If the active and standby cannot reach the default gateway, there is no point in initiating failover, because both servers are configured with the same gateway. Thus, StaQWare cannot be used to increase availability through diverse routing. StaQWare indicates the problem, but does so in a confusing manner. The standby LCD displays "RaQ is not in HA *check cabling*"nearly always sound advice. But GUI status and mail notifications indicate "Standby RaQ has failed or been disconnected". One cannot determine the real problem unless actually looking at the standby's LCD panel. We tested network failure by yanking Ethernet cables. We also tested abrupt power failures. When the active loses power, the standby checks its own file system, then takes over the active's address and restores service (in our experience, within 6-7 minutes). When the standby loses power, HA services become unavailable until the standby is restored and data is completely resynchronized. If both units lose power? The active and standby search for each other for upon restart. After 5 minutes, if the standby cannot locate the active, the standby takes over, restoring service within about 7 minutes. For faster restoration, the standby's LCD panel can be used to boot in active mode.
|
|
||||||||||||
|
|
||||||||||||||