Achieve a level of high reliability in a microprocessor system by adding a second identical processor to a system to monitor and verify the system processor operation – also known as the lockstep processor technique. This tip demonstrates the integrated lockstep facility in the PowerPC 750GX processor.
Stratus does this in hardware. Marathon Technologies does this in software. Both take care of many other failure modes (disk, power supply, I/O, memory, and in Marathon’s design, even driver faults) that are much more likely to cause a system failure. I mean it’s cool IBM is working on this (and they have some really strong high availability talent in house), but they need to take it to the system level (not just one component, and especially one – the CPU – that is among the least likely to fail).
System 88?
The System 88 was OEM’d from Stratus.
Are these the dual core chips used in the high end PowerMac G5?
No, the Power Mac G5 uses the 970MP.
The PowerPC 750GX processor is also known as the G3.
They have been doing this for years with their mainframes, to the point that it swaps out the bad hardware and calls service with no user or operator intervention, and without a single bad bit hitting disk. This is how they have years between reboots on what is now called the Z series (370/390 for you old schoolers.)
I think it will be great if we can get this kind of fail over all of the way down into SMB sized machines, but I amso agree that it needs to be not only the processor, but other componenets too Add memory to the list, since we already have RAID, and we are getting close.
So can this have more than 2 CPUs? And could they interface to different, identical circuits for I/O?