We have multiple M1000e chassis in our datacenter, all fully populated with M610 blades. We have now had several instances where, for some unknown reason, the primary CMC heartbeat is lost, triggering a CMC failover event. As soon as the standby CMC takes over, CPU usage on many of the M610s in that chassis suddenly spikes and they become nearly unresponsive. Once we fail back to the primary CMC, they calm back down.
We have verified that all power settings are correct and that the CMCs are all running the same firmware version. The M610s are all running RHEL 6.
Has anyone else had this happen? Is there an update or patch somewhere that resolves this? In one chassis we've gone so far as to pull the redundant CMC and run on a single CMC until we can figure out a fix.