We recently setup a PRTG Failover Cluster node and everything connected and setup fine. We have been getting an error regularly overnight where the Cluster node will show up as the Cluster Probe device being disconnected for a short period of time and then re-establish. We have been unable to find any cause for this and was wondering if anyone else experienced this and may have found a reason. Windows logs and PRTG logs do not seem to show any reason.
Article Comments
Below is a copy off error we get. Have Uploaded support bundle as well Cluster Probe > Cluster Probe > Cluster Probe Device (127.0.0.1)
Sensor Cluster Health (Cluster Health) * New Status at 29/04/2016 2:43:43 AM (AUS Eastern Standard Time): Down ended (now: Down Partial) Last Message: 1 # (Cluster Nodes Disconnected) is above the error limit of 0 # in Cluster Nodes Disconnected
Last Scan: Last Up: Last Down: Uptime: Downtime: Coverage: Sensor Type: Interval: 60 s 60 s 120 s 99.4755% 0.5245% 100% Cluster Health 60 s
Check Now Acknowledge Alarm Pause Resume
Pause for 5 minutes Pause for 60 minutes Pause for 24 hours
Channel Last Value Cluster Messages In 9 Msg/min Cluster Messages Out 10 Msg/min Cluster Nodes Disconnected 1 # Connects 1 #/min Downtime Outbound Cluster Connections 1 #
History 29/04/2016 1:55:43 AM Down, 1 # (Cluster Nodes Disconnected) is above the error limit of 0 # in Cluster Nodes Disconnected 29/04/2016 1:55:43 AM Down, 1 # (Cluster Nodes Disconnected) is above the error limit of 0 # in Cluster Nodes Disconnected 29/04/2016 1:48:57 AM Up, 1 # 29/04/2016 1:47:43 AM Up, 1 # 29/04/2016 1:47:43 AM Up, 1 # 29/04/2016 1:35:43 AM Down Partial, 1 # (Cluster Nodes Disconnected) is above the error limit of 0 # in Cluster Nodes Disconnected
Don't want to receive this email? Please edit the triggers of this sensor or edit your notification settings!
Apr, 2016 - Permalink
?? What was the result of this?? I'm having a similar issue. many times a day my cluster health sensor goes down. Indicating the same error "Cluster nodes Disconnected".
Mar, 2020 - Permalink
In that case the issue was that the Failover occasionally lost connection due to network problems. Please check the connection between the two nodes.
Apr, 2020 - Permalink
Hello,
Please forward us also a Support Bundle from both cluster nodes for analysis. This can be done via the "Contact Support" ribbon in the lower right corner of the failover´s web interface. Please refer to this kb post.
Apr, 2016 - Permalink