Hey there,
I'm running Operations Manager 2012 R2 and have 4 agents that consistently show this alert despite my best efforts to resolve it. On the agents I'll see an event 4000 followed immediately by an event 1103:
Log Name: Operations Manager
Source: HealthService
Date: 1/24/2014 11:47:05 AM
Event ID: 4000
Task Category: Health Service
Level: Error
Keywords: Classic
User: N/A
Computer: HOSTNAME
Description:
A monitoring host is unresponsive or has crashed. The status code for the host failure was 2164195371.
Log Name: Operations Manager
Source: HealthService
Date: 1/24/2014 11:47:05 AM
Event ID: 1103
Task Category: Health Service
Level: Warning
Keywords: Classic
User: N/A
Computer: HOSTNAME
Description:
Summary: 273 rule(s)/monitor(s) failed and got unloaded, 273 of them reached the failure limit that prevents automatic reload. Management group "mymanagementgroup". This is summary only event, please see other events with descriptions of unloaded
rule(s)/monitor(s).
There's also a corresponding event 1000 in the Application log:
Log Name: Application
Source: Application Error
Date: 1/24/2014 11:28:14 AM
Event ID: 1000
Task Category: (100)
Level: Error
Keywords: Classic
User: N/A
Computer: HOSTNAME
Description:
Faulting application name: MonitoringHost.exe, version: 7.1.10184.0, time stamp: 0x522a23d5
Faulting module name: MSVCR100.dll, version: 10.0.40219.325, time stamp: 0x4df2bcac
Exception code: 0xc0000417
Fault offset: 0x0000000000070468
Faulting process id: 0x1e38
Faulting application start time: 0x01cf1932066335fb
Faulting application path: C:\Program Files\Microsoft Monitoring Agent\Agent\MonitoringHost.exe
Faulting module path: C:\Windows\system32\MSVCR100.dll
Report Id: 48dc1e0c-8525-11e3-ae8c-005056a25687
Things I've already tried to resolve the issue:
- Tried to repair the agent through the operations manager console
- Uninstalled and reinstalled the agent by pushing it and installing the agent manually
- Restarted the healthservice on the agents
What other things can I do to try to diagnose what's broken? I have 265 other agents that are working just fine and I can't figure out what's different about these 4 agents.
The agents are running Server 2008 R2
- Edited by Austin Gruenweller Friday, January 24, 2014 7:58 PM