Unsolved
This post is more than 5 years old
1 Message
0
5640
OME 1.2.1 hardware reporting reliability
Hello
I am trying to reduce the number of times our datacenter staff needs to perform visual checks on our Dell servers for hardware alerts: hdd, fan, psu etc.
I am using OMSA + snmp combo for both Windows and ESXI. Server hardware is mostly a mix from 9th-12th gen. Reporting has been fairly reliable with some exceptions. One 2008R2 server was not listing any hardware alerts and the status was unknown. Logging in locally, found the DSM SA Data Manager service had quit. After starting it up, OME started getting alert data from this system. There a few other Windows 2003 servers that have unknown health but report alerts just fine.
What is the best strategy to ensure reliable alerting? I know I can set Windows to auto-restart DSM services upon failure, and probably need to do the same for snmp service. How do I accomplish similar on ESXI?
Looks like I posted in the wrong section, can someone move the posting? Thanks