diff --git a/Huawei Server HMM Template V1.1.xml b/Huawei Server HMM Template V1.1.xml new file mode 100644 index 0000000..8086874 --- /dev/null +++ b/Huawei Server HMM Template V1.1.xml @@ -0,0 +1,1911 @@ + + + 3.4 + 2018-10-24T19:10:37Z + + + Huawei Server + + + + + + + + {Huawei Server HMM Template:shelfHealth.last()}>3 and {Huawei Server HMM Template:shelfHealth.last()}<8 + 0 + + HMMshelfHealth critical + 0 + + + 0 + 5 + + 0 + 0 + + + + + {Huawei Server HMM Template:shelfHealth.last()}=2 or {Huawei Server HMM Template:shelfHealth.last()}=3 + 0 + + HMMshelfHealth major + 0 + + + 0 + 4 + + 0 + 0 + + + + + {Huawei Server HMM Template:shelfHealth.last()}=1 + 0 + + HMMshelfHealth minor + 0 + + + 0 + 2 + + 0 + 0 + + + + + {Huawei Server HMM Template:systemHealth.last()}>3 and {Huawei Server HMM Template:systemHealth.last()}<8 + 0 + + HMMsystemHealth critical + 0 + + + 0 + 5 + + 0 + 0 + + + + + {Huawei Server HMM Template:systemHealth.last()}=2 or {Huawei Server HMM Template:systemHealth.last()}=3 + 0 + + HMMsystemHealth major + 0 + + + 0 + 4 + + 0 + 0 + + + + + {Huawei Server HMM Template:systemHealth.last()}=1 + 0 + + HMMsystemHealth minor + 0 + + + 0 + 2 + + 0 + 0 + + + + + {Huawei Server HMM Template:smmHealth.last()}>3 and {Huawei Server HMM Template:systemHealth.last()}<8 + 0 + + smmHealth critical + 0 + + + 0 + 5 + + 0 + 0 + + + + + {Huawei Server HMM Template:smmHealth.last()}=2 or +{Huawei Server HMM Template:smmHealth.last()}=3 + 0 + + smmHealth major + 0 + + + 0 + 4 + + 0 + 0 + + + + + {Huawei Server HMM Template:smmHealth.last()}=1 + 0 + + smmHealth minor + 0 + + + 0 + 2 + + 0 + 0 + + + + + + + Ambient Temperature (°C) + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server HMM Template + Ambient_Temp + + + + + + inlet Temperature (°C) + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server HMM Template + Inlet_Temp + + + + + + LSW Temperature (°C) + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server HMM Template + LSW_Temp + + + + + + outlet Temperature (°C) + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server HMM Template + Outlet_Temp + + + + + + shelfRealTimePower (W) + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server HMM Template + shelfRealTimePower + + + + + + + + Huawei server HMMsystemHealth + + + 0 + ok + + + 1 + minor + + + 2 + major + + + 3 + majorandminor + + + 4 + critical + + + 5 + criticalandminor + + + 6 + criticalandmajor + + + 7 + criticalandmajorandminor + + + + + Huawei server smmHealth + + + 1 + normal + + + 2 + minor + + + 3 + major + + + 4 + critical + + + + + Huawei sever HMM Blade temperature + + + 2.15 G + Invalide Temperature value + + + 2.15 G°C + Invalide Temperature value + + + 2147483647 + Invalide Temperature value + + + 2147483647°C + Invalide Temperature value + + + + + Huawei sever invalid CPU usage + + + 65535 % + invalid current CPU usage + + + 65536 + invalid current CPU usage + + + + + diff --git a/Huawei Server ValueMap V1.1.xml b/Huawei Server ValueMap V1.1.xml new file mode 100644 index 0000000..379dec5 --- /dev/null +++ b/Huawei Server ValueMap V1.1.xml @@ -0,0 +1,419 @@ + + + 3.4 + 2018-10-22T11:38:58Z + + + Huawei server fanEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server hardDiskEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server HMMsystemHealth + + + 0 + ok + + + 1 + minor + + + 2 + major + + + 3 + majorandminor + + + 4 + critical + + + 5 + criticalandminor + + + 6 + criticalandmajor + + + 7 + criticalandmajorandminor + + + + + Huawei server hwEntityFanState + + + 1 + normal + + + 2 + abnormal + + + + + Huawei server invalid Memory usage + + + 65536 + invalid current memory usage + + + + + Huawei server memoryEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server powerSupplyEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server raidControllerHealthStatus + + + 0 + ok + + + 1 + memory correctable error + + + 2 + memory uncorrectable error + + + 3 + memory correctable error and memory uncorrectable error + + + 4 + memory ECC error reached limit + + + 5 + memory ECC error reached limit and memory correctable error + + + 6 + memory ECC error reached limit and memory uncorrectable error + + + 7 + memory ECC error reached limit | uncorrectable|correctable error + + + 8 + NVRAM uncorrectable error + + + 9 + NVRAM uncorrectable error and memory correctable error + + + 10 + NVRAM uncorrectable error and memory uncorrectable error + + + 11 + NVRAM uncorrectable error | uncorrectable|correctable error + + + 12 + NVRAM uncorrectable error and memory ECC error reached limit + + + 13 + NVRAM uncorrectableError|ECC ErrorReachedLimit|correctableError + + + 14 + NVRAM|memory uncorrectableError|ECC ErrorReachedLimit + + + 15 + NVRAM|memory Uncorrectable Error|ECC ErrorReachedLimit + + + 65535 + unknown + + + + + Huawei server smmHealth + + + 1 + normal + + + 2 + minor + + + 3 + major + + + 4 + critical + + + + + Huawei server systemHealth + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + + + Huawei sever cpuEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei sever HMM Blade temperature + + + 2.15 G + Invalide Temperature value + + + 2.15 G°C + Invalide Temperature value + + + 2147483647 + Invalide Temperature value + + + 2147483647°C + Invalide Temperature value + + + + + Huawei sever invalid CPU usage + + + 65535 % + invalid current CPU usage + + + 65536 + invalid current CPU usage + + + + + Huawei sever invalid Memory usage + + + 65536 + invalid current memory usage + + + + + Huawei sever memoryEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei sever systemHealth + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + + + diff --git a/Huawei Server iBMC Template V1.1.xml b/Huawei Server iBMC Template V1.1.xml new file mode 100644 index 0000000..13a1b38 --- /dev/null +++ b/Huawei Server iBMC Template V1.1.xml @@ -0,0 +1,2392 @@ + + + 3.4 + 2018-10-24T15:34:02Z + + + Huawei Server + + + + + + + + {Huawei Server iBMC Template:systemHealth.last()}=4 + 0 + + systemHealth Critical + 0 + + + 0 + 5 + systemHealth information about system present state of health. + This value will be one of the following: + (1-OK, 2-Minor, 3-Major, 4-Critical) + 0 + 0 + + + + + {Huawei Server iBMC Template:systemHealth.last()}=3 + 0 + + systemHealth major + 0 + + + 0 + 4 + systemHealth information about system present state of health. + This value will be one of the following: + (1-OK, 2-Minor, 3-Major, 4-Critical) + 0 + 0 + + + + + {Huawei Server iBMC Template:systemHealth.last()}=2 + 0 + + systemHealth minor + 0 + + + 0 + 2 + systemHealth information about system present state of health. + 0 + 0 + + + + + + + averagePower + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + averagePower + + + + + + diskPartitionUsage + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + diskPartitionUsage + + + + + + InletTemperature + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + temperatureVlue + + + + + + peakPower + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + peakPower + + + + + + powerConsumption + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + powerConsumption + + + + + + presentSystemPower + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 2 + 0 + + Huawei Server iBMC Template + presentSystemPower + + + + + + systemCpuUsage + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 7 + 0 + + Huawei Server iBMC Template + systemCpuUsage + + + + + + systemMemUsage + 900 + 200 + 0.0000 + 100.0000 + 1 + 1 + 0 + 1 + 0 + 0.0000 + 0.0000 + 0 + 0 + 0 + 0 + + + 0 + 0 + 1A7C11 + 0 + 7 + 0 + + Huawei Server iBMC Template + systemMemUsage + + + + + + + + Huawei server fanEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server hardDiskEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server invalid Memory usage + + + 65536 + invalid current memory usage + + + + + Huawei server powerSupplyEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei server raidControllerHealthStatus + + + 0 + ok + + + 1 + memory correctable error + + + 2 + memory uncorrectable error + + + 3 + memory correctable error and memory uncorrectable error + + + 4 + memory ECC error reached limit + + + 5 + memory ECC error reached limit and memory correctable error + + + 6 + memory ECC error reached limit and memory uncorrectable error + + + 7 + memory ECC error reached limit | uncorrectable|correctable error + + + 8 + NVRAM uncorrectable error + + + 9 + NVRAM uncorrectable error and memory correctable error + + + 10 + NVRAM uncorrectable error and memory uncorrectable error + + + 11 + NVRAM uncorrectable error | uncorrectable|correctable error + + + 12 + NVRAM uncorrectable error and memory ECC error reached limit + + + 13 + NVRAM uncorrectableError|ECC ErrorReachedLimit|correctableError + + + 14 + NVRAM|memory uncorrectableError|ECC ErrorReachedLimit + + + 15 + NVRAM|memory Uncorrectable Error|ECC ErrorReachedLimit + + + 65535 + unknown + + + + + Huawei sever cpuEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei sever invalid CPU usage + + + 65535 % + invalid current CPU usage + + + 65536 + invalid current CPU usage + + + + + Huawei sever memoryEntireStatus + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + 5 + absence + + + 6 + unknown + + + + + Huawei sever systemHealth + + + 1 + ok + + + 2 + minor + + + 3 + major + + + 4 + critical + + + + + diff --git a/README.md b/README.md index 4d0310f..dd7ec36 100644 --- a/README.md +++ b/README.md @@ -1 +1,83 @@ -# Server_Management_Plugin_Zabbix \ No newline at end of file +# 1.Zabbix Template Introduction + +Zabbix is free and open-source software. After a Huawei iBMC or HMM template is imported and community names of servers are configured in the template, you can view server asset information, monitor server component alarms, and view server temperature change curves, CPU and memory usage change curves, and real-time power change diagrams on the Zabbix WebUI. + +**Zabbix Template Names:** +- Huawei - ValueMap V1.0.xml +- Huawei - iBMC V1.0.xml +- Huawei - HMM V1.0.xml + +**Supported Versions:** +- Zabbix 3.4.X +- Zabbix 4.0.X + +**Mapping Software:** +- MM910: (U54) 6.86D or later +- iBMC: 2.94 (U25) or later + + +# 2.Template Functions + +**iBMC Template:** +- Latest data: CPU, fan, hard disk, iBMC system information, memory, power supply, RAID controller card, and temperature +- Problems and triggers: system health status, CPU status, fan status, power supply status, hard disk status, and memory status +- Graphs: inlet temperature, power consumption, system CPU usage, averagePower, peakPower, presentSystemPower and system memory usage +- Inventory: type, name, OS, serial number, tag, and MAC address + +**HMM Template:** +- Latest data: CPU, fan, power supply, switch, system information, and temperature +- Problems and triggers: system health, chassis health, SMM health, blade status, fan status, power supply status, and switch status +- Graphs: ambient temperature, inlet temperature, LSW temperature, outlet temperature, real-time chassis power, blade CPU power, blade inlet temperature, real-time blade power, and blade system CPU usage +- Inventory: type, name, OS, serial number, tag, and MAC address + + +# 3.Template Configuration + +## 3.1 Configuring the iBMC +- Step 1 Log in to the iBMC WebUI. +- Step 2 Choose Configuration > System. +- Step 3 Select SNMPv2c, and set the community name. + +## 3.2 Configuring the HMM +- Step 1 Log in to the HMM CLI. +- Step 2 Run the following command to configure the SNMPv2c protocol: + smmset -l smm -d snmpconfig -v v2c enable +- Step 3 Enter y to confirm the configuration information. +- Step 4 Run the following command to enter the community name configuration: + smmset -l smm -d snmpconfig -v community +Step 5 Enter a community name. + +## 3.3 Importing and Configuring a Template +**3.3.1 Importing a ValueMap Template** +- Step 1 Log in to GitHub and obtain the Huawei Server ValueMap V1.1.xml template. +- Step 2 Log in to the Zabbix WebUI. +- Step 3 Choose Administration > General. +- Step 4 Select Value mapping from the drop-down list box. +- Step 5 Click Import to go to the import page. +- Step 6 Click Browse and select the template obtained in step 1. +- Step 7 Click Import. + +**3.3.2 Importing an iBMC or HMM Template** +- Step 1 Log in to GitHub and obtain the Huawei Server iBMC Template V1.1.xml or Huawei Server HMM Template V1.1.xml template. +- Step 2 Log in to the Zabbix WebUI. +- Step 3 Choose Configuration > Templates. +- Step 4 Click Import to go to the import page. +- Step 5 Click Browse and select the template obtained in step 1. +- Step 6 Click Import. + +**3.3.3 Configuring a Template** +- Step 1 Log in to the Zabbix WebUI. +- Step 2 Choose Configuration > Templates. +- Step 3 Select the template to be imported. +- Step 4 Select Macros and set {$SNMP_COMMUNITY} and {$SNMP_PORT}. + The value of {$SNMP_COMMUNITY} is the community name configured in section 3.1 or section 3.2. + Retain the default value 161 for {$SNMP_PORT}. +- Step 5 Click Update. + +**3.3.4 Adding a Host** +- Step 1 Log in to the Zabbix WebUI. +- Step 2 Choose Configuration > Hosts.The Hosts page is displayed. +- Step 3 Select Create Hosts to go to the creation page. + Set In groups under Groups to Huawei Server. + Set IP address and Port under SNMP interfaces to the iBMC IP address and 161 respectively. +- Step 4 Click Update.