HP Proliant HW monitor

patvdv — Fri, 20/06/2008 - 20:54
About
The hppro_hwmon Perl script queries the HP Linux hpasmcli and hpacucli command-line tools to check or show the health of your Proliant hardware. It checks the following components:
- Fans
- Memory DIMMs
- Power Supplies
- Server
- Temperature Controls
- RAID Controller
- Physical Drives
- Logical Drives
The default script comes with a standard set of components to check but can be easily customized to fit your needs.
Note: The script has been tested on standard SCSI-based systems only; YMMV with SAS/SATA based systems.
Script: hppro_hwmon
Before running the script, check the @command and @recipients variables and customize them if necessary!
Download hppro_hwmon.gz (GZIP archive)
Latest fixes (20-Aug-2008):
- Added better regexp checking for failed conditions
- Fix for wrong check on "onk" instead of "nok"
- Removed superfluous output
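As an illustration of the kind of regexp check the fixes above refer to, here is a minimal Python sketch (the script itself is Perl and its actual patterns are not shown here). The keyword list, the FAILURE_RE name and the find_failures helper are assumptions made for this example, and the second sample line is invented to show a failing drive.

```python
import re

# Hypothetical failure keywords; the real script's patterns may differ.
# \b keeps "nok" from matching inside a longer word.
FAILURE_RE = re.compile(r"\b(failed|failure|nok|degraded)\b", re.IGNORECASE)


def find_failures(output):
    """Return the lines of a tool's output that contain a failure keyword."""
    return [line for line in output.splitlines() if FAILURE_RE.search(line)]


# First line is taken from the article's listing, second one is made up.
sample = """\
physicaldrive 0:0 (port 0:id 0, 72.8 GB): OK
physicaldrive 0:1 (port 0:id 1, 72.8 GB): Failed
"""

for line in find_failures(sample):
    print("ALERT:", line)
```

Anchoring the keywords with word boundaries and matching case-insensitively is one way to avoid both missed and false hits; a mistyped keyword such as "onk" would simply never match.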
Dependencies/Requirements
- Linux: any version that is officially supported by HP for their Proliant platform (see Proliant support matrices)
- Perl: any recent version should do
- HP System Health Application and Insight Management Agents: free download available from HP website.
- HP Array Configuration Utility CLI for Linux: free download available from HP website.
Usage
The script takes one of two options and should be executed as the root user:
-s (show): show the current hardware health/status
-c (check): report any failed hardware or problematic events (on STDOUT) and send an alert mail.
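The two modes can be pictured with a rough Python sketch. This is not the script's code: the COMMANDS table only copies three of the check names and commands from the listing in this article, and show, check, parse_args, run_command and fake_run are hypothetical names. Sending the alert mail is left out.

```python
import argparse
import shlex
import subprocess

# Check names and commands copied from the -s listing; only a subset.
COMMANDS = {
    "acu_controller": "/opt/compaq/hpacucli/bld/hpacucli controller all show status",
    "asm_fans": '/sbin/hpasmcli -s "SHOW FANS"',
    "asm_power": '/sbin/hpasmcli -s "SHOW POWERSUPPLY"',
}


def run_command(command):
    """Run one command on a real system and return its standard output."""
    result = subprocess.run(shlex.split(command), capture_output=True, text=True)
    return result.stdout


def show(run):
    """-s style report: check name, command executed, then the raw output."""
    parts = []
    for name, command in COMMANDS.items():
        parts.append(f"[{name}]")
        parts.append(f"=> command executed: {command}")
        parts.append(run(command).rstrip())
    return "\n".join(parts)


def check(run, looks_bad):
    """-c style result: (check name, line) pairs for every suspicious line."""
    problems = []
    for name, command in COMMANDS.items():
        for line in run(command).splitlines():
            if looks_bad(line):
                problems.append((name, line))
    return problems


def parse_args(argv):
    """Accept exactly one of -s or -c."""
    parser = argparse.ArgumentParser(prog="hwmon_sketch")
    mode = parser.add_mutually_exclusive_group(required=True)
    mode.add_argument("-s", dest="mode", action="store_const", const="show")
    mode.add_argument("-c", dest="mode", action="store_const", const="check")
    return parser.parse_args(argv)


def fake_run(command):
    """Stand-in for run_command so the sketch works without the HP tools."""
    return "Condition: Failed\n" if "POWERSUPPLY" in command else "Status: OK\n"


args = parse_args(["-c"])
if args.mode == "show":
    print(show(fake_run))
else:
    print(check(fake_run, lambda line: "failed" in line.lower()))
```

Passing the runner in as a parameter keeps the sketch testable on a machine without hpasmcli or hpacucli; on a Proliant server run_command would be passed instead of fake_run.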
Example 1: using -s
The listing below shows the name of each check (between square brackets), the command executed and the resulting status.
[root@tanchico kudos]# ./hppro_hwmon -s
[acu_controller]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller all show status
Smart Array 6i in Slot 0
Controller Status: OK
Cache Status: OK
[acu_logical_drive]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller slot=0 logicaldrive all show status
logicaldrive 1 (67.8 GB, RAID 1+0): OK
[acu_physical_drive]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller slot=0 physicaldrive all show status
physicaldrive 0:0 (port 0:id 0, 72.8 GB): OK
physicaldrive 0:1 (port 0:id 1, 72.8 GB): OK
[asm_asr]
=> command executed: /sbin/hpasmcli -s "SHOW ASR"
ASR timeout is 10 minutes.
ASR is currently enabled.
[asm_boot]
=> command executed: /sbin/hpasmcli -s "SHOW BOOT"
First boot device is: CD-ROM.
One time boot device is: Not set.
[asm_dimm]
=> command executed: /sbin/hpasmcli -s "SHOW DIMM"
DIMM Configuration
------------------
Cartridge #: 0
Module #: 1
Present: Yes
Form Factor: 9h
Memory Type: 12h
Size: 1024 MB
Speed: 400 MHz
Status: Ok
Cartridge #: 0
Module #: 2
Present: Yes
Form Factor: 9h
Memory Type: 12h
Size: 1024 MB
Speed: 400 MHz
Status: Ok
[asm_fans]
=> command executed: /sbin/hpasmcli -s "SHOW FANS"
Fan Location Present Speed of max Redundant Partner Hot-pluggable
--- -------- ------- ----- ------ --------- ------- -------------
#1 PROCESSOR_ZONE Yes NORMAL 49% Yes 0 No
#2 SYSTEM Yes NORMAL 49% Yes 0 No
[asm_ht]
=> command executed: /sbin/hpasmcli -s "SHOW HT"
Processor hyper-threading is currently enabled.
[asm_iml]
=> command executed: /sbin/hpasmcli -s "SHOW IML"
Event: 0 Added: 02/14/2006 00:01
REPAIRED: POST Messages - POST Error: 1785-Drive Array not Configured.
Event: 1 Added: 10/25/2006 14:08
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 2 Added: 10/25/2006 14:13
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 3 Added: 10/25/2006 14:25
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 4 Added: 10/25/2006 16:15
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Press q to quit, c to continue without pagination
or any other key to see the next screen.:
[asm_ipl]
=> command executed: /sbin/hpasmcli -s "SHOW IPL"
IPL (Standard Boot Order)
-------------------------
#0 CDROM
#1 Floppy
#2 USBKEY
#3 HDD
#4 PXE
[asm_name]
=> command executed: /sbin/hpasmcli -s "SHOW NAME"
Server name is: bobo.voiswebhosting.eu
[asm_power]
=> command executed: /sbin/hpasmcli -s "SHOW POWERSUPPLY"
Power supply #1
Present : Yes
Redundant: No
Condition: Ok
Hotplug : Supported
Power supply #2
Power Supply not present
[asm_pxe]
=> command executed: /sbin/hpasmcli -s "SHOW PXE"
PXE boot status (2 Embedded NICs):
NIC1: PXE enabled
NIC2: PXE disabled
[asm_serial_bios]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL BIOS"
BIOS console redirection port is currently set to COM1/9600.
[asm_serial_embedded]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL EMBEDDED"
Embedded serial port A: COM1
Embedded serial port B: Disabled
[asm_serial_virtual]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL VIRTUAL"
The virtual serial port is currently COM2.
[asm_server]
=> command executed: /sbin/hpasmcli -s "SHOW SERVER"
System : ProLiant DL360 G4p
Serial No. : GB8633HEWD
ROM version : P54 02/14/2006
iLo present : Yes
Embedded NICs : 2
NIC1 MAC: 00:18:71:e3:de:4a
NIC2 MAC: 00:18:71:e3:de:49
Processor: 0
Name : Intel Xeon
Stepping : 10
Speed : 3400 MHz
Bus : 800 MHz
Socket : 1
Level2 Cache : 2048 KBytes
Status : Ok
Processor total : 1
Memory installed : 2048 MBytes
ECC supported : Yes
[asm_temperature]
=> command executed: /sbin/hpasmcli -s "SHOW TEMP"
Sensor Location Temp Threshold
------ -------- ---- ---------
#0 SYSTEM_BD - -
#1 I/O_ZONE 30C/86F 63C/145F
#2 CPU#1 34C/93F 85C/185F
#3 CPU#2 - 85C/185F
#4 POWER_SUPPLY_BAY 28C/82F 48C/118F
#5 SYSTEM_BD 22C/71F 41C/105F
[asm_uid]
=> command executed: /sbin/hpasmcli -s "SHOW UID"
UID is currently off.
[asm_wol]
=> command executed: /sbin/hpasmcli -s "SHOW WOL"
Wake-On-Lan is currently enabled.
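Output like the SHOW TEMP table above is easy to post-process. The following Python sketch parses the rows and computes how many degrees Celsius each sensor is below its threshold. It assumes sensor locations contain no spaces and that a "-" means no reading; parse_temps, celsius and headroom are names made up for this example and are not part of hppro_hwmon.

```python
import re

# One data row: "#<sensor> <location> <temp> <threshold>"
ROW_RE = re.compile(r"^#(\d+)\s+(\S+)\s+(\S+)\s+(\S+)$")


def celsius(field):
    """Convert a field such as '30C/86F' to 30; return None for '-'."""
    match = re.match(r"(-?\d+)C/", field)
    return int(match.group(1)) if match else None


def parse_temps(output):
    """Turn SHOW TEMP output into a list of dicts, skipping header lines."""
    rows = []
    for line in output.splitlines():
        match = ROW_RE.match(line.strip())
        if match:
            rows.append({
                "sensor": int(match.group(1)),
                "location": match.group(2),
                "temp": celsius(match.group(3)),
                "threshold": celsius(match.group(4)),
            })
    return rows


def headroom(rows):
    """Map sensor number to (threshold - temp) where both values exist."""
    return {
        row["sensor"]: row["threshold"] - row["temp"]
        for row in rows
        if row["temp"] is not None and row["threshold"] is not None
    }


# Sample copied from the listing above.
sample = """\
Sensor Location Temp Threshold
------ -------- ---- ---------
#0 SYSTEM_BD - -
#1 I/O_ZONE 30C/86F 63C/145F
#2 CPU#1 34C/93F 85C/185F
#3 CPU#2 - 85C/185F
#4 POWER_SUPPLY_BAY 28C/82F 48C/118F
#5 SYSTEM_BD 22C/71F 41C/105F
"""

print(headroom(parse_temps(sample)))
```

The result is keyed by sensor number rather than location because SYSTEM_BD appears twice in the table.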
Example 2: using -c
This example shows a clean and trouble-free run:
[root@tanchico kudos]# ./hppro_hwmon -c
INFO: no hardware alarm events found! Be happy :)

Great Command
Anonymous — Sat, 07/01/2012 - 07:30
Thanks, it's awesome. But it would be very helpful if we could get the same hardware status at the OS level without using the HP utility commands.
Does anyone know of standard OS-level commands or paths that report the full hardware status the way the HP utilities do?
sweet
Anonymous — Thu, 14/10/2010 - 15:53
sweet
Great!
Anonymous — Tue, 02/03/2010 - 20:56
Really a great job! I spent a lot of time searching for a simple and powerful tool like this!
Mirko
Awesome! Thank you!
Anonymous — Thu, 31/12/2009 - 18:49
Awesome! Thank you!
Excellent script
Anonymous — Thu, 31/12/2009 - 17:38
I just happened across this post and script. Great work, thank you.
Download Corrupt
Anonymous — Wed, 27/05/2009 - 04:42
Is the attached tgz file OK? I am unable to extract it:
tar -xf hppro_hwmon.tar
tar: This does not look like a tar archive
tar: Skipping to next header
tar: Error exit delayed from previous errors
Not corrupt but wrong file extension
patvdv — Wed, 27/05/2009 - 08:06
Hi,
The file is not corrupt but has a misleading extension: it is not a TAR archive but a GZIP file.
I have changed this now so that it is less confusing. Thanks for the hint!
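For anyone who hits a similar mismatch, the file type can be checked from its first bytes rather than its extension. Below is a small Python sketch, assuming only the documented magic numbers (gzip data starts with the bytes 1f 8b; POSIX tar archives carry the string "ustar" at offset 257). The helper names and the sample payload are made up for the illustration.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"


def looks_like_gzip(data):
    """True if the data starts with the gzip magic number."""
    return data[:2] == GZIP_MAGIC


def looks_like_tar(data):
    """True if the data has the POSIX 'ustar' marker at offset 257."""
    return data[257:262] == b"ustar"


# Stand-in for the download: a gzip-compressed one-line script.
original = b"#!/usr/bin/perl\n"
payload = gzip.compress(original)

print("gzip:", looks_like_gzip(payload))
print("tar:", looks_like_tar(payload))
print(gzip.decompress(payload) == original)
```

With a real download, the bytes would come from reading the file in binary mode, and a plain gunzip on the command line unpacks it.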