HP Proliant HW monitor

patvdv — Fri, 20/06/2008 - 20:54
About
The hppro_hwmon Perl script queries the HP Linux hpasmcli and hpacucli command-line tools to check or show the health of your Proliant hardware. It checks the following components:
- Fans
- Memory DIMMs
- Power Supplies
- Server
- Temperature Controls
- RAID Controller
- Physical Drives
- Logical Drives
The default script comes with a standard set of components to check but can be easily customized to fit your needs.
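To illustrate what a customizable set of checks can look like, here is a minimal Python sketch (the script itself is Perl and may be organized quite differently). The check names and command lines are copied from the sample output in Example 1; `CHECKS`, `run_command` and `collect` are hypothetical names used only for this illustration.

```python
import subprocess

# Check names and commands taken from the sample output in Example 1.
# Assumption: the real Perl script keeps its own list, which may look different.
CHECKS = {
    "asm_fans": ["/sbin/hpasmcli", "-s", "SHOW FANS"],
    "asm_dimm": ["/sbin/hpasmcli", "-s", "SHOW DIMM"],
    "asm_power": ["/sbin/hpasmcli", "-s", "SHOW POWERSUPPLY"],
    "asm_temperature": ["/sbin/hpasmcli", "-s", "SHOW TEMP"],
    "acu_controller": [
        "/opt/compaq/hpacucli/bld/hpacucli",
        "controller", "all", "show", "status",
    ],
}


def run_command(argv):
    """Run one command and return its standard output as text."""
    result = subprocess.run(argv, capture_output=True, text=True, check=False)
    return result.stdout


def collect(checks, runner=run_command):
    """Run every check and return a {name: output} dictionary."""
    return {name: runner(argv) for name, argv in checks.items()}
```

Adding or removing a component is then a matter of editing one dictionary entry. The `runner` parameter makes it possible to try the logic on a machine without the HP tools installed.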
Note: The script has been tested on standard SCSI-based systems only; YMMV with SAS/SATA based systems.
Script: hppro_hwmon
Make sure you check and, if necessary, customize the @command and @recipients variables before running the script!
Download hppro_hwmon.gz (GZIP archive)
Latest fixes (20-Aug-2008):
- Added better regexp checking for failed conditions
- Fix for wrong check on "onk" instead of "nok"
- Removed superfluous output
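The kind of regexp checking mentioned in the fixes above can be sketched roughly as follows. This is a Python illustration under stated assumptions, not the script's actual Perl patterns: it assumes a healthy component reports "OK" or "Ok" after the last colon on its line, as in the sample output in Example 1, and that only labels containing "status", "condition" or "drive" carry a health state. `find_failures` is a hypothetical helper name.

```python
import re

# Last "label: state" pair on a line. The label may itself contain colons,
# as in "physicaldrive 0:0 (port 0:id 0, 72.8 GB): OK".
PAIR = re.compile(r"^(?P<label>.+):\s*(?P<state>[^:]+)$")
# Assumption: only these labels carry a health state worth checking.
HEALTH_LABEL = re.compile(r"status|condition|drive", re.IGNORECASE)
# Anchored and case-insensitive, so both "OK" and "Ok" count as healthy.
HEALTHY = re.compile(r"^ok$", re.IGNORECASE)


def find_failures(output):
    """Return (label, state) for every health line whose state is not OK."""
    failures = []
    for line in output.splitlines():
        match = PAIR.match(line.strip())
        if not match:
            continue
        label = match.group("label").strip()
        state = match.group("state").strip()
        if HEALTH_LABEL.search(label) and not HEALTHY.match(state):
            failures.append((label, state))
    return failures
```

Matching the whole state against an anchored pattern, rather than searching for a substring, avoids treating a value that merely contains "ok" somewhere as healthy.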
Dependencies/Requirements
- Linux: any version that is officially supported by HP for their Proliant platform (see Proliant support matrices)
- Perl: any recent version should do
- HP System Health Application and Insight Management Agents: free download available from HP website.
- HP Array Configuration Utility CLI for Linux: free download available from HP website.
Usage
The script can be run with either of 2 options and should be executed as the root user:
-s (show): show the current hardware health/status
-c (check): report any failed hardware or problematic events (on STDOUT) and send an alert mail.
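As a rough idea of what the alert step in check mode involves, the following Python sketch builds (but does not send) a mail listing failed components. It is an illustration only: the subject line, the default sender address and the `build_alert` helper are hypothetical and not taken from the script, which is written in Perl and uses its own @recipients variable.

```python
from email.message import EmailMessage


def build_alert(hostname, failures, recipients, sender="root@localhost"):
    """Build an alert mail from a list of (component, state) pairs."""
    msg = EmailMessage()
    # Hypothetical subject format, chosen for this sketch only.
    msg["Subject"] = f"Hardware alert on {hostname}: {len(failures)} problem(s)"
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    # One line per failed component in the mail body.
    msg.set_content("\n".join(f"{label}: {state}" for label, state in failures))
    return msg
```

The resulting message could then be handed to a local mail server, for example with `smtplib.SMTP("localhost").send_message(msg)`, assuming one is listening on the host.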
Example 1: using -s
The listing below shows each check name (between square brackets), the command executed, and the resulting status.
[root@tanchico kudos]# ./hppro_hwmon -s
[acu_controller]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller all show status
Smart Array 6i in Slot 0
Controller Status: OK
Cache Status: OK
[acu_logical_drive]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller slot=0 logicaldrive all show status
logicaldrive 1 (67.8 GB, RAID 1+0): OK
[acu_physical_drive]
=> command executed: /opt/compaq/hpacucli/bld/hpacucli controller slot=0 physicaldrive all show status
physicaldrive 0:0 (port 0:id 0, 72.8 GB): OK
physicaldrive 0:1 (port 0:id 1, 72.8 GB): OK
[asm_asr]
=> command executed: /sbin/hpasmcli -s "SHOW ASR"
ASR timeout is 10 minutes.
ASR is currently enabled.
[asm_boot]
=> command executed: /sbin/hpasmcli -s "SHOW BOOT"
First boot device is: CD-ROM.
One time boot device is: Not set.
[asm_dimm]
=> command executed: /sbin/hpasmcli -s "SHOW DIMM"
DIMM Configuration
------------------
Cartridge #: 0
Module #: 1
Present: Yes
Form Factor: 9h
Memory Type: 12h
Size: 1024 MB
Speed: 400 MHz
Status: Ok
Cartridge #: 0
Module #: 2
Present: Yes
Form Factor: 9h
Memory Type: 12h
Size: 1024 MB
Speed: 400 MHz
Status: Ok
[asm_fans]
=> command executed: /sbin/hpasmcli -s "SHOW FANS"
Fan Location Present Speed of max Redundant Partner Hot-pluggable
--- -------- ------- ----- ------ --------- ------- -------------
#1 PROCESSOR_ZONE Yes NORMAL 49% Yes 0 No
#2 SYSTEM Yes NORMAL 49% Yes 0 No
[asm_ht]
=> command executed: /sbin/hpasmcli -s "SHOW HT"
Processor hyper-threading is currently enabled.
[asm_iml]
=> command executed: /sbin/hpasmcli -s "SHOW IML"
Event: 0 Added: 02/14/2006 00:01
REPAIRED: POST Messages - POST Error: 1785-Drive Array not Configured.
Event: 1 Added: 10/25/2006 14:08
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 2 Added: 10/25/2006 14:13
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 3 Added: 10/25/2006 14:25
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Event: 4 Added: 10/25/2006 16:15
INFO: Maintenance Note - CFG2HTML Proliant Server report successfully created.
Press q to quit, c to continue without pagination
or any other key to see the next screen.:
[asm_ipl]
=> command executed: /sbin/hpasmcli -s "SHOW IPL"
IPL (Standard Boot Order)
-------------------------
#0 CDROM
#1 Floppy
#2 USBKEY
#3 HDD
#4 PXE
[asm_name]
=> command executed: /sbin/hpasmcli -s "SHOW NAME"
Server name is: bobo.voiswebhosting.eu
[asm_power]
=> command executed: /sbin/hpasmcli -s "SHOW POWERSUPPLY"
Power supply #1
Present : Yes
Redundant: No
Condition: Ok
Hotplug : Supported
Power supply #2
Power Supply not present
[asm_pxe]
=> command executed: /sbin/hpasmcli -s "SHOW PXE"
PXE boot status (2 Embedded NICs):
NIC1: PXE enabled
NIC2: PXE disabled
[asm_serial_bios]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL BIOS"
BIOS console redirection port is currently set to COM1/9600.
[asm_serial_embedded]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL EMBEDDED"
Embedded serial port A: COM1
Embedded serial port B: Disabled
[asm_serial_virtual]
=> command executed: /sbin/hpasmcli -s "SHOW SERIAL VIRTUAL"
The virtual serial port is currently COM2.
[asm_server]
=> command executed: /sbin/hpasmcli -s "SHOW SERVER"
System : ProLiant DL360 G4p
Serial No. : GB8633HEWD
ROM version : P54 02/14/2006
iLo present : Yes
Embedded NICs : 2
NIC1 MAC: 00:18:71:e3:de:4a
NIC2 MAC: 00:18:71:e3:de:49
Processor: 0
Name : Intel Xeon
Stepping : 10
Speed : 3400 MHz
Bus : 800 MHz
Socket : 1
Level2 Cache : 2048 KBytes
Status : Ok
Processor total : 1
Memory installed : 2048 MBytes
ECC supported : Yes
[asm_temperature]
=> command executed: /sbin/hpasmcli -s "SHOW TEMP"
Sensor Location Temp Threshold
------ -------- ---- ---------
#0 SYSTEM_BD - -
#1 I/O_ZONE 30C/86F 63C/145F
#2 CPU#1 34C/93F 85C/185F
#3 CPU#2 - 85C/185F
#4 POWER_SUPPLY_BAY 28C/82F 48C/118F
#5 SYSTEM_BD 22C/71F 41C/105F
[asm_uid]
=> command executed: /sbin/hpasmcli -s "SHOW UID"
UID is currently off.
[asm_wol]
=> command executed: /sbin/hpasmcli -s "SHOW WOL"
Wake-On-Lan is currently enabled.
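The SHOW TEMP listing above lends itself to a simple automated comparison of each reading against its threshold. The Python sketch below shows one way to do that. It is an assumption that a check like this is useful on your hardware, and it is not a description of what hppro_hwmon does internally; `over_threshold` and its `margin` parameter are hypothetical.

```python
import re

# One sensor row: "#<id> <location> <temp> <threshold>", separated by whitespace.
ROW = re.compile(
    r"^#(?P<id>\d+)\s+(?P<location>\S+)\s+(?P<temp>\S+)\s+(?P<limit>\S+)$"
)


def celsius(field):
    """Return the Celsius part of a field like '30C/86F', or None for '-'."""
    match = re.match(r"^(\d+)C/", field)
    return int(match.group(1)) if match else None


def over_threshold(output, margin=0):
    """Return sensors whose reading is within `margin` degrees C of the threshold."""
    hot = []
    for line in output.splitlines():
        row = ROW.match(line.strip())
        if not row:
            continue  # header, separator or unrelated line
        temp, limit = celsius(row["temp"]), celsius(row["limit"])
        if temp is not None and limit is not None and temp >= limit - margin:
            hot.append((row["location"], temp, limit))
    return hot
```

Sensors that report "-" for either value are skipped, since there is nothing to compare.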
Example 2: using -c
This example shows a clean, trouble-free run:
[root@tanchico kudos]# ./hppro_hwmon -c
INFO: no hardware alarm events found! Be happy :)
Great!
Anonymous — Tue, 02/03/2010 - 20:56
Really a great job! I spent a lot of time searching for a simple and powerful tool like this!
Mirko
Awesome! Thank you!
Anonymous — Thu, 31/12/2009 - 18:49
Awesome! Thank you!
Excellent script
Anonymous — Thu, 31/12/2009 - 17:38
I just happened across this post and script. Great work, thank you.
Download Corrupt
Anonymous — Wed, 27/05/2009 - 04:42
Is the attached tgz file ok? I am unable to extract it:
tar -xf hppro_hwmon.tar
tar: This does not look like a tar archive
tar: Skipping to next header
tar: Error exit delayed from previous errors
Not corrupt but wrong file extension
patvdv — Wed, 27/05/2009 - 08:06
Hi,
The file is not corrupt but actually has a misleading extension. It is not a TAR but rather a GZIP:
I have changed this now so that it is less confusing. Thanks for the hint!