# hwmgr status component
STATUS ACCESS INDICT
HWID: HOSTNAME SUMMARY STATE STATE LEVEL NAME
------------------------------------------------------------------------------
2: nbtax3 online available CPU0
3: nbtax3 online available CPU1
4: nbtax3 online available CPU2
5: nbtax3 online available CPU3
6: nbtax3 online available CPU4
7: nbtax3 warning offline(nosave) available CPU5
8: nbtax3 online available CPU6
9: nbtax3 online available CPU7
18: nbtax3 online available dmapi
19: nbtax3 online available scp
20: nbtax3 online available kevm
21: nbtax3 online available wfqbb0
22: nbtax3 online available wfqbb0slot0
23: nbtax3 online available wfiop0
24: nbtax3 online available wfiop0slot0
25: nbtax3 online available pci0
26: nbtax3 online available pci0slot1
28: nbtax3 online available pci0slot2
30: nbtax3 online available pci0slot3
32: nbtax3 online available pci0slot7
34: nbtax3 online available pci0slot15
36: nbtax3 online available isp0
37: nbtax3 online available scsi0
38: nbtax3 online available comet0
39: nbtax3 online available emx0
40: nbtax3 online available scsi1
41: nbtax3 online available isa0
42: nbtax3 online available isa0slot0
43: nbtax3 online available keyboard0
44: nbtax3 online available mouse0
45: nbtax3 online available isa0slot2
46: nbtax3 online available tty00
47: nbtax3 online available isa0slot3
48: nbtax3 online available tty01
49: nbtax3 online available isa0slot4
50: nbtax3 online available lp0
51: nbtax3 online available isa0slot5
52: nbtax3 online available fdi0
53: nbtax3 online available ata0
54: nbtax3 online available scsi2
55: nbtax3 online available wfiop0slot1
56: nbtax3 online available pci1
57: nbtax3 online available pci1slot4
61: nbtax3 online available pci64
62: nbtax3 online available pci64slot4
64: nbtax3 online available pci64slot5
66: nbtax3 online available ee0
67: nbtax3 online available ee1
70: nbtax3 online available wfiop0slot2
71: nbtax3 online available pci2
72: nbtax3 online available pci2slot3
74: nbtax3 online available emx2
75: nbtax3 online available scsi4
76: nbtax3 online available wfiop0slot3
77: nbtax3 online available pci3
78: nbtax3 online available pci3slot4
80: nbtax3 online available isp2
81: nbtax3 online available scsi5
82: nbtax3 online available wfqbb1
83: nbtax3 online available wfqbb1slot1
84: nbtax3 online available wfiop1
91: nbtax3 online available dsk0
92: nbtax3 online available dsk1
93: nbtax3 online available cdrom0
94: nbtax3 online available random
95: nbtax3 online available urandom
96: nbtax3 online available tape0
97: nbtax3 online available scp0
113: nbtax3 online available dsk6
116: nbtax3 online available AlphaServer GS160 6/940
118: nbtax3 online available dsk10
119: nbtax3 online available dsk11
# tail /var/adm/messages
Aug 18 12:52:52 nbtax3 vmunix: Machine check code = 0x100000098
Aug 18 12:52:52 nbtax3 vmunix: Ibox Status = 0000000000000000
Aug 18 12:52:52 nbtax3 vmunix: Dcache Status = 0000000000000000
Aug 18 12:52:52 nbtax3 vmunix: Cbox Address = 0000001100640e40
Aug 18 12:52:52 nbtax3 vmunix: Fill Syndrome 1 = 0000000000000000
Aug 18 12:52:52 nbtax3 vmunix: Fill Syndrome 0 = 00000000000000f9
Aug 18 12:52:52 nbtax3 vmunix: Cbox Status = 0000000000000010
Aug 18 12:52:52 nbtax3 vmunix: EV6 captured status of Bcache mode = 000000000000000c
Aug 18 12:52:52 nbtax3 vmunix: EV6 Exception Address = ffffffff001d0dc4
Aug 18 12:52:52 nbtax3 vmunix: EV6 Interrupt Enablement and Current Processor mode = 0000007ee0000000
Aug 18 12:52:52 nbtax3 vmunix: EV6 Interrupt Summary Register = 0000000000000000
Aug 18 12:52:52 nbtax3 vmunix: EV6 TBmiss or Fault status = 0000000000000290
Aug 18 12:52:52 nbtax3 vmunix: EV6 PAL Base Address = 00000011ffff0000
Aug 18 12:52:52 nbtax3 vmunix: EV6 Ibox control = fffffe0017306396
Aug 18 12:52:52 nbtax3 vmunix: EV6 Ibox Process_context = 0000100000000004
Aug 18 12:52:52 nbtax3 vmunix: CPU 5 is prevented from being rebooted.
Aug 18 12:52:52 nbtax3 vmunix: The system must be reset or power cycled to clear this state.
Aug 18 12:52:52 nbtax3 vmunix: panic (cpu 5): Processor Machine Check
Aug 18 12:52:52 nbtax3 vmunix: syncing disks...
Aug 18 12:52:52 nbtax3 vmunix: Alpha boot: available memory from 0x108518000 to 0x11fffc0000
Aug 18 12:52:52 nbtax3 vmunix: Compaq Tru64 UNIX V5.1B (Rev. 2650); Fri Oct 22 14:27:36 CST 2004
Aug 18 12:52:52 nbtax3 vmunix: physical memory = 16384.00 megabytes.
Aug 18 12:52:52 nbtax3 vmunix: available memory = 7954.28 megabytes.
Aug 18 12:52:52 nbtax3 vmunix: using 10458 buffers containing 81.70 megabytes of memory
Aug 18 12:52:52 nbtax3 vmunix: Master cpu at slot 0
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 1
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 2
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 3
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 4
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 6
Aug 18 12:52:52 nbtax3 vmunix: Starting secondary cpu 7
Aug 18 12:52:52 nbtax3 vmunix: The system must be reset or power cycled to clear this state.
分析binary.errlog结果如下
Problem Found: Bcache or System double bit error reported by CPU5, CPU Slot1 of SoftQbb1 (HardQbb1) at Tue 19 Aug 2008 13:36:43 GMT+08:00
Problem Report Times:
Event Time: Mon 18 Aug 2008 12:47:45 GMT+08:00
Report Time: Tue 19 Aug 2008 13:36:43 GMT+08:00
Expiration Time: Mon 18 Aug 2008 12:47:45 GMT+08:00
Managed Entity:
System Name : nbtax3
System Type : AlphaServer GS160 6/940
System Serial : G2E692
OS Type : Tru64 UNIX/Compaq Tru64 UNIX V5.1B (Rev. 2650)
Service Obligation Data:
Service Obligation: Valid
Service Obligation Number:
System Serial Number:
Service Provider Company Name: Hewlett-Packard Company
Brief Description:
Bcache or System double bit error reported by CPU5, CPU Slot1 of SoftQbb1 (HardQbb1)
Callout ID:
Theory Code : 0x060902000007B005
HQBB.Ent.Uce : 1.1.6
Severity:
1
Reporting Node:
nbtax3
Full Description:
One or more CPUs detected a Bcache or System double bit error on a Dcache or
Icache fill operation. This CPU version is not able to further filter the
problem. If no system errors occured in the shadow of this event the assumption
is made the problem source was the Bcache. The Bcache is located on the CPU
module. This callout took place because analysis found no evidence of non-cpu
uncorrectable system errors that could have caused this event. Note:
No uncorrectable system errors where detected in the shadow of this error.
No correctable Bcache ECC error(s) occured on this CPU in the past 24 hours.
While this event was fatal, it is not recommended that a FRU replacement be
done on the first occurrence of the failure. There is a very low probability of
a reoccurrence on the same hardware component. FRU information has been
provided to allow failure correlation in the unlikely event of a repeat
failure. Please discuss this failure with your support center for the correct
course of action.
FRU List:
Probability : High
Fru Manufacturer : -
Fru Model : -
Fru PartNumber : B4166-AA.A4
Fru SerialNumber : AY13208037
Fru FirmwareRev : SROM V8.0-9
Fru SiteLocation : -
Fru CabinetId : 800mm System Cabinet 1
Fru Position : System Cabinet 1, FrontSide, lower System Box (Hard Qbb1)
Fru Chassis : System Box 1 (Hard Qbb 0/1)
Fru Assembly : Qbb backplane
Fru Subassembly : -
Fru Slot : CPU Module 1
Probability : Low
Fru Manufacturer : -
Fru Model : -
Fru PartNumber : 54-25043-02.E03
Fru SerialNumber : SM03410295
Fru FirmwareRev : -
Fru SiteLocation : -
Fru CabinetId : 800mm System Cabinet 1
Fru Position : System Cabinet 1, FrontSide, lower System Box (Hard Qbb1)
Fru Chassis : System Box 1 (Hard Qbb 0/1)
Fru Assembly : -
Fru Subassembly : -
Fru Slot : -
Evidence:
Time of Event : 18 Aug 2008 12:47:45 GMT+08:00 (Mon)
Unique ID : 16320.45399
Analysis Revision : GS320_UCE_RULE V7.1 (20feb2006)
Notifications:
All
Analysis Mode:
Manual
SEA Version:
System Event Analyzer for Windows V4.4.4 (Build 18)
WCC Version:
Web-Based Enterprise Services Common Components for
Windows V4.4.4 (Build 18), member of Web-Based
Enterprise Services Suite for Windows V4.4.4 (Build
18)
解决
nbtax3# hwmgr online componet -id 7
CPU number 5 disabled
The system must be reset or power cycled to clear this state.
hwmgr: CPU5 is now online
nbtax3# hwmgr status component|more
[K
STATUS ACCESS INDICT
HWID: HOSTNAME SUMMARY STATE STATE LEVEL NAME
------------------------------------------------------------------------------
2: nbtax3 online available CPU0
3: nbtax3 online available CPU1
4: nbtax3 online available CPU2
5: nbtax3 online available CPU3
6: nbtax3 online available CPU4
7: nbtax3 online available CPU5
8: nbtax3 online available CPU6
9: nbtax3 online available CPU7
18: nbtax3 online available dmapi
19: nbtax3 online available scp
20: nbtax3 online available kevm
21: nbtax3 online available wfqbb0
22: nbtax3 online available wfqbb0slot0
23: nbtax3 online available wfiop0
24: nbtax3 online available wfiop0slot0
25: nbtax3 online available pci0
26: nbtax3 online available pci0slot1
28: nbtax3 online available pci0slot2
30: nbtax3 online available pci0slot3
重启一次。还不知道为何会出现这种状况,一年多以前联通一台GS60E也出现这样的现象,使用hwmgr online componet后到现在依一直运行良好。
阅读(4189) | 评论(0) | 转发(0) |