今天碰到一个客户打电话来说,他们去年买的P550因为要升级SYBASE,听取了SYBASE供货商的建议,把08升级到11了。结果重启主机的时候发现几乎挂起,等待将近半小时才把系统起来;运行cfgmgr和powerpath config(配置emc设备的命令)时需要将近20分钟才能完成。
他们怀疑是EMC POWERPATH不兼容导致,需要我的支持。
因为阵列是由我们提供,因此我负责寻找阵列和POWERPATH方面的原因。
从电话中得知,AIX没升级之前阵列及POWERPATH都无异常(POWERPATH为5.3版本),难道AIX53TL11真的不兼容5.3版本的POWERPATH?可是不会啊,5.3是最新的了;忽然想起该版本还有个补丁,不知道是否已经打上?
马上回电话过去---powerpath version输出,果然没有打补丁。
登陆POWERLINK,下载POWERPATH5.3的SP1补丁传过去。
补丁打完,尝试重启和cfgmgr,问题依然没有解决。
于是又想,目前TL11是AIX5.3的最新版本,之前没有碰到该版本的主机,难道是AIX本身的BUG?
上EMC支持库,果然给我发现:原来AIX5.3TL11及AIX6.1TL04版本下要安装几个补丁才能正常访问非MPIO存储设备。也即是说,以上故障根本就是AIX的BUG,不管连接什么存储,只要没有使用MPIO,那么AIX5.3TL11及AIX6.1TL04都无法正常使用这些存储设备。
======================================
Environment:
OS: IBM AIX 6.1 TL4 SP1
OS: IBM AIX 6.1 TL2
OS: IBM AIX 5.3 TL11
OS: VIOS 2.1.10.22 (FP22)
EMC SW: PowerPath 5.1
EMC SW: PowerPath 5.3.1
Product: Symmetrix
Product: CLARiiON
Problem:
System hangs on cfgmgr when attempting to configure SAN devices.
Reserve Lock, which is needed in VIO, Oracle and PowerPath environments, changes back to yes when it should be no.
cause:
The cause is non-MPIO (hdiskpower) devices. FC DISKS will define a new Hdisk instance upon each reboot if a PVID stamp exists on the disk, but no PVID attributes exists in ODM. Reserve Lock is then changed back to yes, and a reserve is placed on all the new Hdisk instances. The fix was created to match non-MPIO FC Disks that have a PVID stamp against the connection information of the device.
For AIX v6.1
Apply APARs IZ63813, IZ64056, IZ64133
IZ63818, IZ64056, IZ64133 – Are planned to be part of AIX 6.1 TL4 SP2, which is targeted to be available Feb 2010
All three APARs/IFIXes are currently available for this problem, and must be loaded as a group for AIX 6.1 T.L.4.
Obtain IZ64056 through normal download channels of IBM.
For IZ63813 & IZ64133, please obtain them from the public IBM FTP site as described:
ftp public.dhe.ibm.com
login = anonymous
password = email address
cd aix/efixes/iz63813
cd aix/efixes/iz64133
For AIX v5.3
Apply APARs IZ63977, IZ63808.
IZ63977 & IZ63808 - Scheduled to be part of AIX 5.3 base TL12, currently targeted to be available in April 2010.
These are two APARs are currently available for this problem, and must be loaded as a group for this level AIX 5.3 T.L.11.and are available
Obtain IZ63977 through normal download channels of IBM.
For IZ63808, please obtain it from the public IBM FTP site as described:
ftp public.dhe.ibm.com
login = anonymous
password = email address
cd aix/efixes/iz63808
NOTE: If running PowerPath 5.3, SP1 MUST be installed as part of the fix
=====================================================
从以上内容得知:IZ63808这个补丁能在FTP站点下载,但是IZ63977需要到FIXCENTER上通过WEB来下载。
结果发现,IZ63808是可以下载,但是根本装不上,而IZ63977根本就找不到!
以下是IZ63808安装过程
# ls -l IZ*
-rw-r----- 1 root system 615393 Jan 18 23:22 IZ63808.epkg.Z
# instfix -k IZ63808 -d .
instfix: There are no filesets on the media for IZ63808.
instfix: There are no filesets on the media for the requested Fix IDs.
# uncompress IZ63808.epkg.Z
# instfix -T -d .
没有任何输出,即表示根本没有这个补丁存在。
难道是上传时没用2进制模式?再试,自动模式2进制模式都尝试过了,都不行;又到IBM站点去找,原来IBM也有文档提到这个BUG
=====================
IZ63977: NON-MPIO DISK WITHOUT PVID ATTRIBUTE CAUSES NEW DISK DEFINES.
APAR statusClosed as program error.
Error descriptionnon-MPIO FC Disks will define a new hdisk instance on each
reboot, if a PVID stamp exists on the disk, but no PVID
attribute exists in ODM.
Local fix
Problem summarynon-MPIO FC Disks will define a new hdisk instance on each
reboot, if a PVID stamp exists on the disk, but no PVID
attribute exists in ODM.
Problem conclusionProperly match non-MPIO FC Disks which have a PVID stamp
against the connection information of the device.
====================
主机工程师致电IBM800,得到一个让人听了马上晕倒的答案:这些补丁目前还没正式对外发布!
既然没有发布,为什么其中一个补丁可以下载,而且IBM和EMC站点都提到这个BUG及其解决方案?
最后,只得回退到原来的版本(幸亏系统有备份),并升级到10后,以上问题再没出现过。
呵呵,现在知道备份有多重要了吧。
另外,除非必要,最好不要使用最新版本的软件,否则很可能当小白鼠。
阅读(2947) | 评论(0) | 转发(0) |