上周在HP rx8640两个vpar分区上搭建ORACLE RAC时遇到问题,装CRS的时候无论怎样都不行,导致工作量延长近十二个小时,最终在老大的指点下解决问题。这次故障的解决,从另一个方面说明遇到问题时,眼光狭隘无益于问题的解决,只有纵观全局,方能做到有备无患。简单记录如下,就当是经验总结了,也提醒大家在碰到类似的问题的时候少走弯路。
前因后果这里不多说了,情况比较复杂。这里从装CRS遇到的问题说起。
在HPUX环境下搭建ORALCE RAC,要遵循如下的顺序:
1,HP MC
2,ORACLE CRS installation
3,ORACLE DATABASE software installation
4,ORACLE database installation
也就是先配置双机,然后分别安装crs、数据库软件,最后一部才是安装数据库。
在安装CRS的过程中,会进行安装前系统环境检查(检查内容包括核心参数、系统补丁等),主、私用网络的检测,心跳IP、VIP的检测,ocr、vote磁盘的检测,这些步骤完成之后,就开始进行crs软件的安装。安装到最后,会提示在各个节点的$CRS_HOME下以及$ORACLE_HOME/oraInventory运行root.sh脚本,启动CRSD,OCSSD,EVMD三个进程。如果都能正确启动,说明CRS软件安装成功(通过在$CRS_HOME/bin下运行“./crs_stat -t”命令可以查看)。有时候会因为主机名大小写的问题出现延迟或者停顿,从而导致root.sh脚本不能成功运行,需要通过在$CRS_HOME/bin路径下手工执行“./vipca”命令来进行一些设置解决。而我这里碰到的错误,却是另外一种情况。
在节点一报错信息如下:
root.sh
WARNING: directory '/home/db/oracle' is not owned by root
WARNING: directory '/home' is not owned by root
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/home/db/oracle' is not owned by root
WARNING: directory '/home' is not owned by root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node
:
node 1: storm51 storm51_hb storm51
node 2: storm52 storm52_hb storm52
Creating OCR keys for user 'root', privgrp 'sys'..
OCR create keys failed, OCR error 26.
Failed to initialize Oracle Cluster Registry for cluster 0
节点二运行root.sh的情况如下:
root.sh
WARNING: directory '/home/db/oracle' is not owned by root
WARNING: directory '/home' is not owned by root
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/home/db/oracle' is not owned by root
WARNING: directory '/home' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node
:
node 1: storm51 storm51_hb storm51
node 2: storm52 storm52_hb storm52
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
到这个地方就停止不动了,跟节点一碰到的情况截然不同。通常顺利的情况下,正确的状态是,两个节点各自运行root.sh脚本然后各自启动CRS所需进程。比较常见的是第二个节点碰到的情况,在启动CRS进程的时候停在那里不动,到时间超过600秒仍然没有反应,这就需要手工执行vipca来启动进程了。而至于第一个节点碰到的问题,以前没有碰到过,所以出现了解决盲点。
阅读(1751) | 评论(0) | 转发(0) |