1、集群没有启动起来,首先查看了一下grid的日志,日志保存在$GRID_HOME/log/<节点名称> 目录下,文件的名字是alert<节点名称>.log
例如我的日志文件就是/g01/11ggrid/app/11.2.0/grid/log/orcl1/alertorcl1.log
2、查看日志文件,在日志文件里我们发现了发下错误
-
2014-09-10 15:24:09.169
-
[ohasd(2450)]CRS-0704:Oracle High Availability Service aborted due to Oracle Local Registry error [PROCL-26: Error while accessing the physical storage Operating System error [No such file or directory] [2]]. Details at (:OHAS00106:) in /g01/11ggrid/app/11.2.0/grid/log/orcl1/ohasd/ohasd.log.
-
[client(2465)]CRS-10001:CRS-10132: No msg for has:crs-10132 [10][60]
3、于是查看ohasd.log文件,有如下错误提示
-
2014-09-10 15:23:40.326: [ default][758777632] OHASD Daemon Starting. Command string :restart
-
2014-09-10 15:23:40.328: [ default][758777632] Initializing OLR
-
2014-09-10 15:23:40.329: [ OCROSD][758777632]utopen:6m': failed in stat OCR file/disk /g01/11ggrid/app/11.2.0/grid/cdata/orcl1.olr, errno=2, os err string=No such file or directory
-
2014-09-10 15:23:40.329: [ OCROSD][758777632]utopen:7: failed to open any OCR file/disk, errno=2, os err string=No such file or directory
-
2014-09-10 15:23:40.330: [ OCRRAW][758777632]proprinit: Could not open raw device
-
2014-09-10 15:23:40.330: [ OCRAPI][758777632]a_init:16!: Backend init unsuccessful : [26]
-
2014-09-10 15:23:40.330: [ CRSOCR][758777632] OCR context init failure. Error: PROCL-26: Error while accessing the physical storage Operating System error [No such file or directory] [2]
-
2014-09-10 15:23:40.331: [ default][758777632] Created alert : (:OHAS00106:) : OLR initialization failed, error: PROCL-26: Error while accessing the physical storage Operating System error [No such file or directory] [2]
-
2014-09-10 15:23:40.331: [ default][758777632][PANIC] OHASD exiting; Could not init OLR
-
2014-09-10 15:23:40.331: [ default][758777632] Done.
4、问题比较明显是orcl1.lor文件找不到,到目录下面查看果然没有这个文件,于是开始查找资料恢复。
首先确定原来orcl1.lor文件的配置信息
-
[root@orcl1 orcl1]# ocrcheck -config -local
-
Oracle Local Registry configuration is :
-
Device/File Name : /g01/11ggrid/app/11.2.0/grid/cdata/orcl1.olr
5、使用touch命令创建文件
-
[root@orcl1 orcl1]# touch /g01/11ggrid/app/11.2.0/grid/cdata/orcl1.olr
-
[root@orcl1 orcl1]# cd ..
-
[root@orcl1 cdata]# ll
-
total 12
-
drwxr-xr-x 2 grid oinstall 4096 Sep 4 12:11 localhost
-
drwxr-xr-x 2 grid oinstall 4096 Sep 4 12:49 orcl1
-
-rw-r--r-- 1 root root 0 Sep 10 22:33 orcl1.olr
-
drwxrwxr-x 2 grid oinstall 4096 Sep 4 12:11 orcl-cluster
6、恢复olr文件内容。默认olr文件会在目录下的<节点名>文件夹中,保存一下备份
-
[root@orcl1 cdata]# ocrconfig -local -restore ./orcl1/backup_20140904_124926.olr
-
[root@orcl1 cdata]# ll
-
total 6636
-
drwxr-xr-x 2 grid oinstall 4096 Sep 4 12:11 localhost
-
drwxr-xr-x 2 grid oinstall 4096 Sep 4 12:49 orcl1
-
-rw-r--r-- 1 root root 272756736 Sep 10 22:34 orcl1.olr
-
drwxrwxr-x 2 grid oinstall 4096 Sep 4 12:11 orcl-cluster
7、启动集群
-
[root@orcl1 cdata]# crsctl start crs
-
[root@orcl1 cdata]# crsctl stat res -t
-
--------------------------------------------------------------------------------
-
NAME TARGET STATE SERVER STATE_DETAILS
-
--------------------------------------------------------------------------------
-
Local Resources
-
--------------------------------------------------------------------------------
-
ora.LISTENER.lsnr
-
ONLINE ONLINE orcl1
-
ONLINE ONLINE orcl2
-
ora.SYSTEMDG.dg
-
ONLINE ONLINE orcl1
-
ONLINE ONLINE orcl2
-
ora.asm
-
ONLINE ONLINE orcl1 Started
-
ONLINE ONLINE orcl2 Started
-
ora.gsd
-
OFFLINE OFFLINE orcl1
-
OFFLINE OFFLINE orcl2
-
ora.net1.network
-
ONLINE ONLINE orcl1
-
ONLINE ONLINE orcl2
-
ora.ons
-
ONLINE ONLINE orcl1
-
ONLINE ONLINE orcl2
-
--------------------------------------------------------------------------------
-
Cluster Resources
-
--------------------------------------------------------------------------------
-
ora.LISTENER_SCAN1.lsnr
-
1 ONLINE ONLINE orcl2
-
ora.cvu
-
1 ONLINE ONLINE orcl2
-
ora.oc4j
-
1 ONLINE ONLINE orcl2
-
ora.orcl1.vip
-
1 ONLINE ONLINE orcl1
-
ora.orcl2.vip
-
1 ONLINE ONLINE orcl2
-
ora.scan1.vip
-
1 ONLINE ONLINE orcl2
-
[root@orcl1 cdata]#
本次操作感谢提供的资料
阅读(3705) | 评论(0) | 转发(0) |