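After reinstalling CRS again and again, the same error kept coming back. In the past one reinstall would have been enough to get past it, but this time nothing worked, which was puzzling. There had been another oddity before this: CRS and the Oracle software both installed successfully, but when I went to create the database with DBCA I hit something I had never seen before. On the last DBCA screen, clicking the "Next" button did nothing at all, so the database could not be created. With no other option, I reinstalled the operating system, laid out the file systems, configured the network, set up the two-node cluster and so on, and only after that long trek got back to installing CRS, where I ran into the error above. That is the situation.
In theory, reinstalling the operating system should have cleared the earlier problems, but instead things got worse. I passed the fault to my boss; after a while he said it was fixed, and he had installed the CRS software for me. I thought about it for a long time and could not work out where the problem had been, but to keep to the schedule I pressed on. This time the final "Next" in DBCA went through, but database creation then reported that the raw device "/dev/vgdata/rredo1_1" could not be recognized, and the installation aborted. I checked the permissions and size of the raw device carefully and found nothing wrong. Running DBCA again produced the same error, and the installation could not proceed.
I reported this to my boss as well. After a while he told me he had found the cause: the CRS installation error and this database creation error belong to the same class of problem. He sent me the following: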
--- Logical extents ---
LE PV1 PE1 Status 1
00000 /dev/dsk/c7t13d7 00192 current
00001 ??? 00192 current
00002 /dev/dsk/c7t14d1 00192 current
00003 /dev/dsk/c7t14d2 00192 current
00004 /dev/dsk/c7t14d3 00192 current
00005 /dev/dsk/c7t14d4 00192 current
00006 /dev/dsk/c7t14d5 00192 current
00007 /dev/dsk/c7t13d7 00193 current
00008 ??? 00193 current
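This output was captured with the lvdisplay command. It indicates that the raw device has bad blocks: in the PV1 column, "???" appears where a physical volume name should be. The raw device is striped, which is why it maps to so many disks.
Going by this, the earlier CRS installation failure should be related to the raw devices /dev/vgdata/ocr and /dev/vgdata/vote: they must have the same bad blocks, which is why the CRS installation could not proceed. Now /dev/vgdata/redo1_1 has the same problem. How could there be so many bad blocks? Puzzled, I ran lvdisplay against the other raw devices as well, and every one of them showed the problem.
With the cause identified, the next steps are to fix the fault and to analyze why it happened.
First, the fix. For this kind of fault, a workable approach is to remove the raw device from the volume group, then recreate it with striping in the original volume group. The raw device redo4_3 is used as the example below:
1. Check the state of the faulty volume: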
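Checking every raw device by eye gets tedious, so the scan can be scripted. The following is a minimal sketch, not the method used at the time: it assumes the `lvdisplay -v` text has already been captured into a string, and the function name `unresolved_extents` is hypothetical. It only looks for rows in the "Logical extents" section whose PV1 column is "???".

```python
def unresolved_extents(lvdisplay_output):
    """Return the LE numbers whose PV1 column is '???' in `lvdisplay -v` text."""
    missing = []
    in_extents = False
    for line in lvdisplay_output.splitlines():
        stripped = line.strip()
        # Section headers look like "--- Logical extents ---".
        if stripped.startswith("---"):
            in_extents = "Logical extents" in stripped
            continue
        if not in_extents:
            continue
        fields = stripped.split()
        # Data rows have the form: LE, PV1, PE1, status.
        if len(fields) >= 4 and fields[0].isdigit() and fields[1] == "???":
            missing.append(int(fields[0]))
    return missing


# Sample copied from the lvdisplay output shown above.
sample = """\
--- Logical extents ---
LE PV1 PE1 Status 1
00000 /dev/dsk/c7t13d7 00192 current
00001 ??? 00192 current
00002 /dev/dsk/c7t14d1 00192 current
00003 /dev/dsk/c7t14d2 00192 current
00004 /dev/dsk/c7t14d3 00192 current
00005 /dev/dsk/c7t14d4 00192 current
00006 /dev/dsk/c7t14d5 00192 current
00007 /dev/dsk/c7t13d7 00193 current
00008 ??? 00193 current
"""

print(unresolved_extents(sample))  # prints [1, 8]
```

An empty list would mean that no extent in the captured output has an unidentified physical volume.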
root@storm51:/dev/vgdata#lvdisplay -v /dev/vgdata/redo4_3
--- Logical volumes ---
LV Name /dev/vgdata/redo4_3
VG Name /dev/vgdata
LV Permission read/write
LV Status available/syncd
Mirror copies 0
Consistency Recovery MWC
Schedule striped
LV Size (Mbytes) 672
Current LE 21
Allocated PE 21
Stripes 7
Stripe Size (Kbytes) 512
Bad block on
Allocation strict
IO Timeout (Seconds) default
--- Distribution of logical volume ---
PV Name LE on PV PE on PV
/dev/dsk/c7t13d7 3 3
/dev/dsk/c7t14d1 3 3
/dev/dsk/c7t14d2 3 3
/dev/dsk/c7t14d3 3 3
/dev/dsk/c7t14d4 3 3
/dev/dsk/c7t14d5 3 3
--- Logical extents ---
LE PV1 PE1 Status 1
00000 /dev/dsk/c7t13d7 00192 current
00001 ??? 00192 current
00002 /dev/dsk/c7t14d1 00192 current
00003 /dev/dsk/c7t14d2 00192 current
00004 /dev/dsk/c7t14d3 00192 current
00005 /dev/dsk/c7t14d4 00192 current
00006 /dev/dsk/c7t14d5 00192 current
00007 /dev/dsk/c7t13d7 00193 current
00008 ??? 00193 current
00009 /dev/dsk/c7t14d1 00193 current
00010 /dev/dsk/c7t14d2 00193 current
00011 /dev/dsk/c7t14d3 00193 current
00012 /dev/dsk/c7t14d4 00193 current
00013 /dev/dsk/c7t14d5 00193 current
00014 /dev/dsk/c7t13d7 00194 current
00015 ??? 00194 current
00016 /dev/dsk/c7t14d1 00194 current
00017 /dev/dsk/c7t14d2 00194 current
00018 /dev/dsk/c7t14d3 00194 current
00019 /dev/dsk/c7t14d4 00194 current
00020 /dev/dsk/c7t14d5 00194 current
2. Set the volume group to non-cluster mode and activate it in exclusive mode:
3. Remove the faulty raw device:
root@storm51:/dev/vgdata#lvremove /dev/vgdata/redo4_3
Current path "/dev/dsk/c10t13d7" is an alternate link, skip.
Current path "/dev/dsk/c10t14d1" is an alternate link, skip.
Current path "/dev/dsk/c10t14d2" is an alternate link, skip.
Current path "/dev/dsk/c10t14d3" is an alternate link, skip.
Current path "/dev/dsk/c10t14d4" is an alternate link, skip.
Current path "/dev/dsk/c10t14d5" is an alternate link, skip.
Current path "/dev/dsk/c10t14d0" is an alternate link, skip.
The logical volume "/dev/vgdata/redo4_3" is not empty;
do you really want to delete the logical volume (y/n) : y
Logical volume "/dev/vgdata/redo4_3" has been successfully removed.
vgcfgbackup: /etc/lvmtab is out of date with the running kernel:
Kernel indicates 15 disks for "/dev/vgdata"; /etc/lvmtab has 14 disks.
Cannot proceed with backup.
4. Recreate the raw device with striping:
root@storm51:/dev/vgdata#lvcreate -i 7 -I 512 -n redo4_3 /dev/vgdata
Logical volume "/dev/vgdata/redo4_3" has been successfully created with
character device "/dev/vgdata/rredo4_3".
root@storm51:/dev/vgdata#lvextend -L 512 /dev/vgdata/redo4_3 /dev/dsk/c7t13d7 /dev/dsk/c7t14d0 /dev/dsk/c7t14d1 /dev/dsk/c7t14d2 /dev/dsk/c7t14d3 /dev/dsk/c7t14d4 /dev/dsk/c7t14d5
Warning: rounding up logical volume size to extent boundary at size "672" MB for striping.
Logical volume "/dev/vgdata/redo4_3" has been successfully extended.
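The warning above explains why a request for 512 MB ends up as 672 MB. The arithmetic can be sketched as follows, under two assumptions that the post does not state outright: the physical extent size is 32 MB (inferred from 672 MB / 21 extents in the lvdisplay output), and a striped volume is rounded up to a whole number of extents per stripe. The function name `striped_lv_size_mb` is hypothetical.

```python
def striped_lv_size_mb(requested_mb, extent_mb, stripes):
    """Round a requested size up to a whole number of extents per stripe.

    Returns (size_in_mb, extent_count).
    """
    # Round the request up to whole extents (ceiling division).
    extents = -(-requested_mb // extent_mb)
    # Round the extent count up to a multiple of the stripe count,
    # so every striped disk carries the same number of extents.
    extents = -(-extents // stripes) * stripes
    return extents * extent_mb, extents


# 512 MB requested, 32 MB extents (assumed), 7 stripes as in "lvcreate -i 7".
print(striped_lv_size_mb(512, 32, 7))  # prints (672, 21)
```

With these inputs, 512 MB is 16 extents, and the next multiple of 7 is 21 extents, that is 672 MB with 3 extents on each of the 7 disks. This agrees with "Current LE 21" and "LV Size (Mbytes) 672" in the output.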
5. Check whether the recreated raw device still shows the earlier error:
root@storm51:/dev/vgdata#lvdisplay -v /dev/vgdata/redo4_3
--- Logical volumes ---
LV Name /dev/vgdata/redo4_3
VG Name /dev/vgdata
LV Permission read/write
LV Status available/syncd
Mirror copies 0
Consistency Recovery MWC
Schedule striped
LV Size (Mbytes) 672
Current LE 21
Allocated PE 21
Stripes 7
Stripe Size (Kbytes) 512
Bad block on
Allocation strict
IO Timeout (Seconds) default
--- Distribution of logical volume ---
PV Name LE on PV PE on PV
/dev/dsk/c7t13d7 3 3
/dev/dsk/c7t14d0 3 3
/dev/dsk/c7t14d1 3 3
/dev/dsk/c7t14d2 3 3
/dev/dsk/c7t14d3 3 3
/dev/dsk/c7t14d4 3 3
/dev/dsk/c7t14d5 3 3
--- Logical extents ---
LE PV1 PE1 Status 1
00000 /dev/dsk/c7t13d7 00192 current
00001 /dev/dsk/c7t14d0 00008 current
00002 /dev/dsk/c7t14d1 00192 current
00003 /dev/dsk/c7t14d2 00192 current
00004 /dev/dsk/c7t14d3 00192 current
00005 /dev/dsk/c7t14d4 00192 current
00006 /dev/dsk/c7t14d5 00192 current
00007 /dev/dsk/c7t13d7 00193 current
00008 /dev/dsk/c7t14d0 00009 current
00009 /dev/dsk/c7t14d1 00193 current
00010 /dev/dsk/c7t14d2 00193 current
00011 /dev/dsk/c7t14d3 00193 current
00012 /dev/dsk/c7t14d4 00193 current
00013 /dev/dsk/c7t14d5 00193 current
00014 /dev/dsk/c7t13d7 00194 current
00015 /dev/dsk/c7t14d0 00010 current
00016 /dev/dsk/c7t14d1 00194 current
00017 /dev/dsk/c7t14d2 00194 current
00018 /dev/dsk/c7t14d3 00194 current
00019 /dev/dsk/c7t14d4 00194 current
00020 /dev/dsk/c7t14d5 00194 current
If the Logical extents section no longer shows the earlier error (no "???" entries), the raw device is fine.
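A second check is visible in the same outputs. In step 1, "Current LE" is 21, but the "Distribution of logical volume" section lists only six disks with 3 extents each (18). After the rebuild in step 5, seven disks are listed and the total is 21. The sketch below compares the two numbers. It is only an illustration that assumes the `lvdisplay -v` text is available as a string, and the name `distribution_gap` is hypothetical.

```python
def distribution_gap(lvdisplay_output):
    """Return Current LE minus the sum of 'LE on PV' in the distribution section."""
    current_le = None
    le_on_pvs = 0
    in_distribution = False
    for line in lvdisplay_output.splitlines():
        fields = line.split()
        # Section headers look like "--- Distribution of logical volume ---".
        if line.strip().startswith("---"):
            in_distribution = "Distribution of logical volume" in line
            continue
        if fields[:2] == ["Current", "LE"]:
            current_le = int(fields[2])
        elif in_distribution and len(fields) == 3 and fields[0].startswith("/dev/"):
            # Rows have the form: PV name, LE on PV, PE on PV.
            le_on_pvs += int(fields[1])
    if current_le is None:
        raise ValueError("no 'Current LE' line found")
    return current_le - le_on_pvs


# Trimmed copy of the step 1 output (faulty volume).
before = """\
Current LE 21
--- Distribution of logical volume ---
PV Name LE on PV PE on PV
/dev/dsk/c7t13d7 3 3
/dev/dsk/c7t14d1 3 3
/dev/dsk/c7t14d2 3 3
/dev/dsk/c7t14d3 3 3
/dev/dsk/c7t14d4 3 3
/dev/dsk/c7t14d5 3 3
--- Logical extents ---
"""

# Step 5 output differs by one extra disk in the distribution section.
after = before.replace(
    "/dev/dsk/c7t14d1 3 3", "/dev/dsk/c7t14d0 3 3\n/dev/dsk/c7t14d1 3 3"
)

print(distribution_gap(before))  # prints 3
print(distribution_gap(after))   # prints 0
```

A gap of zero means every logical extent is accounted for by a named disk. A non-zero gap matches the number of "???" rows in the Logical extents section.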