07 Oracle Clusterware Postinstallation Procedures

References

Oracle® Clusterware Installation Guide 11g Release 1 (11.1) for Linux

Procedure

Followed the procedure in Oracle Clusterware Postinstallation Procedures.

Required Postinstallation Tasks

Reviewed Required Postinstallation Tasks.

Back Up the Voting Disk After Installation

Following Back Up the Voting Disk After Installation,

[root@penrith1 ~]# cd /u01/app/crs [root@penrith1 crs]# ls product [root@penrith1 crs]# mkdir backups [root@penrith1 crs]# cd backups [root@penrith1 backups]# dd if=/u04/voting of=voting_20120103.bak 41024+0 records in 41024+0 records out 21004288 bytes (21 MB) copied, 39.7536 seconds, 528 kB/s [root@penrith1 backups]# ls -l total 20540 -rw-r--r-- 1 root root 21004288 Jan 3 10:58 voting_20120103.bak

Configure Input/Output Fencing

Following Configure Input/Output Fencing, I find that the hangcheck configuration does not exist in /etc/modprobe.conf. The configuration is entered as follows:

[root@penrith1 ~]# echo "options hangcheck-timer hangcheck_tick=1 \ hangcheck_margin=10 hangcheck_reboot=1" >>/etc/modprobe.conf [root@penrith1 ~]# grep hang /etc/modprobe.conf options hangcheck-timer hangcheck_tick=1 hangcheck_margin=10 hangcheck_reboot=1 [root@penrith1 ~]# /sbin/modprobe -v hangcheck-timer insmod /lib/modules/2.6.18-164.el5/kernel/drivers/char/hangcheck-timer.ko \ hangcheck_tick=1 hangcheck_margin=10 hangcheck_reboot=1 [root@penrith1 ~]# /sbin/lsmod | grep hang hangcheck_timer 8025 0 [root@penrith1 ~]# echo "/sbin/modprobe -v hangcheck-timer" >>/etc/rc.d/rc.local [root@penrith1 ~]# cat /etc/rc.d/rc.local #!/bin/sh # # This script will be executed *after* all the other init scripts. # You can put your own initialization stuff in here if you don't # want to do the full Sys V style init stuff. touch /var/lock/subsys/local /sbin/modprobe -v hangcheck-timer

Setting the diagwait parameter to 13 for oprocd. First, on PENRITH1,

[root@penrith1 ~]# cd /u01/app/crs/product/11.1.0/crs/bin/ [root@penrith1 bin]# ./crsctl stop crs Stopping resources. This could take several minutes. Successfully stopped Oracle Clusterware resources Stopping Cluster Synchronization Services. Shutting down the Cluster Synchronization Services daemon. Shutdown request successfully issued.

Repeated this on PENRITH2. Then, on PENRITH1,

[root@penrith1 bin]# ./oprocd stop Jan 03 11:23:31.209 | ERR | failed to connect to daemon, errno(111)

Repeated this on PENRITH2 and got the same result. Then, on PENRITH1,

[root@penrith1 bin]# ./crsctl set css diagwait 13 -force Configuration parameter diagwait is now set to 13.

On PENRITH1,

[root@penrith1 bin]# ./crsctl start crs Attempting to start Oracle Clusterware stack The CRS stack will be started shortly

Repeated this on PENRITH2.

And both hosts started rebooting. The problem appears to be that I did not wait for one node to start up crs completely before starting crs on the other node.

Download and Install Patch Updates

Skipped Download and Install Patch Updates for the time being as I want to complete the 11.1.0.6 installation first before doing the 11.1.0.7 patch.

Recommended Postinstallation Tasks

Followed Recommended Postinstallation Tasks and ran CVU Postinstallation Check as follows:

su - crs cd /u00/product/11.1.0.6/clusterware ./runcluvfy.sh stage -post crsinst -n penrith1,penrith2 -verbose \ >/tmp/cluvfy_post_crsinst.lst

The log file, cluvfy_post_crsinst.lst, has been uploaded as an attachment to this page.

And the root.sh script was backed up as follows on both PENRITH1 and PENRITH2:

[crs@penrith1 ~]$ mkdir -p backup/11.1.0.6 [crs@penrith1 ~]$ cp /u01/app/crs/product/11.1.0/crs/root.sh backup/11.1.0.6/