Search

Sunday, August 14, 2016

Oracle Cluster Registry (OCR) in 11gR2 -- demistify

Oracle Cluster Registry (OCR) in 11gR2

Oracle Cluster Registry (OCR) is the critical component in Oracle RAC.
Ø  OCR records cluster configuration information.  If it fails, the entire clustered environment of Oracle RAC is affected and a possible outage is a result.
Ø  It is the central repository for CRS, which stores its metadata, configuration and state information for all clusters defined in the clusterware.
Ø  It is the cluster registry maintains application resources and their availability within the RAC environment.
Ø  It also stores information of CRS daemons and cluster managed applications.
What is stored in OCR
We have the introduction of OCR, now we will see what are stored in OCR file.
Ø  Node membership information, i.e, which nodes are part of the cluster.
Ø  Software version
Ø  Location of 11g voting disk
Ø  Server pools
Ø  Status of cluster resources such as RAC databases, listeners, instances and services.
·         Server up/down
·         Network up/down
·         Database up/down
·         Instance up/down
·         Listener up/down
Ø  Configuration of the cluster resources like RAC databases, listeners, instances and services.
·         Dependencies,
·         Management Policies (automatic/manual)
·         Callout scripts
·         Retries
Ø  Cluster database instance to node mapping
Ø  ASM instance, disk groups, etc
Ø  CRS application resource profiles such as VIP address, service, etc.
Ø  Database services characteristics eg., preferred/available nodes, TAF policy, load balancing goal, etc
Ø  Information about clusterware processes
Ø  Information about interaction and management of 3rd party applications controlled by CRS
Ø  Communication settings where the clusterware daemons or background process listen.
Ø  Information about OCR backups.
Let us see the contents in OCR file.
[root@rac1 ~]# ocrconfig -manualbackup
rac2     2014/01/18 01:03:40     /u01/app/grid/product/11.2.0.3/grid_1/cdata/cluster01/backup_20140118_010340.ocr
[root@rac2 ~]#  strings /u01/app/grid/product/11.2.0.3/grid_1/cdata/cluster01/backup_20140118_010340.ocr| grep -v type |grep ora!
ora!LISTENER!lsnr
ora!host02!vip
rora!rac1!vip
;ora!oc4j
6ora!LISTENER_SCAN3!lsnr
ora!LISTENER_SCAN2!lsnr
ora!LISTENER_SCAN1!lsnr
ora!scan3!vip
ora!scan2!vip
ora!scan1!vip
ora!gns
ora!gns!vip
ora!registry!acfs
ora!DATA!dg
dora!asm
_ora!eons
ora!ons
ora!gsd
ora!net1!network
Who is updating OCR
It is updated and maintained by many client applications.
Ø  CSSd during startup of cluster – to update the status of the servers.
Ø  CSSd during node addition/deletion – to add/delete nodes
Ø  CRSd about status of nodes during failure/reconfiguration
Ø  OUI  - Oracle universal Installer during installation/upgradation/deletion/addition
Ø  Srvctl – ( to manage clusters and rac databases)
Ø  Cluster control utility – crsctl (to manage cluster /local resources)
Ø  Enterprise Manager (EM)
Ø  Database Configuration Assistant (DBCA)
Ø  Database upgrade Assistant (DBUA)
Ø  Network Configuration Assistant (NETCA)
Ø  ASM configuration assistant (ASMCA)
How the update is performed in OCR.
1.       Each node in the cluster will have the copy of OCR in memory for better performance and each node is responsible for updating the OCR if required.
2.       CRSd process is responsible for reading and writing to the OCR files as well as refreshing the local OCR cache and caches on the other nodes in the cluster.
3.       Oracle uses distributed shared cache architecture during cluster management to optimize queries against the cluster repository and at the same time, each node maintains a copy of the OCR in memory.
4.       Oracle clusterware uses the background process to access the OCR cache.
5.       Only one CRSd process (designated as master) in the cluster performs any read/write activity. If any new information read by the CRSd master process, then it refresh the local OCR cache and the OCR cache on the others nodes of the cluster.
6.       As the OCR cache is distributed across all nodes in the cluster, OCR clients like srvctl,crsctl,etc will communicate directly with the OCR process on the node to get required information.
7.       Clients communicate via the local CRSd process for any updates on the physical OCR binary file.
During the above process, OCRCONFIG command cannot modify the OCR configuration information for nodes that are shut down or for nodes on which oracle clusterware is not running.  So, we need to avoid shutting down the nodes while modifying the OCR using the ocrconfig command.  We need to perform a repair on the stopped node before it can bring online to join the cluster.
OCRCONFIG –repair command change the OCR configuration only on the node from which we are executing this command.
 Example :  if the OCR mirror was relocated to a disk in rac2 which is down from /dev/raw/raw2  in rac1, then use ocrconfig –repair ocrmirror /dev/raw/raw2 command on rac2 while the CRS stack is down on that node to repair its OCR configuration.
Purpose of OCR
Ø  Oracle clusterware reads the ocr.loc for the location of the registry and to know which application resources needs to be started and the nodes on which to start them.
Ø  Used for bootstrap the CSS with port info, nodes in the cluster and  other similar informations.
Ø  CRSd and other clusterware daemons function is to define and manage resources managed by clusterware. Resources have profiles that define metadata about them. This metadata is stored in OCR. The CRS reads the OCR and manages the application resources, starts, stops, and monitor and manages their failover.
Ø  Maintains and tracks the information pertaining to the definition, availability and current state of the services.
Ø  Implements the workload balancing and continuous availability features of services.
Ø  Generates event during state changes.
Ø  Maintain configuration profiles of resources in OCR.
Ø  Records the currently known state of the cluster at regular intervals and provides the same when reqeuested by client application like srvctl,crsctl,etc.
How the information is stored in OCR
Ø  OCR uses a file-based repository to store configuration information in a series of key-value pairs, using a directory tree-like structure.
Ø  It contains information pertaining to all tiers in the clustered database.
Ø  Various parameters are stored as name-value pairs used and maintained at different levels of the architecture.
Each tier is managed and administered by daemon process at different levels with appropriate privileges.
Eg. All system level resources or application definitions would require root, or superuser to start or stop.
Those defined at the database level will require dba privileges.
Where and how should OCR be stored ?
Ø  Location of the OCR is found in a file on each individual node of the cluster. It varies depending on the flavors of unix. In Linux , /etc/oracle/ocr.loc
Ø  OCR must reside on a shared disk that is accessible by all nodes in the cluster. It cannot be stored on raw filesystems as in 10g as it is deprecated.  So, we are left with cluster filesystem (CFS) and ASM. But CFS will cost high and may not be an option. So, better to go for ASM.  OCR and voting disks can be exclusively available on any disksgroups .
Ø  OCR is striped and mirrored (if we have redundancy in the external) similar to other database files.
Ø  OCR is replicated across all the underlying disks of the diskgroup, so failure of the disk does not bring the failure of the diskgroup.
Ø  In 11gR2, we can have upto 5 OCR copies.
Ø  Since it is in shared location, it can be administered from any nodes irrespective of the node on which the registry was created.
Ø  A small disk of around 300MB-500MB is a good choice.
Utilities to manage OCR
Add an OCR file
—————-
Add an OCR file to an ASM diskgroup called +DATA
ocrconfig –add  +DATA
Moving the OCR
————–
Move an existing OCR file to another location :
ocrconfig –replace /u01/app/oracle/ocr –replacement +DATA
Removing an OCR location
————————
- requires that at least one other OCR file must remain online.
ocrconfig –delete +DATA
OCR Backups
ü  Oracle Clusterware 11g Release 2 backs up the OCR automatically every four hours on a schedule that is dependent on when the node started (not clock time).
ü  OCR backups are made available in $GRID_HOME/cdata/<cluster name> directory on the node performing the backups.
ü  One node known as the master node is dedicated to these backups, but in case master node is down, some other node may become the master. Hence, backups could be spread across nodes due to outages.
These backups are named as follows:
-4-hour backups   (3 max) –backup00.ocr, backup01.ocr, and backup02.ocr.
-Daily backups     (2 max) – day.ocr and day_.ocr
-Weekly backups (2 max) – week.ocr and week_.ocr
ü  It is recommended that OCR backups may be placed on a shared location which can be configured using ocrconfig -backuploc <new location> command.
ü  Oracle Clusterware maintains the last three backups, overwriting the older backups. Thus, you will have 3 4-hour backups, the current one, one four hours old and one eight hours old.
ü  Therefore no additional clean-up tasks are required of the DBA.
ü  Oracle Clusterware will also take a backup at the end of the day. The last two of these backups are retained.
ü  Finally, at the end of each week Oracle will perform another backup, and again the last two of these backups are retained. You should make sure that your routine file system backups backup the OCR location.
ü  Note that RMAN does not backup the OCR.
You can use the ocrconfig command to view the current OCR backups:

#ocrconfig –showbackup auto