资讯专栏INFORMATION COLUMN

Reboot-less node fencing in Oracle Clusterware 11g

lufficc / 2187人阅读

摘要:基于上面的现象,在上搜索,发现在版本之后引入了叫的特性,即不使用重启节点的方式进行。下面引用官方文档对的介绍。

在进行一次RAC的高可用性测试时,当private网卡的网线被拔掉之后,没有出现传说中的有一个节点被CRS强制重启,取而代之的是node2上面的ASM实例和RDBMS实例被关闭;当网线被重新插上时,node2上面的ASM实例和RDBMS实例自动重新启动。

基于上面的现象,在google上搜索,发现oracle在11.2.0.2版本之后引入了叫reboot-less node fencing的特性,即不使用重启节点的方式进行fencing。

下面引用oracle官方文档对reboot-less node fencing的介绍。

As mentioned, Oracle Clusterware uses a STONITH (Shoot The Other Node In The Head) comparable fencing algorithm to ensure data integrity in cases, in which cluster integrity is endangered and split-brain scenarios need to be prevented. In case of Oracle Clusterware, this means that a local process enforces the removal of one or more nodes from the cluster (fencing). 

Until Oracle Clusterware 11g Release 2, Patch Set One (11.2.0.2) the fencing of a node was performed by a “fast reboot” of the respective server. A “fast reboot” in this context summarizes a shutdown and restart procedure that does not wait for any IO to finish or for file systems to synchronize on shutdown. With Oracle Clusterware 11g Release 2, Patch Set One (11.2.0.2) this mechanism has been changed in order to prevent such a reboot as much as possible. 

Already with Oracle Clusterware 11g Release 2 this algorithm was improved so that failures of certain, Oracle RAC-required subcomponents in the cluster do not necessarily cause an immediate fencing (reboot) of a node. Instead, an attempt is made to clean up the failure within the cluster and to restart the failed subcomponent. Only, if a cleanup of the failed component appears to be unsuccessful, a node reboot is performed in order to force a cleanup. 

With Oracle Clusterware 11g Release 2, Patch Set One (11.2.0.2) further improvements were made so that Oracle Clusterware will try to prevent a split-brain without rebooting the node. It thereby implements a standing requirement from those customers, who were requesting to preserve the node and to prevent a reboot, since the node runs applications not managed by Oracle Clusterware, which would otherwise be forcibly shut down by the reboot of a node. 

With the new algorithm and when a decision is made to evict a node from the cluster, Oracle Clusterware will first attempt to shutdown all resources on the machine that was chosen to be the subject of an eviction. Especially IO generating processes are killed and it is ensured that those processes are completely stopped before continuing. If, for some reason, not all resources can be stopped or IO generating processes cannot be stopped completely, Oracle Clusterware will still perform a reboot or use IPMI to forcibly evict the node from the cluster. 

If all resources can be stopped and all IO generating processes can be killed, Oracle Clusterware will shut itself down on the respective node, but will attempt to restart after the stack has been stopped. The restart is initiated by the Oracle High Availability Services Daemon, which has been introduced with Oracle Clusterware 11g Release 2. 

文章版权归作者所有,未经允许请勿转载,若此文章存在违规行为,您可以联系管理员删除。

转载请注明本文地址:https://www.ucloud.cn/yun/35242.html

相关文章

  • Reboot-less node fencing in Oracle Clusterware 11g

    摘要:基于上面的现象,在上搜索,发现在版本之后引入了叫的特性,即不使用重启节点的方式进行。下面引用官方文档对的介绍。 在进行一次RAC的高可用性测试时,当private网卡的网线被拔掉之后,没有出现传说中的有一个节点被CRS强制重启,取而代之的是node2上面的ASM实例和RDBMS实例被关闭;当网线被重新插上时,node2上面的ASM实例和RDBMS实例自动重新启动。 基于上面的现象,在g...

    senntyou 评论0 收藏0
  • Veritas InfoScale Enterprise 7 安装部署手册

    摘要:前言能确保关键任务应用程序在遇到意外停机时能够持续正常运行,作为一款商业产品已经足够强大。 前言 Veritas InfoScale Availability 能确保关键任务应用程序在遇到意外停机时能够持续正常运行,作为一款商业产品已经足够强大。 记录 infoscale 7.0.1 安装部署心得 更新记录 2017年04月14日 - 初稿 阅读原文 - https://wsgzao...

    lolomaco 评论0 收藏0
  • ORA-02143: invalid STORAGE option --DSG oracle 11g

    摘要:复制数据到分类人阅读评论收藏举报报错原因新增了一个存储选项,不支持此选项解决办法在源端抽取语句是去掉存储选项,在源端下编辑文件配置如下重新发起全同步问题解决 ORA-02143: invalid STORAGE option --DSG oracle 11g 复制数据到oracle 10g 分类: GoldenGate&DSG&SharePlex 2012-10-17 02:36 10...

    yacheng 评论0 收藏0
  • ORA-02143: invalid STORAGE option --DSG oracle 11g

    摘要:复制数据到分类人阅读评论收藏举报报错原因新增了一个存储选项,不支持此选项解决办法在源端抽取语句是去掉存储选项,在源端下编辑文件配置如下重新发起全同步问题解决 ORA-02143: invalid STORAGE option --DSG oracle 11g 复制数据到oracle 10g 分类: GoldenGate&DSG&SharePlex 2012-10-17 02:36 10...

    J4ck_Chan 评论0 收藏0
  • Oracle 11g安装及遇到的问题

    摘要:安装过程及遇到的问题安装。遇到的问题位系统安装时遇到环境不满足最低要求。安装到配置时失败。小结以上都是我自己安装时碰到的问题,安装时碰到问题,可以去安装目录这个目录找错误日志定位原因及解决方法,或者上网寻求解决方案。 1、下载地址 1.1、oracle官网下载地址:http://www.oracle.com/technet...1.2、百度云链接:https://pan.baidu.c...

    liuchengxu 评论0 收藏0

发表评论

0条评论

lufficc

|高级讲师

TA的文章

阅读更多
最新活动
阅读需要支付1元查看
<