I/O Errors in Alert log with ORA-29701, with "gipcWait failed with 16" in trace (Doc ID 1496329.1)

1. Database alert log

Fri May  ::
Errors in file /oracle/app/oracle/diag/rdbms/orcl/rocl1/trace/rocl1_ora_65536796.trc:
ORA-: IO error writing block to file  (block # )
Fri May  :: 

2. Trace file

Oracle Database 11g Enterprise Edition Release  - 64bit Production
With the Partitioning, Real Application Clusters, Automatic Storage Management, OLAP,
Data Mining and Real Application Testing options
ORACLE_HOME = /oracle/app/oracle/product//db_1
System name:    AIX
Node name:      rac1
Release:
Version:
Machine:        00F6E7C84C00
Instance name: rocl1
Redo thread mounted by this instance:
Oracle process number:
Unix process pid: , image: oracle@rac1

*** -- ::58.840
*** SESSION ID:(-- ::58.840
*** CLIENT ID:() -- ::58.840
*** SERVICE NAME:(orcl) -- ::58.840
*** MODULE NAME:(JDBC Thin Client) -- ::58.840
*** ACTION NAME:() -- ::58.840

-- :: ()
-- ::
kgxgncin: CLSS init failed with status
kgxgncin: return status  ( SKGXN not av) from CLSS
kjfmsgr: unable to connect to NM for reg in shared group
ORA-: IO error writing block to file  (block # )
Dump of memory from 0x070001209CBA0328 to 0x070001209CBA0D3B
70001209CBA0320                    20544F44          [WITH TOD]

3. ocssd.log

-- Check the file /oracle/app//grid/log/rac1/cssd/ocssd.log
-- ::]clssgmQueueShare: (11ba99f10) target global grock DBORCL member  type  queued from client (1176496b0), global grock DBORCL, refcount
-- ::]clssgmRegisterShared: global grock DBORCL member  share type , refcount
-- ::] gipcmodMuxTransferAccept: internal accept request failed endp 1112a2970, child 11ba653d0, ret gipcretAuthFail (22) 
-- ::] gipcmodMuxTransferAccept: EXCEPTION[ ret gipcretAuthFail () ]  error during accept on endp 1112a2970
-- ::] gipcmodClscCallback: async request failed req 1172b0bf0 [00000000e3b63bc0] { gipcSendRequest : addr , olen , parentEndp 11abbcef
, ret gipcretConnectionLost (), objFlags )
-- ::] gipcmodMuxTransferAccept: internal accept request failed endp 1112a2970, child 11abbcef0, ret gipcretConnectionInvalid ()
-- ::] gipcmodMuxTransferAccept: EXCEPTION[ ret gipcretConnectionInvalid () ]  error during accept on endp 1112a2970
-- ::]clssscSelect: cookie accept request 11ad57f10
-- ::]clssscevtypSHRCON: getting client with cmproc 11ad57f10
-- ::]clssgmRegisterClient: proc(/11ad57f10), client(/1174aaa90)
-- ::]clssscSelect: cookie accept request 11ba74630
-- ::]clssscevtypSHRCON: getting client with cmproc 11ba74630
-- ::]clssgmRegisterClient: proc(/11ba74630), client(/)
-- ::]clssgmRegisterShared: grp DG_LOCAL_DATA, mbr , type
-- ::]clssgmQueueShare: (11a93a690) target local grock DG_LOCAL_DATA member  type  queued from client (1174aaa90), local grock DG_LOCAL_DATA, refcount
-- ::]clssgmRegisterShared: local grock DG_LOCAL_DATA member  share type , refcount
-- ::]clssgmRegisterShared: grp DBORCL, mbr , type
-- ::]clssgmQueueShare: (11a93ab70) target global grock DBORCL member  type  queued from client (), global grock DBORCL, refcount
-- ::]clssgmRegisterShared: global grock DBORCL member  share type , refcount
-- ::] gipcmodClscCallback: async request failed req 11730eff0 [00000000e3b63c64] { gipcSendRequest : addr , olen , parentEndp 11abbcef
, ret gipcretConnectionLost (), objFlags )
-- ::] gipcmodMuxTransferAccept: internal accept request failed endp 1112a2970, child 11abbcef0, ret gipcretConnectionInvalid ()
-- ::] gipcmodMuxTransferAccept: EXCEPTION[ ret gipcretConnectionInvalid () ]  error during accept on endp 1112a2970
-- ::]clssscSelect: cookie accept request 11ba4a590
-- ::]clssscevtypSHRCON: getting client with cmproc 11ba4a590
-- ::]clssgmRegisterClient: proc(/11ba4a590), client(/11764d8f0)
-- ::]clssscSelect: cookie accept request 1109c2e00
-- ::]clssgmAllocProc: (11bac8dd0) allocated
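
To see at a glance which gipc return codes dominate an ocssd.log excerpt like the one above, the lines can be tallied with a short script. This is only a sketch: it assumes each failure line contains the text "ret gipcret...", as in the excerpt, and the function name count_gipc_returns is hypothetical, not part of any Oracle tool.

```python
import re
from collections import Counter

# Matches return-code names such as gipcretAuthFail or gipcretConnectionLost
# when they follow the standalone word "ret", as in the excerpt above.
GIPC_RET = re.compile(r"\bret (gipcret[A-Za-z]+)")


def count_gipc_returns(lines):
    """Count how often each gipcret* return code appears in the given log lines."""
    counts = Counter()
    for line in lines:
        for name in GIPC_RET.findall(line):
            counts[name] += 1
    return counts


# Sample lines copied from the ocssd.log excerpt (timestamps omitted).
sample = [
    "gipcmodMuxTransferAccept: internal accept request failed endp 1112a2970, child 11ba653d0, ret gipcretAuthFail (22)",
    "gipcmodMuxTransferAccept: EXCEPTION[ ret gipcretAuthFail () ]  error during accept on endp 1112a2970",
    ", ret gipcretConnectionLost (), objFlags )",
    "clssscSelect: cookie accept request 11ad57f10",
]

for name, n in count_gipc_returns(sample).most_common():
    print(name, n)
```

On a real system the same function could be fed the open log file object instead of the sample list.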

4. Check CRS_HOME space and files

The directory has sufficient free space.
ls -ld /var/tmp/.oracle
drwxrwxrwt     root     oinstall         Nov    /var/tmp/.oracle
ls -ld /tmp/.oracle
drwxrwxrwt     root     oinstall        Jan  : /tmp/.oracle
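
The same two checks (directory present with the expected permissions, and free space on its filesystem) can be scripted so they are easy to repeat on each node. A minimal sketch using only the Python standard library; the helper name check_dir is hypothetical, and the two paths are the ones inspected above.

```python
import os
import shutil
import stat


def check_dir(path):
    """Report accessibility, type, permission bits, and free space for a path."""
    try:
        st = os.stat(path)
    except OSError as exc:
        # Missing path, permission problem, etc.
        return {"path": path, "accessible": False, "error": str(exc)}
    return {
        "path": path,
        "accessible": True,
        "is_dir": stat.S_ISDIR(st.st_mode),
        # drwxrwxrwt corresponds to mode 0o1777: world-writable plus sticky bit.
        "mode": oct(stat.S_IMODE(st.st_mode)),
        # Free space on the filesystem that holds the path, in GiB.
        "free_gib": shutil.disk_usage(path).free / 2**30,
    }


for p in ("/var/tmp/.oracle", "/tmp/.oracle"):
    print(check_dir(p))
```

The same helper can be pointed at the Grid home to confirm free space there; that path is site-specific and is therefore not hard-coded here.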

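5. At this time the number of active sessions in the database surged; statement 459f3z9u4fb3u, which queries dictionary views, hit the "cursor: pin S wait on X" wait event, and the SGA was shrinking and growing frequently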

SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |
SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |
GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |        
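
To quantify the back-and-forth shown in the listing above, the rows can be tallied per component and operation. The sketch below assumes the first three pipe-delimited columns are operation, mode, and component, which is inferred from the excerpt (the source does not show the query that produced it); the function names are hypothetical.

```python
from collections import Counter


def tally_resize_ops(rows):
    """Count resize operations per (component, operation) pair."""
    counts = Counter()
    for row in rows:
        fields = [f.strip() for f in row.split("|")]
        # Skip anything that is not a GROW/SHRINK data row.
        if len(fields) < 3 or fields[0] not in ("GROW", "SHRINK"):
            continue
        oper, _mode, component = fields[:3]
        counts[(component, oper)] += 1
    return counts


def oscillating_components(counts):
    """Return components that were both grown and shrunk in the sample."""
    grown = {comp for (comp, oper) in counts if oper == "GROW"}
    shrunk = {comp for (comp, oper) in counts if oper == "SHRINK"}
    return sorted(grown & shrunk)


# A few rows copied from the listing above.
sample = [
    "SHRINK      |IMMEDIATE   |db_cache_size        |       |      |     |COMPLETE |/ :          |",
    "GROW        |IMMEDIATE   |shared_pool_size     |       |      |     |COMPLETE |/ :          |",
    "GROW        |DEFERRED    |db_cache_size        |       |      |     |COMPLETE |/ :          |",
    "SHRINK      |DEFERRED    |shared_pool_size     |       |      |     |COMPLETE |/ :          |",
]

counts = tally_resize_ops(sample)
print(counts)
print(oscillating_components(counts))
```

A component that appears in both the grow and shrink sets within a short window is the pattern described here as the SGA shrinking and growing frequently.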

Cause 3. ocssd log has "gipcretAuthFail (22)" (Doc ID 1496329.1)

Example:

-- ::] gipcmodMuxTransferAccept: EXCEPTION[ ret gipcretAuthFail () ]  error during accept on endp 111249b70
gipcretAuthFail () indicates "general security authorization failure".

This can occur for several reasons:
* The filesystem is full and there is no space to create a file under the auth directory. Check that there is sufficient space in CRS_HOME.
* The /var/tmp/.oracle socket directory (/tmp/.oracle on some platforms) has been deleted. Check that it still exists.

The findings match "Cause 3. ocssd log has "gipcretAuthFail (22)"" (Doc ID 1496329.1), but in our case the database software directory has sufficient space and the .oracle directory exists.

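Analysis summary: the ORA-01114 alert was caused by database performance problems triggered by SGA thrashing (frequent shrinking and growing of SGA components).

Recommendation: increase the SGA size from 132G to 180G (the value suggested by v$sga_target_advice).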
