有时候,你会在ORACLE数据库的告警日志中发现“Thread <number> cannot allocate new log, sequence <number>  Checkpoint not complete”这类告警。具体案例如下所示:

Thread 1 cannot allocate new log, sequence 279334

Checkpoint not complete

Current log# 4 seq# 279333 mem# 0: /u01/oradata/GSP/redo04.log

Current log# 4 seq# 279333 mem# 1: /u03/oradata/GSP/redo04.log

当然Thread或sequence的数值可能有所不同,基本上是类似下面这样的告警信息

Thread <number> cannot allocate new log, sequence <number>

Checkpoint not complete

也有可能是因为在等待重做日志的归档,出现的是下面这类告警信息

ORACLE Instance <name> - Can not allocate log, archival required

Thread <number> cannot allocate new log, sequence <number>

那么出现这类告警的具体原因是什么呢? 以及要如何去解决这个问题呢?

原因分析:

通常来说是因为重做日志(redo log)在写满后就会切换日志组,这个时候就会触发一次检查点事件(checkpoint),检查点(checkpoint)激活时会触发数据库写进程(DBWR),将数据缓冲区里的脏数据块写回到磁盘的数据文件中,只要这个脏数据写回磁盘事件没结束,那么数据库就不会释放这个日志组。在归档模式下,还会伴随着ARCH进程将重做日志进行归档的过程。如果重做日志(redo log)产生的过快,当CPK或归档还没完成,LGWR已经把其余的日志组写满,又要往当前的日志组里面写redo log的时候,这个时候就会发生冲突,数据库就会被挂起。并且一直会往alert.log中写类似上面的错误信息。

另外,重做日志在不同业务时段的切换频率不一样,所以出现这个错误,一般是业务繁忙或者出现大量DML操作的时候。

解决方法:

 

1:增大REDO LOG FILE的大小

增大redo log file的大小容易操作,但是redo log file设置为多大才是合理的呢?

1:参考V$INSTANCE_RECOVERY中OPTIMAL_LOGFILE_SIZE字段值,但是这个字段有可能为Null值,除非你调整FAST_START_MTTR_TARGET参数的值大于0

Redo log file size (in megabytes) that is considered optimal based on the current setting of FAST_START_MTTR_TARGET. It is recommended that the user configure all online redo logs to be at least this value.

官方文档的建议如下:

You can use the V$INSTANCE_RECOVERY view column OPTIMAL_LOGFILE_SIZE to determine the size of your online redo logs. This field shows the redo log file size in megabytes that is considered optimal based on the current setting of FAST_START_MTTR_TARGET. If this field consistently shows a value greater than the size of your smallest online log, then you should configure all your online logs to be at least this size.

Note, however, that the redo log file size affects the MTTR. In some cases, you may be able to refine your choice of the optimal FAST_START_MTTR_TARGET value by re-running the MTTR Advisor with your suggested optimal log file size.

SQL> SELECT OPTIMAL_LOGFILE_SIZE FROM V$INSTANCE_RECOVERY;

2:根据重做日志切换次数和重做日志生成的量来判断

可以用awr_redo_size_history脚本统计分析一下,每个小时、每天生成的归档日志的大小,然后可以某些时间段(切换频繁的时间段)的归档日志大小和15~ 20分钟(如果某个时间段切换非常频繁,几乎无法使用这个规则,因为重组日志会非常大)切换一次计算重做日志大小。当然这个不是放之四海而皆准的规则,需要根据实际业务判断,大部分情况下还是可以参考这个

计算重做日志的一个脚本,仅供参考

SELECT

(SELECT ROUND(AVG(BYTES) / 1024 / 1024, 2) FROM V$LOG) AS "Redo size (MB)",

ROUND((20 / AVERAGE_PERIOD) * (SELECT AVG(BYTES)

FROM V$LOG) / 1024 / 1024, 2) AS "Recommended Size (MB)"

FROM (SELECT AVG((NEXT_TIME - FIRST_TIME) * 24 * 60) AS AVERAGE_PERIOD

FROM V$ARCHIVED_LOG

WHERE FIRST_TIME > SYSDATE - 3

    AND TO_CHAR(FIRST_TIME, 'HH24:MI') BETWEEN

    &START_OF_PEAK_HOURS AND &END_OF_PEAK_HOURS

);

2:增加REDO LOG Group的数量

增加日志组的数量,其实并不能解决“Thread <number> cannot allocate new log, sequence <number> Checkpoint not complete” 这个问题,但是他能解决下面这个问题:

ORACLE Instance <name> - Can not allocate log, archival required

Thread <number> cannot allocate new log, sequence <number>

这个是因为ARCH进程,尚未完成将重做日志文件复制到归档目标(需要存档),而此时由于重做日志切换太快或日志组过少,必须等待ARCR进程完成归档后,才能循环覆盖日志组。

3:Tune checkpoint

 

这个比较难,参考官方文档:Note 147468.1 Checkpoint Tuning and Troubleshooting Guide

4:Increase I/O speed for writing online REDO log/Archived REDO

This applies to Thread <number> cannot allocate new log, sequence <number>

Checkpoint not complete

- use ASYNC I/O if not already so

- use DBWR I/O slaves or multiple DBWR processes

Reference:

Oracle Database Performance Tuning Guide

Instance Tuning Using Performance Views

Consider Multiple Database Writer (DBWR) Processes or I/O Slaves

10.2 - http://docs.oracle.com/cd/B19306_01/server.102/b14211/instance_tune.htm#i42802

11.1 - http://docs.oracle.com/cd/B28359_01/server.111/b28274/instance_tune.htm#i42802

11.2 - http://docs.oracle.com/cd/E11882_01/server.112/e16638/instance_tune.htm#PFGRF94511

- consider the generic recommendations for REDO log files:

If the high I/O files are redo log files, then consider splitting the redo log files from the other files. Possible configurations can include the following:

1. Placing all redo logs on one disk without any other files. Also consider availability; members of the same group should be on different physical disks and controllers for recoverability purposes.

2. Placing each redo log group on a separate disk that does not store any other files.

3. Striping the redo log files across several disks, using an operating system striping tool. (Manual striping is not possible in this situation.)

4. Avoiding the use of RAID 5 for redo logs.

Reference:

Oracle Database Performance Tuning Guide

Redo Log Files

10.2 - http://docs.oracle.com/cd/B19306_01/server.102/b14211/iodesign.htm#sthref534

11.1 - http://docs.oracle.com/cd/B28359_01/server.111/b28274/iodesign.htm#CHDBCDHG

11.2 - http://docs.oracle.com/cd/E11882_01/server.112/e16638/iodesign.htm#PFGRF94396

For

ORACLE Instance <name> - Can not allocate log, archival required

Thread <number> cannot allocate new log, sequence <number>

In the above document you may check section "Archived Redo Logs"

5: 找到产生大量重做日志的SQL,如果这个SQL有业务或逻辑上不合理的地方,就要修改,或者将相关表设置为NOLOGGING,减少重做日志的产生

关于如何定位那些SQL产生了大量的重做日志,可以使用LogMiner工具,也可以参考我这篇博客“如何定位那些SQL产生了大量的redo日志

参考资料:

https://asktom.oracle.com/pls/asktom/f?p=100:11:0::::P11_QUESTION_ID:69012348056

Manual Log Switching Causing "Thread 1 Cannot Allocate New Log" Message in the Alert Log (文档 ID 435887.1)

Can Not Allocate Log (文档 ID 1265962.1)

https://gokhanatil.com/2009/08/optimum-size-of-the-online-redo-log-files.html

Thread <number> cannot allocate new log, sequence <number>浅析的更多相关文章

  1. Thread 1 cannot allocate new log, sequence 187398

    报错信息: Thread 1 cannot allocate new log, sequence 187398Checkpoint not complete 处理方法: 查看REDO日志组 selec ...

  2. Thread 1 cannot allocate new log的问题分析 (转载)

    Thread 1 cannot allocate new log的问题分析 发生oracle宕机事故,alert文件中报告如下错误: Fri Jan 12 04:07:49 2007Thread 1 ...

  3. Thread 1 cannot allocate new log 的处理办法

    ALTER SYSTEM ARCHIVE LOG Thread 1 cannot allocate new log, sequence 2594 Checkpoint not complete 这个实 ...

  4. InnoDB: The log sequence number in ibdata files does not match

    InnoDB: The log sequence number in ibdata files does not matchInnoDB的:在ibdata文件的日志序列号不匹配 可能ibdata文件损 ...

  5. mysql oom之后的page 447 log sequence number 292344272 is in the future

    mysql oom之后,重启时发生130517 16:00:10 InnoDB: Error: page 447 log sequence number 292344272InnoDB: is in ...

  6. Thread 1 cannot allocate new log的问题分析

    http://blog.csdn.net/zonelan/article/details/7613519 http://leoguan.blog.51cto.com/816378/584494 htt ...

  7. The log scan number (620023:3702:1) passed to log scan in database 'xxxx' is not valid

    昨天一台SQL Server 2008R2的数据库在凌晨5点多抛出下面告警信息: The log scan number (620023:3702:1) passed to log scan in d ...

  8. [crypto][ipsec] 简述ESP协议的sequence number机制

    预备 首先提及一个概念叫重放攻击,对应的机制叫做:anti-replay https://en.wikipedia.org/wiki/Anti-replay IPsec协议的anti-replay特性 ...

  9. Sequence Number

    1570: Sequence Number 时间限制: 1 Sec  内存限制: 1280 MB 题目描述 In Linear algebra, we have learned the definit ...

随机推荐

  1. 针对base64编码和URIEncode的一点研究

    Base64编码的作用 将任意的二进制比特串编码成由ASCii码中的64个可显示字符组成的字符串. 为什么需要base64编码? 所有的文件,本质上都是0.1组成的比特串,文本文件.二进制文件的区别只 ...

  2. C语言程序设计100例之(11):求质数

    例11  求质数 问题描述 质数是指除了有1和自身作为约数外,不再有其他约数的数.比如:3.5.7是质数.而9不是质数,因为它还有约数3. 编写程序求给定区间中的所有质数. 输入格式 两个整数a和b, ...

  3. 使用pycharm或idea提交项目到github

    pycharm和idea的操作方式几乎一样,所以下面就以pycharm为例来介绍. 安装git https://git-scm.com/download/win 官网,下载慢,需翻墙 https:// ...

  4. SpringBoot 项目运行在 tomcat7 上

    SpringBoot 项目如何打成 war 包 SpringBoot项目的默认打包方式是将工程打包成为一个 jar 包.部分情况下,我们需要将项目打包成一个 war 包,以方便我们将工程部署在 tom ...

  5. spark shell操作

    RDD有两种类型的操作 ,分别是Transformation(返回一个新的RDD)和Action(返回values). 1.Transformation:根据已有RDD创建新的RDD数据集build ...

  6. RESTful 架构风格

    在移动互联网的大潮下,『微服务』的概念也越来越被大家接受并应用于实践,日益增多的web service逐渐统一于RESTful 架构风格,如果开发者对RESTful 架构风格不甚了解,则开发出的所谓R ...

  7. JS---part2课程介绍+part1复习

    part1复习 JavaScript分三个部分: 1. ECMAScript标准----JS基本的语法 2. DOM:Document Object Model 文档对象模型 3. BOM:浏览器对象 ...

  8. English: Class logogram

    IT # this is a IT type ISP ANOTHER # following is another logogram LCD PDA

  9. 47-准备 Overlay 网络实验环境

    为支持容器跨主机通信,Docker 提供了 overlay driver,使用户可以创建基于 VxLAN 的 overlay 网络.VxLAN 可将二层数据封装到 UDP 进行传输,VxLAN 提供与 ...

  10. 关于SQL2005备份和还原的语法操作

    原数据库:restore database 练习版 from disk='D:\db-recovery\@@@.BAK' WITH REPLACE/WITH NORECOVERY备份数据库:backu ...