PostgreSQL 9.3 Streaming Replication 状态监控

postgresql是使用Streaming Replication来实现热备份的，热备份的作用如下：

灾难恢复
高可用性
负载均衡，当你使用Streaming Replication来实现热备份(hot standby)的时候,可以再standby上执行查询语句，也只允许执行select

那么，当我们有大量使用了流复制的机器之后，监控 Streaming Replication 的正常运行，在我们的部署中是非常重要的。

那么，我们会有下面的监控问题：

如何更好的监控流复制（Streaming Replication）
监控它们最好的方法是什么
除了使用 Master的pg_stat_replication视图 监控，还有什么在standby上可用的方法来监控流复制
如何计算 replication 滞后时间，以秒、分钟为单位。

针对上面几个常见的问题，下面是一些我认为比较有用的方法。

master/primary server 上的pg_stat_replication 视图

 pid:              walsender process的进程ID

 usesysid:         执行流复制的用户的OID

 usename:          执行流复制的用户的用户名

 application_name: 连接到master的Application name

 client_addr:      standby/streaming replication的ip地址

 client_hostname:  Hostname of standby.

 client_port:      standby上的TCP port

 backend_start:    从数据第一次连接master的时间

 state:            当前WAL sender状态 i.e streaming

 sent_location:    Last transaction location sent to standby.

 write_location:   Last transaction written on disk at standby

 flush_location:   Last transaction flush on disk at standby.

 replay_location:  Last transaction flush on disk at standby.

 sync_priority:    standby服务器的优先级

 sync_state:       standby的同步类型( async/synchronous)(异步/同步).

 e.g.:

 postgres=# \x

 Expanded display is on.

 postgres=# select * from pg_stat_replication;

 -[ RECORD 1 ]----+------------------------------

 pid              | 19597

 usesysid         | 16384

 usename          | repl

 application_name | walreceiver

 client_addr      | 210.61.161.183

 client_hostname  |

 client_port      | 50474

 backend_start    | 2015-02-04 11:07:27.137356+08

 state            | streaming

 sent_location    | 4/E059E560

 write_location   | 4/E059E560

 flush_location   | 4/E059E560

 replay_location  | 4/E059BEB0

 sync_priority    | 0

 sync_state       | async

select pg_is_in_recovery();, 这个函数在standby执行，会告诉你，是否处于recovery 模式！

 e.g.

 # standby处于复制状态，返回 t, 否则返回 f

 postgres=# select pg_is_in_recovery();

  pg_is_in_recovery

 -------------------

  t

 (1 row)

 # 下面不是standby的例子

 postgres=#  select pg_is_in_recovery();

  pg_is_in_recovery

 -------------------

  f

 (1 row)

select pg_last_xlog_replay_location();, 同样是在standby上执行，显示recovery 过程中的最近一个事务。

 e.g.

 postgres=# select pg_last_xlog_replay_location();

  pg_last_xlog_replay_location

 ------------------------------

  0/27099838

 (1 row)

select pg_last_xlog_receive_location();, standby上执行，standby最后接收到的事务日志，并且已经同步写到硬盘的.

 e.g.

 postgres=#  select pg_last_xlog_receive_location();

  pg_last_xlog_receive_location

 -------------------------------

  0/2709CB70

 (1 row)

select pg_last_xact_replay_timestamp();, standby上执行，recovery过程中最后一个事务执行的时间

 e.g.

 postgres=# select pg_last_xact_replay_timestamp();

  pg_last_xact_replay_timestamp

 -------------------------------

  2015-02-09 19:48:57.916245+08

 (1 row)

接下来的问题，是如何正确的在master和standby上监控Streaming Replication：

standby上的监控：

select pg_is_in_recovery(); 判断是否处于recovery模式

查看recovery的延时情况：

 SELECT CASE WHEN pg_last_xlog_receive_location() = pg_last_xlog_replay_location()

 	THEN 0

 	ELSE EXTRACT (EPOCH FROM now() - pg_last_xact_replay_timestamp())

     END AS log_delay;

 # 如果receive和replay是同一个位置，延时为0；否则当前时间减去最后一个事务的时间为延时

  log_delay

 -----------

          0

 (1 row)

pg_last_xact_replay_timestamp和 pg_last_xlog_replay_location 判断recovery是否处于工作状态。当Streaming Replication在复制的时候，replay_timestamp和pg_last_xlog_replay_location会一直增长。

 postgres=# select pg_last_xact_replay_timestamp();

  pg_last_xact_replay_timestamp

 -------------------------------

  2015-02-09 20:53:54.48081+08

 (1 row)

 postgres=# select pg_last_xact_replay_timestamp();

  pg_last_xact_replay_timestamp

 -------------------------------

  2015-02-09 20:53:55.456179+08

 postgres=# select pg_last_xlog_replay_location();

  pg_last_xlog_replay_location

 ------------------------------

  5/723E528

 (1 row)

 postgres=# select pg_last_xlog_replay_location();

  pg_last_xlog_replay_location

 ------------------------------

  5/72514B8

 (1 row)

master上的监控：

查看pg_stat_replication 中的状态呢，使用

 postgres=# select * from pg_stat_replication;

 -[ RECORD 1 ]----+------------------------------

 pid              | 19597

 usesysid         | 16384

 usename          | repl

 application_name | walreceiver

 client_addr      | 210.61.161.183

 client_hostname  |

 client_port      | 50474

 backend_start    | 2015-02-04 11:07:27.137356+08

 state            | streaming

 sent_location    | 5/64046A8

 write_location   | 5/64046A8

 flush_location   | 5/64046A8

 replay_location  | 5/64027B0

 sync_priority    | 0

 sync_state       | async

在master判断recovery的滞后程度，以字节为单位

 postgres=# select pg_xlog_location_diff(sent_location, replay_location) from pg_stat_replication;

  pg_xlog_location_diff

 -----------------------

                   1968

 (1 row)

 postgres=# select pg_xlog_location_diff(sent_location, replay_location) from pg_stat_replication;

  pg_xlog_location_diff

 -----------------------

                   1488

参考：

Postgresql The Statistics Collector

System Administration Functions

PostgreSQL 9.3 Streaming Replication 状态监控的更多相关文章

postgresql Streaming Replication监控与注意事项
一监控Streaming Replication集群 1 pg_stat_replication视图(主库端执行) pid Wal sender process的进程ID usesysid 执行流复制 ...
Streaming replication slots in PostgreSQL 9.4
Streaming replication slots are a pending feature in PostgreSQL 9.4, as part of the logical changese ...
配置PostgreSQL Streaming Replication集群
运行环境: Primary: 192.168.0.11 Standby: 192.168.0.21, 192.168.0.22 OS: CentOS 6.2 PostgreSQL: 9.1.2 版本以 ...
PostgreSQL Streaming Replication的FATAL ERROR
磨砺技术珠矶,践行数据之道,追求卓越价值回到上一级页面: PostgreSQL集群方案相关索引页回到顶级页面:PostgreSQL索引页[作者高健@博客园 luckyjackgao@gm ...
PostgreSQL的streaming replication
磨砺技术珠矶,践行数据之道,追求卓越价值回到上一级页面: PostgreSQL集群方案相关索引页回到顶级页面:PostgreSQL索引页[作者高健@博客园 luckyjackgao@gm ...
mysql主从同步(4)-Slave延迟状态监控
mysql主从同步(4)-Slave延迟状态监控转自:http://www.cnblogs.com/kevingrace/p/5685511.html 之前部署了mysql主从同步环境(Mysql ...
redis状态监控与性能调优
本文主要介绍及演示一些Redis相关的状态监控和性能调优的命令及使用方法: 1.redis-benchmark redis基准信息,redis服务器性能检测例如: 检测redis服务器性能,本机63 ...
6. MGR状态监控 | 深入浅出MGR
GreatSQL社区原创内容未经授权不得随意使用,转载请联系小编并注明来源. 目录 1. 节点状态监控 2. MGR事务状态监控 3. 其他监控 4. 小结参考资料.文档免责声明文章推荐: 关于 ...
【VC++技术杂谈003】打印技术之打印机状态监控
在上一篇博文中我主要介绍了如何获取以及设置系统的默认打印机,本文将介绍如何对打印机状态进行实时监控,记录下所打印的文档.打印的份数以及打印时间等打印信息. 1.打印机虚脱机技术在正式介绍如何对打印机 ...

随机推荐

MySQL与逻辑模块
启动MySQL 1.初始化模块运行&&存储引擎初始化运行 2.1中运行完毕后 ---->连接管理模块接手 3.连接管理模块启动处理client连接请求的监听程序(tcp/ip 网 ...
认识Backbone (三)
Backbone.Collection(集合) collection是model对象的一个有序的组合,我们可以在集合上绑定 "change" 事件,从而当集合中的模型发生变化时f ...
学习通过Thread+Handler实现非UI线程更新UI组件（转）
[Android线程机制] 出于性能考虑,Android的UI操作并不是线程安全的,这就意味着如果有多个线程并发操作UI组件,可能导致线程安全问题.为了解决这个问题,Android制定了一条简单的规则 ...
Poj3414广泛搜索
<span style="color:#330099;">/* D - D Time Limit:1000MS Memory Limit:65536KB 64bit I ...
Android - 支持不同的设备 - 支持不同的平台版本
在最新版本的Android为app提供很好的新API时,也应该继续支持旧版本的Android直到大部分设备已经更新了.这里将要介绍如何在使用最新API带来的优点的同时继续支持老版本. Dashboar ...
nuget 命令详解
包相关 Install-Package 安装包 -Version 4.3.1 参数指定版本Uninstall-Package 卸载包Update-Package 更新包Get-Package 默认 ...
Raw-OS互斥的源代码分析的量的Mutex
作为分析的内核版本2014-04-15,基于1.05正式版.blogs我们会跟上的内核开发进度的最新版本,如果出现源代码的目光"???"的话,没有深究的部分是理解. Raw-OS官 ...
ext Window点击右上角的关闭（Xbutton）加入监控事件
使用场景:关闭window的时候添加监听事件. 正确的使用方式: addwin = new Ext.Window({ title : '新增', closable : true, width : 50 ...
FreeBSD包管理
FreeBSD软件没有安装Ubuntu的apt-get.它也不是Gentoo的portage.有三种方式: package ports 自主进行源代码编译安装这里简介前两种. FreeBSD 6.0 ...
SDL2来源分析7：演出（SDL_RenderPresent()）
===================================================== SDL源代码分析系列文章上市: SDL2源码分析1:初始化(SDL_Init()) SDL2 ...

PostgreSQL 9.3 Streaming Replication 状态监控

PostgreSQL 9.3 Streaming Replication 状态监控的更多相关文章

随机推荐

热门专题