DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
安装greenplum集群出现以下错误:
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, please wait...
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Reading Greenplum configuration file init_config
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale has not been set in init_config, will set to default value
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale set to en_US.utf8
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-No DATABASE_NAME set, will exit following template1 updates
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-MASTER_MAX_CONNECT not set, will set to default value 250
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, please wait...
..
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Configuring build for standard array
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Building primary segment instance array, please wait...
..................
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking Master host
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, please wait...
..................
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, Completed
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Building the Master instance database, please wait...
20160315:13:49:49:025696 gpinitsystem:h95:jason-[INFO]:-Starting the Master in admin mode
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Commencing parallel build of primary segment instances
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Spawning parallel processes batch [1], please wait...
..................
20160315:13:51:36:025696 gpinitsystem:h95:jason-[INFO]:-Waiting for parallel processes batch [1], please wait...
..................................................
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Parallel process exit status
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as completed = 18
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as killed = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as failed = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Deleting distributed backout files
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Removing back out file
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-No errors generated from parallel processes
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Restarting the Greenplum instance in production mode
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Starting gpstop with args: -a -i -m -d /home/jason/gpdata/gpseg-1
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Greenplum Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-There are 0 connections to the database
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode='immediate'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master host=h95
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode=immediate
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master segment instance directory=/home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Attempting forceful termination of any leftover master process
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Terminating processes for segment /home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[ERROR]:-Failed to kill processes for segment /home/jason/gpdata/gpseg-1: ([Errno 3] No such process)
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting gpstart with args: -a -d /home/jason/gpdata/gpseg-1
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Binary Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Catalog Version: '201310150'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting Master instance in admin mode
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Setting new master era
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Master Started...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Shutting down master
20160315:13:52:31:011187 gpstart:h95:jason-[INFO]:-Commencing parallel segment instance startup, please wait...
.......
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Process results...
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Successful segment starts = 18
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Failed segment starts = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Skipped segment starts (segments are marked down in configuration) = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Successfully started 18 of 18 segment instances
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Starting Master instance h95 directory /home/jason/gpdata/gpseg-1
20160315:13:52:39:011187 gpstart:h95:jason-[INFO]:-Command sys_ctl reports Master h95 instance active
20160315:13:54:33:011187 gpstart:h95:jason-[WARNING]:-FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603) 20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-No standby master configured. skipping...
20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-Check status of database with gpstate utility
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Completed restart of Greenplum instance in production mode
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Loading gp_toolkit...
psql: FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
20160315:13:56:26:gpinitsystem:h95:jason-[FATAL]:-Failed to retrieve rolname. Script Exiting!
我的集群配置:两台机器,32g内存16g交换分区。每台机器9个节点。集群按照完成之后,显示segment启动的18个,但是通过psql连接不上,报错!
主要错误信息:
DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
去官网看了很多人遇到此类的问题,错误原因有很多,今天特地总结以下:
Q&A1:系统环境变量没有设置正确,这个需要根据自己安装版本的greenplum去设置一下环境变量,可以去官网相对应的版本install guide 那里设置一下!
Q&A2:shared_buffers设置太大,对于如何根据自己内存和segment节点个数分配shared_buffers,可以去官网找一下,通常出去2g的other,以及statement_mem * segment 个数,剩下的除以segment的个数即可。这种情况通常出现中安装过程中就设置了shared_buffers,一般默认的125MB
Q&A3:防火墙是否关闭,这个情况最容易忽略,也是最容易出现的,通常有些人重启机器之后就忘记了关闭,我就是这样的,嘿嘿。你可以设置防火墙重启后一样生效!
。。。还有其他的原因欢迎来补充!谢谢,分享是一种美,希望能帮到你!
DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)的更多相关文章
- 运行QQ出现initialization failure 0x0000000c错误和浏览器上不了网
出现QQ出现initialization failure 0x0000000c错误和浏览器上不了网的问题,原因是关机的时候没有正常关闭导致的. 解决方法: 1.我们在开始菜单栏中的附件中找到“命令提示 ...
- Fast Failure Detection and Recovery in SDN with Stateful Data Plane
文章名称:Fast Failure Detection and Recovery in SDN with Stateful Data Plane 利用SDN的带状态数据平面进行快速故障检测和恢复 发表 ...
- java.lang.NoClassDefFoundError: com.sap.conn.jco.JCo (initialization failure) java.lang.UnsatisfiedLinkError: no sapjco3 in java.library.path
java.lang.NoClassDefFoundError: com.sap.conn.jco.JCo (initialization failure) at java.lang.J9VMInter ...
- java执行spark查询hbase的jar包出现错误提示:ob aborted due to stage failure: Master removed our application: FAILED
执行java调用scala 打包后的jar时候出现异常 /14 23:57:08 WARN TaskSchedulerImpl: Initial job has not accepted any re ...
- ”initialization failure:0x0000000C“错误,何解?
今天开机后打开软件,报出这样的警告”initialization failure:0x0000000C“. 我问了度娘,看了很多回答,答案参差不齐.其中,有个回答还是很不错的(刚好我的是win10系统 ...
- “error: command 'x86_64-linux-gnu-gcc' failed with exit status 1” in virtualenv
Most of the time these are dependency-issues. Following the stack-trace of the gcc compiler one ca ...
- 10.Execution failed with exit status: 3
错误信息: insert overwrite table t_mobile_mid_use_p_tmp4_rcf select '201411' as month_id, a.prov_id, a.c ...
- command 'x86_64-linux-gnu-gcc' failed with exit status 1错误及解决方案
Ubuntu16.04安装Scrapy(pip install Scrapy)时提示错误如下: Failed building wheel for cryptography Running setup ...
- error: command 'cc' failed with exit status 1
报错: Complete output from command /usr/bin/python -u -c "import setuptools, tokenize;__file__='/ ...
随机推荐
- 数据可视化开源系统(python开发)
Caravel 是 Airbnb (知名在线房屋短租公司)开源的数据探查与可视化平台(曾用名Panoramix),该工具在可视化.易用性和交互性上非常有特色,用户可以轻松对数据进行可视化分析. 核心功 ...
- dedecms(织梦)自定义表单后台显示不全 自定义模型当中添加自定义字段后在后台添加内容后不显示解决方案
我们常用dedecms 自定义表单做留言功能.但是偶尔会遇到这样一个问题,就是 在前台提交表单后..后天显示不全.特别是中文字符 都不会显示, 比如下图: 这是因为 如果你织梦是gbk的话那就对了 ...
- php 不能同时提交form
注意:提交form到相应的页面时,不能在form中嵌套一个form,否则,不能提交
- JS的单例模式
维基百科对单例模式的介绍如下: 在应用单例模式时,生成单例的类必须保证只有一个实例的存在,很多时候整个系统只需要拥有一个全局对象,才有利于协调系统整体的行为.比如在整个系统的配置文件中,配置数据有一个 ...
- 关于 free() 函数用法的若干疑问
<C语言参考手册>中关于 free() 函数有如下描述. (1)free() 函数的原型 void free(void *ptr); (2)free 函数对以前由 malloc.callo ...
- SharePoint 2010 用Event Receiver将文件夹自动变成approved状态 (1)
当开发一个sharepoint门户网站,或者是一个内容管理的网站的时候,站点的模板通常会选用publish portal,或者是开启了publishing feature来对内容进行版本控制和流程控制 ...
- ASP.net体系
- mobilize扁平化的fullPage.js类工具使用心得
可以生成一个fullPage效果的主页,但是列表页面和内容页面呢? 主页中的block,可以选择多种组建生成.甚至连form都有: 应该改造其源代码,动态化和cms系统化,添加二三级页面模板: == ...
- Cloudera Manager、CDH零基础入门、线路指导 http://www.aboutyun.com/thread-9219-1-1.html (出处: about云开发)
Cloudera Manager.CDH零基础入门.线路指导http://www.aboutyun.com/thread-9219-1-1.html(出处: about云开发) 问题导读:1.什么是c ...
- Java学习03
Java学习03 1.java面试一些问题 一.什么是变量 变量是指在程序执行期间可变的数据.类中的变量是用来表示累的属性的,在编程过程中,可以对变量的值进行修改.变量通常是可变的,即值是变化的 二. ...