Pivotal Knowledge Base


HAWQ fails to start with the error "Failed to start Master instance in admin mode"

Environment

Product        Version
Pivotal HDB    2.x / 3.x
Others

Symptom

Starting HAWQ fails with output similar to the following:

[gpadmin@cl1554 init.d]$ service hawq start 
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Starting gpstart with args: -a
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Gathering information and validating the environment...
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Greenplum Binary Version: 'postgres (HAWQ) 4.2.0 build 1'
20161012:05:53:54:609170 gpstart:cl1554:gpadmin-[INFO]:-Greenplum Catalog Version: '201402180'
20161012:05:53:54:609170 gpstart:cl1554:gpadmin-[INFO]:-Starting Master instance in admin mode
20161012:05:58:55:609170 gpstart:cl1554:gpadmin-[CRITICAL]:-Failed to start Master instance in admin mode
20161012:05:58:55:609170 gpstart:cl1554:gpadmin-[CRITICAL]:-Error occurred: non-zero rc: 1
Command was: 'env GPSESSID=0000000000 GPERA=None $GPHOME/bin/pg_ctl -D /apps/hdfs/data/nn/hawqmaster/gpseg-1 -l /apps/hdfs/data/nn/hawqmaster/gpseg-1/pg_log/startup.log -w -t 600 -o " -p 5432 -b 1 -z 64 --silent-mode=true -i -M master -C -1 -x 66 -c gp_role=utility " start'
rc=1, stdout='waiting for server to start...............................................................................................................................
........................................................................................................................................................................
.........could not start server
', stderr='pg_ctl: PID file "/apps/hdfs/data/nn/hawqmaster/gpseg-1/postmaster.pid" does not exist

The startup.log file in the master directory's pg_log subdirectory shows the following:

2016-10-12 09:53:55.519809 GMT,,,p609185,th1707034656,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,
2016-10-12 09:59:58.119876 GMT,,,p609329,th-1399326688,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,
2016-10-12 10:05:33.659481 GMT,,,p609663,th1654880288,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,

Cause

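The master directory has group or world access. The postmaster requires this directory to be accessible only by its owner (u=rwx, 0700) and refuses to start otherwise, as the FATAL entries in startup.log show.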
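
To confirm the cause before changing anything, the group and world permission bits of the master directory can be inspected. The following is a minimal sketch, not part of HAWQ: the function name has_group_or_world_access is hypothetical, and it only reads the mode string printed by ls -ld.

```shell
# Hypothetical helper (not a HAWQ utility): succeeds (exit 0) when the given
# directory grants any permission to group or others.
has_group_or_world_access() {
    # Exit status 2 means the argument is not a directory.
    [ -d "$1" ] || return 2
    # Characters 5-10 of the `ls -ld` mode string are the group and other bits.
    perms=$(ls -ld "$1" | cut -c5-10)
    [ "$perms" != "------" ]
}

# Example, using the master directory from the log above:
# has_group_or_world_access /apps/hdfs/data/nn/hawqmaster/gpseg-1 && echo "too permissive"
```

If the function succeeds for the master directory, the permissions match the FATAL message shown in startup.log.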

Resolution

Correct the permissions on the master directory. In the example above, the master directory is "/apps/hdfs/data/nn/hawqmaster/gpseg-1".

The correct permissions for the master directory are drwx------ (0700), as in the following example listing:

[root@amb171hawq ~]# ls -ltr /data/hawq/master/
total 8
drwx------. 17 gpadmin gpadmin 4096 May 21 12:36 gpseg-1
[root@amb171hawq ~]#
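
One way to apply the fix is with chmod. This is a sketch under assumptions, not an official procedure: the function name fix_master_dir_perms is hypothetical, and the path in the example is the master directory from the log above, so substitute your own. Run it as root or as the directory owner (gpadmin in the listing above).

```shell
# Hypothetical helper: restrict a master directory to owner-only access (0700)
# and print the resulting listing for verification.
fix_master_dir_perms() {
    dir="$1"
    if [ ! -d "$dir" ]; then
        echo "not a directory: $dir" >&2
        return 1
    fi
    # u=rwx, no access for group or others, as required by the postmaster.
    chmod 700 "$dir" || return 1
    # Show the directory itself (not its contents) to confirm the new mode.
    ls -ld "$dir"
}

# Example, using the master directory from the log above:
# fix_master_dir_perms /apps/hdfs/data/nn/hawqmaster/gpseg-1
```

Once the listing shows drwx------ for the directory, start HAWQ again.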

  
