在CentOS上以源碼編譯的方式安裝Greenplum數據庫

集羣組成:html

  一臺主機,一臺從節點。python

系統環境:linux

  操做系統:CentOS 7,64位,7.4.1708(/etc/redhat-release中查看)c++

  CPU:AMD Fx-8300 8核git

  內存:8GBgithub

  硬盤:120GBsql

  GNOME:3.22.2數據庫

安裝版本:bootstrap

  GPDB:V5.4.1api

  GPORCA:V2.53.11

 

經驗參考:
  http://www.jishux.com/plus/view-683368-1.html
  http://www.javashuo.com/article/p-gwikepxw-dn.html
  http://www.jpblog.cn/greenplum%E6%BA%90%E7%A0%81%E7%BC%96%E8%AF%91%E5%AE%89%E8%A3%85.html

 

前提條件:禁用防火牆(全部節點和主機都要禁用!!)

使用root帳號執行下列命令(同時禁用默認的防火牆和可能已經安裝的iptables,共兩個防火牆程序):

關閉默認的防火牆

# systemctl stop firewalld

屏蔽默認的防火牆(重啓後也不會啓動)

# systemctl mask firewalld

關閉iptables

# systemctl stop iptables

禁用iptables

# systemctl disable iptables

 

安裝過程

一)建立專有帳號gpdba,並加入root用戶組。

下面全部操做都使用gpdba帳號來執行!若是操做失敗,則使用root帳號。

 

二)修改全部服務器的主機名(全部節點和主機)

 

1)修改hosts使用命令 vi /etc/hosts 來修改

 

127.0.0.1 localhost localhost.localdomain

 

192.168.58.102 Master shsm002

 

192.168.58.104 Slave1 shsm004

 

最後,再輸入 source /etc/profile 刷新。

 

2)修改network文件,輸入命令vi /etc/sysconfig/network

 

NETWORKING=yes
HOSTNAME=對應的主機名稱

 

3)若是主機名稱與設備名稱不符,則按照下列格式修改:

 

127.0.0.1 localhost localhost.localdomain

 

IP地址 主機名稱 設備名稱
最後使用ping命令驗證是否能夠連通。

 

三)修改系統文件(全部節點和主機)

1)修改內核配置

vi /etc/sysctl.conf,添加下面內容:

kernel.shmmax = 5000000000

kernel.shmmni = 4096

kernel.shmall = 4000000000

kernel.sem = 250 512000 100 2048

kernel.sysrq = 1

kernel.core_uses_pid = 1

kernel.msgmnb = 65536

kernel.msgmax = 65536

kernel.msgmni = 2048

net.ipv4.tcp_syncookies = 1

net.ipv4.ip_forward = 0

net.ipv4.conf.default.accept_source_route = 0

net.ipv4.tcp_tw_recycle = 1

net.ipv4.tcp_max_syn_backlog = 4096

net.ipv4.conf.all.arp_filter = 1

net.ipv4.ip_local_port_range = 1025 65535

net.core.netdev_max_backlog = 10000

net.core.rmem_max = 2097152

net.core.wmem_max = 2097152

vm.overcommit_memory = 2

執行命令 sysctl -p 使修改數值生效

2)修改限制配置

vi /etc/security/limits.conf

添加下面內容:

* soft nofile 65536

* hard nofile 65536

* soft nproc 131072

* hard nproc 131072
3)禁用SELINUX

vi /etc/selinux/config,修改SELINUX的值爲disabled。修改後,以下:

# This file controls the state of SELinux on the system.

# SELINUX= can take one of these three values:

# enforcing - SELinux security policy is enforced.

# permissive - SELinux prints warnings instead of enforcing.

# disabled - No SELinux policy is loaded.

SELINUX=disabled

# SELINUXTYPE= can take one of these two values:

# targeted - Targeted processes are protected,

# mls - Multi Level Security protection.

SELINUXTYPE=targeted


三)安裝優化器GPORCA的依賴項(全部節點和主機)

1)安裝cmake(3.10.2)

下載:
$ wget http://www.cmake.org/files/v3.10/cmake-3.10.2.tar.gz
解壓:
$ tar xzf cmake-3.10.2.tar.gz

定位到解壓後的目錄中:
$ cd cmake-3.10.2
關於configure命令:
若是要查看詳細的配置選項,使用下面命令:

$ ./configure --help
執行配置命令(安裝到目錄/usr/cmake):

$ ./configure --prefix=/usr/cmake

編譯:
$ make
安裝:
# make install
最後進行驗證:
$ /usr/cmake/bin/cmake -version

輸出相似下面內容顯示出版本號:
cmake version 3.10.2

編輯修改/etc/profile文件,將cmake添加到環境變量定義中,添加下面內容:

  ### CMAKE 3.10 ###
  export PATH=/usr/cmake/bin:$PATH

2)安裝gp-xerces

使用gpdba帳號解壓源碼文件壓縮包,進入解壓目錄,執行下面命令。
mkdir build
cd build
../configure --prefix=/usr/local  ##安裝到/usr/local目錄下
(注意:若是出錯,則使用root帳號執行下面的make命令)
make
make install

3)安裝re2c(1.0.3)

進入 http://re2c.org/install/install.html 頁面下載本身須要的版本
安裝re2c是因爲配置ninja時須要
$ ./configure --prefix=/usr/local
(注意:使用root帳號執行下面的make命令;若是用戶沒有在root用戶組中時)
$ make
$ make install

4)安裝Ninja

可使用git下載:https://github.com/ninja-build/ninja.git
下載後進入ninja目錄執行以下命令:
./configure.py --bootstrap
因爲最終結果只是一個二進制文件ninja,以後使用root帳號拷貝ninja文件到/usr/bin目錄便可(/usr/bin目錄已經在環境變量PATH中配置定義了)
Installation is not necessary because the only required file is the resulting ninja binary. However, to enable features like Bash completion and Emacs and Vim editing modes, some files in misc/ must be copied to appropriate locations.

特別說明:先在主機上安裝全部依賴項的程序,而後經過scp命令遠程複製安裝包或壓縮包到其餘節點上逐個執行安裝。

四)安裝GPORCA

下載地址:https://github.com/greenplum-db/gporca

安裝GPORCA(GPDB-5.4.1對應的依賴版本,2.53.11)
使用gpdba帳號解壓源碼文件壓縮包,進入解壓目錄,執行下面命令。
cmake -GNinja -H. -Bbuild
ninja install -C build

查看GPDB依賴的ORCA的版本信息:/gpdb-5.4.1/depends/conanfile_orca.txt文件
[requires]
orca/v2.53.11@gpdb/stable

 

安裝完成後,進入/gporca/build目錄,執行ctest命令進行檢查
若是最後輸出相似以下結果:
100% tests passed, 0 tests failed out of 119

Total Test time (real) = 195.48 sec
這說明編譯成功了。

【刪除舊版的GPORCA】
進入源文件目錄下,執行命令
rm -rf build/*
rm -rf /usr/local/include/naucrates
rm -rf /usr/local/include/gpdbcost
rm -rf /usr/local/include/gpopt
rm -rf /usr/local/include/gpos
rm -rf /usr/local/lib/libnaucrates.so*
rm -rf /usr/local/lib/libgpdbcost.so*
rm -rf /usr/local/lib/libgpopt.so*
rm -rf /usr/local/lib/libgpos.so*

 

五)安裝GPDB(選擇版本5.4.1)

1)使用root帳號安裝依賴項

sudo yum install -y epel-release

sudo yum install -y apr-devel bison bzip2-devel cmake3 flex gcc gcc-c++ krb5-devel libcurl-devel libevent-devel libkadm5 libyaml-devel libxml2-devel perl-ExtUtils-Embed python-devel python-paramiko python-pip python-psutil python-setuptools readline-devel xerces-c-devel zlib-devel

# Install lockfile with pip because the yum package `python-pip` is too old (0.8).
sudo pip install lockfile conan

2)下載源代碼文件,解壓後編譯安裝。

使用gpdba帳號進入下載解壓的源文件目錄下,執行命令(prefix後面的路徑/usr/gpdb是安裝目錄)
./configure --with-perl --with-python --with-libxml --with-gssapi --prefix=/usr/gpdb
若是沒有安裝ORCA,則可使用:./configure --with-perl --with-python --with-libxml --with-gssapi --disable-orca --prefix=/usr/gpdb

而後執行make
make -j8

最後執行安裝
make -j8 install

3)分發

首先,建立服務器之間的ssh免密鏈接。

建立目錄/usr/gpdb-conf,在該目錄中建立主機清單文件hostlist,文件內容以下:

  Master

  Salve1

而後繼續在gpdb-conf目錄中建立seg_hosts,文件內容以下:

  Slave1

刷新greenplum_path的配置

source /usr/gpdb/greenplum_path.sh

gpssh交換密鑰

gpssh-exkeys -f /usr/gpdb-conf/hostlist

 

最後,將安裝成功的文件夾壓縮打包

gtar -cvf /home/gpdba/gpdb-install-binary-5.4.1.tar /usr/gpdb

使用gpscp命令複製到其餘節點上(或者先ssh後scp也能夠)

gpscp -f /usr/gpdb-conf/seg_hosts /home/gpdba/gpdb-install-binary-5.4.1.tar =:/usr

使用gpssh鏈接主機與從節點,解壓tar文件,安裝路徑同主機的安裝路徑保持一致。

gpssh -f /usr/gpdb-conf/hostlist

master 節點鏈接 slave 節點以後,執行全部命令都應該有n份輸出才表示正常。

解壓文件

gtar -xvf gpdb-install-binary-5.4.1.tar

建立數據庫工做目錄

cd /home/gpdba/gpdata

mkdir gpdatap1 gpdatap2 gpdatam1 gpdatam2 gpmaster

4)初始化數據庫(在master主機)

配置bash_profile環境變量

vi .bash_profile

修改以下:

# .bash_profile

# Get the aliases and functions
if [ -f ~/.bashrc ]; then
. ~/.bashrc
fi

# User specific environment and startup programs

PATH=$PATH:$HOME/.local/bin:$HOME/bin

export PATH

## Greenplum Database
source /usr/gpdb/greenplum_path.sh
export MASTER_DATA_DIRECTORY=/home/gpdba/gpdata/gpmaster/gpseg-1
export PGPORT=2346
export PGDATABASE=testDB

保存後,刷新生效:

. ~/.bash_profile

配置數據庫的啓動參數

將/usr/gpdb/docs/cli_help/gpconfigs/gpinitsystem_config 文件 複製到 /usr/gpdb-conf 目錄下而後編輯,保留以下內容:

# FILE NAME: gpinitsystem_config

# Configuration file needed by the gpinitsystem

################################################
#### REQUIRED PARAMETERS
################################################

#### Name of this Greenplum system enclosed in quotes.
ARRAY_NAME="Greenplum Data Platform"

#### Naming convention for utility-generated data directories.
SEG_PREFIX=gpseg

#### Base number by which primary segment port numbers
#### are calculated.
PORT_BASE=40000

#### File system location(s) where primary segment data directories
#### will be created. The number of locations in the list dictate
#### the number of primary segments that will get created per
#### physical host (if multiple addresses for a host are listed in
#### the hostfile, the number of segments will be spread evenly across
#### the specified interface addresses).
declare -a DATA_DIRECTORY=(/data1/primary /data1/primary /data1/primary /data2/primary /data2/primary /data2/primary)

#### OS-configured hostname or IP address of the master host.
MASTER_HOSTNAME=mdw

#### File system location where the master data directory
#### will be created.
MASTER_DIRECTORY=/data/master

#### Port number for the master instance.
MASTER_PORT=5432

#### Shell utility used to connect to remote hosts.
TRUSTED_SHELL=ssh

#### Maximum log file segments between automatic WAL checkpoints.
CHECK_POINT_SEGMENTS=8

#### Default server-side character set encoding.
ENCODING=UNICODE

################################################
#### OPTIONAL MIRROR PARAMETERS
################################################

#### Base number by which mirror segment port numbers
#### are calculated.
#MIRROR_PORT_BASE=50000

#### Base number by which primary file replication port
#### numbers are calculated.
#REPLICATION_PORT_BASE=41000

#### Base number by which mirror file replication port
#### numbers are calculated.
#MIRROR_REPLICATION_PORT_BASE=51000

#### File system location(s) where mirror segment data directories
#### will be created. The number of mirror locations must equal the
#### number of primary locations as specified in the
#### DATA_DIRECTORY parameter.
#declare -a MIRROR_DATA_DIRECTORY=(/data1/mirror /data1/mirror /data1/mirror /data2/mirror /data2/mirror /data2/mirror)


################################################
#### OTHER OPTIONAL PARAMETERS
################################################

#### Create a database of this name after initialization.
#DATABASE_NAME=name_of_database

#### Specify the location of the host address file here instead of
#### with the the -h option of gpinitsystem.
#MACHINE_LIST_FILE=/home/gpadmin/gpconfigs/hostfile_gpinitsystem

最後,執行命令開始初始化:

gpinitsystem -c /usr/gpdb-conf/gpinitsystem_config -a

 

特別說明:若是初始化執行失敗以後,再次執行初始化,則須要先執行下面命令進行環境重置:

查詢並關閉配置指定端口的postgres進程

刪除生成的未完成的數據庫文件(多是全部節點服務器),/home/gpdba/gpdata/gpmaster/gpseg-1文件夾。

六)錯誤解決

錯誤:
[gpdba@shsm002 ~]$ gpssh-exkeys -f /usr/gpdb-conf/hostlist
Error: unable to import module: version conflict: '/usr/lib64/python2.7/site-packages/psutil/_psutil_linux.so' C extension module was built for another version of psutil (different than 2.2.1)
解決:從新安裝psutil。sudo pip install psutil==2.2.1


錯誤:
20180129:23:40:43:gpinitsystem:shsm002:gpdba-[FATAL]:-Found indication of postmaster process on port 2345 on Master host Script Exiting!
解決:關閉殺死佔用端口2345的進程。
先查詢進程
$ lsof -i:2345

COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME

postgres 10738 gpadmin 3u IPv4 264510 0t0 TCP *:postgres (LISTEN)
postgres 10738 gpadmin 4u IPv6 264511 0t0 TCP *:postgres (LISTEN)
而後殺死進程
$ kill -9 10738


錯誤:
20180207:00:14:09:005166 gpinitsystem:shsm002:gpdba-[INFO]:-Building the Master instance database, please wait...
20180207:00:14:17:005166 gpinitsystem:shsm002:gpdba-[INFO]:-Starting the Master in admin mode
20180207:00:14:23:gpinitsystem:shsm002:gpdba-[FATAL]:-Unknown host shsm004 Script Exiting!
20180207:00:14:23:005166 gpinitsystem:shsm002:gpdba-[WARN]:-Script has left Greenplum Database in an incomplete state
緣由:hostname與用戶帳號的@後面的主機名稱不一致,hosts定義中也沒有shsm004,添加進去便可。
解決:修改hosts文件,每行記錄爲:IP地址 主機名 域名。將hostname數值shsm004放到域名字段保存便可。使用ping命令能夠ping通。

 

錯誤:
20180207:00:05:00:003516 gpinitsystem:shsm002:gpdba-[INFO]:-Checking Master host
20180207:00:05:00:003516 gpinitsystem:shsm002:gpdba-[WARN]:-Have lock file /tmp/.s.PGSQL.2346.lock but no process running on port 2346
20180207:00:05:00:gpinitsystem:shsm002:gpdba-[FATAL]:-Found indication of postmaster process on port 2346 on Master host Script Exiting!
解決:刪除文件,rm /tmp/.s.PGSQL.2346.lock。


錯誤:
[gpdba@shsm002 ~]$ /bin/bash /home/gpdba/gpAdminLogs/backout_gpinitsystem_gpdba_20180207_225128
[FATAL]:-Not on original master host Master, backout script exiting!
解決:不使用這個腳本清理中間數據,直接刪除gpdata目錄下的未完成的數據庫文件便可。

 

錯誤:
20180207:23:39:31:028691 gpcreateseg.sh:shsm002:gpdba-[INFO][1]:-Start Function PROCESS_QE
20180207:23:39:31:028691 gpcreateseg.sh:shsm002:gpdba-[INFO][1]:-Processing segment Slave1
/usr/gpdb/bin/postgres: error while loading shared libraries: libgpopt.so.3: cannot open shared object file: No such file or directory
no data was returned by command ""/usr/gpdb/bin/postgres" -V"
The program "postgres" is needed by initdb but was either not found in the same directory as "/usr/gpdb/bin/initdb" or failed unexpectedly.
Check your installation; "postgres -V" may have more information.
/usr/gpdb/bin/postgres: error while loading shared libraries: libgpopt.so.3: cannot open shared object file: No such file or directory
no data was returned by command ""/usr/gpdb/bin/postgres" -V"
The program "postgres" is needed by initdb but was either not found in the same directory as "/usr/gpdb/bin/initdb" or failed unexpectedly.
Check your installation; "postgres -V" may have more information.
cat: /home/gpdba/gpdata/gpdatap1/gpseg0.initdb: No such file or directory
cat: /home/gpdba/gpdata/gpdatap2/gpseg1.initdb: No such file or directory
解決:修改/usr/gpdb/greenplum_path.sh文件,添加libgpopt.so.3文件所在目錄到環境變量LD_LIBRARY_PATH定義中,而後執行source命令刷新(在重啓電腦以前,可能每次打開終端命令行時都須要手動刷新一下)。修改後的文件內容以下:

GPHOME=/usr/gpdb

# Replace with symlink path if it is present and correct
if [ -h ${GPHOME}/../greenplum-db ]; then
GPHOME_BY_SYMLINK=`(cd ${GPHOME}/../greenplum-db/ && pwd -P)`
if [ x"${GPHOME_BY_SYMLINK}" = x"${GPHOME}" ]; then
GPHOME=`(cd ${GPHOME}/../greenplum-db/ && pwd -L)`/.
fi
unset GPHOME_BY_SYMLINK
fi
#setup PYTHONHOME
if [ -x $GPHOME/ext/python/bin/python ]; then
PYTHONHOME="$GPHOME/ext/python"
fi
PYTHONPATH=$GPHOME/lib/python
PATH=$GPHOME/bin:$PYTHONHOME/bin:$PATH
LD_LIBRARY_PATH=$GPHOME/lib:/usr/local/lib:${LD_LIBRARY_PATH-}
export LD_LIBRARY_PATH
OPENSSL_CONF=$GPHOME/etc/openssl.cnf
export GPHOME
export PATH
export PYTHONPATH
export PYTHONHOME
export OPENSSL_CONF

錯誤:20180208:01:57:59:012804 gpinitsystem:shsm002:gpdba-[INFO]:-Start Function CREATE_DATABASEpsql: FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1513)20180208:01:58:00:012804 gpinitsystem:shsm002:gpdba-[INFO]:-Start Function ERROR_CHK20180208:01:58:00:012804 gpinitsystem:shsm002:gpdba-[INFO]:-End Function ERROR_CHK20180208:01:58:00:012804 gpinitsystem:shsm002:gpdba-[INFO]:-Start Function ERROR_EXIT20180208:01:58:00:gpinitsystem:shsm002:gpdba-[FATAL]:-Failed to complete create database testDB Script Exiting!解決:關閉並禁用防火牆(全部的防火牆程序)運行命令:# systemctl stop firewalld# systemctl mask firewalld# systemctl stop iptables# systemctl disable iptables另外一種方法供參考:shared_buffers設置太大,對於如何根據本身內存和segment節點個數分配shared_buffers,能夠去官網找一下,一般出去2g的other,以及statement_mem * segment 個數,剩下的除以segment的個數便可。這種狀況一般出現中安裝過程當中就設置了shared_buffers,通常默認的125MB。

相關文章
相關標籤/搜索