Steps of adding a new grid node
来自cslt Wiki
using a backup image
- restore image when booting from USB
- post installation settings
- do a full system update (yum update -y)
- reset root password
- change /etc/fstab, make directory /work{,0,1,2}
- install sge under /opt/sge
- set cell name to 'redhat' when run /opt/sge/install_execd
- rm -f /etc/profile.d/sge.{,c}sh && cp /opt/sge/redhat/common/settings.{,c}sh /etc/profile.d
- yum install -y ntp && ntpdate pool.ntp.org && hwclock -w
- ln -s /nfs/disk/perm/sys/env/cslt.sh /etc/profile.d (grid without GPU) or ln -s /nfs/disk/perm/sys/env/cslt_cuda.sh /etc/profile.d (grid with GPU)
generic steps
1. Install Linux box grid-X,
* Ensure the following are there: perl, python, g++ for c/c++/fortran, Tcl. * mount nfs service from grid-0 to /nfs/disk * define the node name as grid-X and update /etc/hosts for the nodes to reflect the new node. * copy /etc/bash.bashrc from grid-0 to the new node
2. Install NIS service, propagate the NIS account from grid-0
3. Install SGE, accepted by grid-0
4. Install additional tools
* matlab client * ATALAS and CLAPACK
5. Done
some tips:
1. statistic IPs are used right now. For that configure /etc/conf.d/netcfg. Check /etc/network.d/examples to create a profile. systemctl enable netcfg is needed to enable the service. DHCP should be disabled, i.e., systemctl disable dhcp@eth0.
2. in the old nodes, grid-0 and gird-1, the netcfg is enabled by replacing network by net-profiles in DAEMON of /etc/rc.conf.
3. Note that to let the cluster know the new node, should put the name in /etc/hosts of grid-0, and restart the dns server in grid-0: systemctl restart dnsmasq