![]() |
login
webmail
|
||
![]() |
Installation | ||
|
... |
|||
|
|
|||
DCC/Debian cluster installation HOWTONikola PavkovicValentin Vidic$Id: install.sgml,v 1.19 2005/02/14 10:53:32 vvidic Exp $Copyright © 2004 Nikola Pavkovic $Date: 2005/02/14 10:53:32 $
1. HardwareA Linux cluster consists of a number of computers, networked over a (preferably high speed) switch. One of those nodes has a second Ethernet controller, which is connected to the outside network. That 'public' node is called the front-node. The other nodes in the internal network are called work-nodes. The following figure shows a typical cluster network topology: ------------- eth0
| work-node 1 |-------------
------------- 10.0.0.2 |
|
------------- eth0 |
| work-node 2 |------------ | ________
------------- 10.0.0.3 || ( )
.... || eth1 ------------ eth0 ( outside )
.... || --| front-node |-------( network )
------------- eth0 || | ------------ x.x.x.x ( )
| work-node n |----------- || | 10.0.0.1 (_______)
------------- 10.y.y.y ||| |
-------------
| eth. switch |
-------------In order to build a simple cluster you will need:
2. PrerequisitesBefore you start building your new DCC/Debian cluster, make sure that you have/know the following:
3. InstallationNow that you have all the required hardware/software/knowledge, we are ready to proceed with the cluster installation procedure. 3.2. Front-node installationYour first step is to install Debian Sarge/testing on your front-node (the one with two network cards). Please refer to official Debian installer homepage , and download the installation CD. Check Debian installation manual for details.
3.2.1. Network setupNow you have a working Debian Sarge/testing installation on the front-node. In order to install DCC components on the front-node, two network interfaces have to be configured properly, one of which is connected to the outside network, and the other connected to the internal switch to which all the other cluster nodes are connected too. A typical network configuration might look like this (file /etc/network/interfaces):
auto eth0 eth1
iface eth0 inet static
address 161.53.2.15
netmask 255.255.255.0
broadcast 161.53.2.255
iface eth1 inet static
address 10.0.0.1
netmask 255.255.255.0
broadcast 10.0.0.255
Don't forget to check/set the hostname in /etc/hostname file:
EXTERNALNAME
The corresponding /etc/hosts file should look like: 10.0.0.1 node0.cluster node0
161.53.2.15 EXTERNALNAME.DOMAIN EXTERNALNAME
127.0.0.1 localhost
The EXTERNALNAME, DOMAIN, and the corresponding IP address are to be changed to whatever
your settings for accessing the public network are. Restart network if necessary for the changes
to take effect.
3.2.2. Check network setupCheck ifconfig, you should have eth0 and eth1 configured. Also, check the following (all should work): # getent hosts $EXT_IP # getent hosts 10.0.0.1 # getent hosts $EXT_NAME # getent hosts node0 # hostname -fOf course, use the external hostname and IP address instead of $EXT_* . 3.2.3. APT setupAs you have the network properly configured, you are ready to pull the DCC packages from our APT repository. In order to do so, add following line on top of your /etc/apt/sources.list file: deb http://ftp.irb.hr/pub/irb/dcc ./Update local APT database: # apt-get updateBefore proceeding with DCC installation, it is preferred to upgrade all packages to the newest version: # apt-get -u upgrade 3.2.4. DCC installationNow, you are ready to install the DCC packages. This is done in two steps: # apt-get install debconf-dcc
Answer possible questions asked by debconf, and as debconf-dcc installation finishes, install the dcc-front
meta-package:
# apt-get install dcc-front
3.3. Building the imageIn order to install the operating system on the work-nodes, we must understand how the autoinstallation procedure works. DCC comes with System Installation Suite, and as such, is an image-based installation model. The image needs to be built before it can be deployed on the work nodes. To get the whole picture how the SIS autoinstallation system works, please refer to the SIS project documentation. The image that is to be deployed on the work-nodes has to be built on the front-node. It is done following this simple procedure:
3.4. Deploying the image on work nodesNow that the work-node image is created, the images have to be deployed on the work-nodes. This is done in a very easy way. If your work-nodes' BIOS supports booting over the network(PXE), choose this option as the primary boot method. On the front-node, run: # dcc_discovernode...and turn on your work-node(s). As the DHCP requests come from the work-nodes, their MAC addresses are extracted, a new IP address is assigned to that MAC, and that information is put into the SIS database. However, if your work-nodes' BIOS does not support netboot, create a boot diskette: # mkautoinstalldiskette... boot from the diskette or CD, and proceed as described above. When the installation finishes, the work-node is rebooted. As it comes up again, it is ready to do the real work. Do the same with all the work nodes that you wish to install.
4. Postinstall notesRun the following two commands on the front-node after all the worker-nodes have been installed: # cpushimage image_name # /etc/init.d/torque-server restart-quick # cexec /etc/init.d/torque-mom restart-quickFor a detailed explanation why this is required, read the DCC FAQ. Check out Ganglia cluster-monitor web interface at http://your.cluster.fqdn/ganglia ! | |||||||||||||||||||||||||
|
© 2003-2009 Ruđer Bošković Institute || last changed: 03/31/2006 03:51 pm (Valentin Vidić) Optimized for: Internet Explorer and Mozilla Firefox. | |||||||||||||||||||||||||