Big Cluster Node Instructions
If you are the DHCP master, see
BigClusterDHCPInstructions.
- Follow directions for trivial-net-setup. Hit Enter to select the highlighted answer and the arrow keys to change the selection.
On the Mac, it will boot you straight through the boot sequence. On x86 machines:
- Boot up to the BCCD splash screen.
- Hit Enter
Everybody (x86 and PPC both) then:
- Enter the password we give you on the day of the event.
- Follow directions for trivial-net-setup.
- Say YES/OK (Enter to accept) to everything.
- When you get to the option of logging in:
- Login as root, using the password listed at the login prompt
- Change the password. If you are helping the owner, let the owner set the password.
- df to get a list of the mounted partitions
- umount partition for all of your local drive partitions, (e.g. umount /mnt/rw/discs/disc0/part3/home/fred) Macs don't seem to mount any of your local drives.
- exit
- Sign in as bccd, with the password given earlier.
- Answer yes when it asks if you want to run a heartbeat.
Everybody continue on to switching to LAM:
- edit ~/.bashrc
- edit the PATH line so that the line reads export PATH=/lam-mpi/bin:$PATH
- write file and quit
- source ~/.bashrc (or log out and log back in)
- For each node, rebuild the library cache:
- su - root (using the root password given)
- ldconfig -v | less
- exit (back to bccd)
- bccd-allowall (Answer yes.)
- bccd-snarfhosts
- recon -v ~/machines
- lamboot -v ~/machines
Windows users only have the option of
but Mac users, your trackpad might not work; you might get stuck and hosed. Don't startx.
Everybody needs to compile the target code.
- The DHCP server needs to push out the demo code.
- The code will show up in a directory named something like /tmp/6g2w98s.
- cp -r /name.arch ~bccd/cs521 (where name is name of the code we're going to run, dirname is the tmp dir name and arch is either x86 or ppc, e.g. cp -r /tmp/6g2w98s/findPi.x86 ~bccd/cs521)
- cd ~bccd/cs521
- make
As more machines come online, you might need to refresh your system state:
- bccd-allowall
- bccd-snarfhosts
- recon -v ~/machines
There is some order dependency that Ducky hasn't quite figured out yet; keep doing those and eventually it will all get settled out.
Mac users, if you have trouble rebooting into OS X immediately after booting into BCCD, try holding Control-Command-Power after an non-starting boot.