Difference: BigClusterNodeInstructions (4 vs. 5)

Revision 5 - 2006-04-11 - TWikiGuest

Line: 1 to 1
 
META TOPICPARENT name="BigClusterProject20060411"

Big Cluster Node Instructions

If you are the DHCP master, see BigClusterDHCPInstructions.
Line: 21 to 21
 
  • Follow directions for trivial-net-setup.
    • Say YES/OK (Enter to accept) to everything.
    • When it tells you what your IP address is, write the last bit down in big letters (easy to read from six feet away). So if it tells you that you are 198.162.1.105, write down 105 on the card.
Changed:
<
<
    • Write your name (legibly!) on the card. If you are not the owner, write the owner's name down also. First name is probably enough.
>
>
    • Write your name (legibly!) on the index card. If you are not the owner, write the owner's name down also. First name is probably enough.
 
  • When you get to the option of logging in:
    • Log in as root, using the password listed at the login prompt.
Line: 31 to 31
 
    • exit
  • Sign in as bccd, with the password given earlier.
  • Answer yes when it asks if you want to run a heartbeat.
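For the index-card step earlier (writing down the last part of your IP address), the number to write is simply whatever follows the final dot. A minimal sketch, assuming a POSIX shell; `last_octet` is a hypothetical helper name, not one of the BCCD tools:

```shell
# last_octet is a hypothetical helper, not part of the BCCD tools.
last_octet() {
    # Strip everything up to and including the final dot.
    printf '%s\n' "${1##*.}"
}

# Using the example address from the instructions:
last_octet 198.162.1.105   # prints 105
```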
Deleted:
<
<

Switch to LAM

Everybody continue on to switching to LAM:
  • edit ~/.bashrc
    • edit the PATH line so that the line reads export PATH=/lam-mpi/bin:$PATH
    • write file and quit
  • source ~/.bashrc (or log out and log back in)
  • For each node, rebuild the library cache:
    • su - root (using the root password given)
    • ldconfig -v | less
    • exit (back to bccd)
 
  • bccd-allowall (Answer yes.)
  • bccd-snarfhosts
Changed:
<
<
  • recon -v ~/machines
  • lamboot -v ~/machines
>
>
  Windows users have only one option:
  • startx
Line: 54 to 42
 

Compile the target code

Everybody needs to compile the target code.
Changed:
<
<
  • The DHCP server needs to push out the demo code.
>
>
  • Ducky needs to push out the demo code.
 
  • The code will show up in a directory named something like /tmp/6g2w98s.
Changed:
<
<
  • cp -r dirname/name.arch ~bccd/cs521 (where name is the name of the code we're going to run, dirname is the tmp dir name, and arch is either x86 or ppc; e.g. cp -r /tmp/6g2w98s/findPi.x86 ~bccd/cs521)
  • cd ~bccd/cs521
>
>
  • cp -r dirname/povray31 ~bccd/ (where dirname is something like /tmp/6g2w98s)

For x86 machines, the code should just run. PPC users (and those who feel inclined to do more) recompile by doing:

  • cd ~bccd/povray31/source/libpng
  • make -f makefile.lnx clean
  • make -f makefile.lnx
  • cd ~bccd/povray31/source/mpi-unix
  • Change the makefile to use the Generic Linux CFLAGS: comment out line 71 and uncomment line 65.
  • make clean
 
  • make
Added:
>
>
  • cd ~bccd/povray31
 
  • When you are done with all your setup, stick the index card in your keyboard (so that we can easily find a node if we need to).
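The x86-versus-PPC decision in the compile steps could be scripted roughly as follows. This is only a sketch: `needs_rebuild` is a hypothetical helper, and the machine names it matches are assumed values of `uname -m`, which the instructions themselves do not specify.

```shell
# needs_rebuild is a hypothetical helper; the patterns below are assumed
# outputs of `uname -m` on PPC machines.
needs_rebuild() {
    case "$1" in
        ppc*|powerpc*) return 0 ;;   # PPC: recompile as described in the steps
        *)             return 1 ;;   # anything else: assume the x86 build runs
    esac
}

if needs_rebuild "$(uname -m)"; then
    echo "PPC node: rebuild libpng and mpi-unix before running"
else
    echo "non-PPC node: the pushed-out build should run as-is"
fi
```

The check only decides whether to rebuild; the rebuild itself is still the manual make sequence listed in the steps.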
Added:
>
>
Other code that we might run:
  • hello world
    • cd ~bccd
    • mpicc -o hello hello_*.c (note the underscore!)
    • mpirun -np 8 -machinefile ~/machines hello
  • various code in ~bccd/lam-mpi. Note that it requires a fair amount of gymnastics to switch to LAM.
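The hello world run hard-codes 8 processes. If you would rather start one process per listed machine, something like the sketch below might work. It assumes the machine file has one host per line and that blank lines and lines starting with # can be skipped; `count_hosts` is a hypothetical helper name.

```shell
# count_hosts is a hypothetical helper. It assumes one host per line and
# ignores blank lines and lines that start with #.
count_hosts() {
    grep -v -e '^[[:space:]]*$' -e '^[[:space:]]*#' "$1" | wc -l | tr -d ' '
}

# Example usage on a cluster node (not run here):
#   mpirun -np "$(count_hosts ~/machines)" -machinefile ~/machines hello
```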


To switch to LAM

To switch to LAM, you need to do this:

  • edit ~/.bashrc
    • edit the PATH line so that the line reads export PATH=/lam-mpi/bin:$PATH
    • write file and quit
  • source ~/.bashrc (or log out and log back in)
  • For each node, rebuild the library cache:
    • su - root (using the root password given)
    • ldconfig -v | less
    • exit (back to bccd)
  • bccd-allowall (Answer yes.)
  • bccd-snarfhosts
  • recon -v ~/machines
  • lamboot -v ~/machines
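After editing ~/.bashrc and sourcing it, a quick way to confirm the PATH change took effect is to check that /lam-mpi/bin is the first entry. A minimal sketch, assuming a POSIX shell; `lam_first_on_path` is a hypothetical helper name:

```shell
# lam_first_on_path is a hypothetical helper; /lam-mpi/bin is the directory
# named in the steps above.
lam_first_on_path() {
    case "$1" in
        /lam-mpi/bin|/lam-mpi/bin:*) return 0 ;;
        *) return 1 ;;
    esac
}

if lam_first_on_path "$PATH"; then
    echo "LAM binaries come first on PATH"
else
    echo "PATH does not start with /lam-mpi/bin; re-check ~/.bashrc"
fi
```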
 

As more machines come online, you might need to refresh your system state:

 