Welcome
Participating Sites
Teaching Assistants
Group Discussion Etiquette
Software Requirements
Getting Started on TeraGrid
Course Schedule
Presentation Materials for Week
  + Keynote
  + Introduction to HPC Systems
  + Hybrid MPI Programming
  + Multi-core Programming
  + Totalview Debugging Techniques
  + Parallel I/O
  + Experience from the Field
  + Eclipse
  + DDT Debugging Techniques
  + Numerical Libraries
  + Performance and Code Profiling
  + Visualization
    > Overview and Introduction to Scientific Visualization
    > Parallel Visualization, Data Formatting, Software Overview
    > Hands-on Tutorial: VisIt
    > Hands-on Tutorial: ParaView
    > Sample Datasets
Biographies for Presenters
General Exercises
  + Jacobi Exercise 1
  + Jacobi Exercise 2
  + Jacobi Exercise 3
  + Jacobi Exercise 4
  + Jacobi Exercise 5
Molecular Dynamics Background
  + Molecular Dynamics Exercise 1
  + Molecular Dynamics Exercise 2
  + Molecular Dynamics Exercise 3
  + Molecular Dynamics Exercise 4
  + Molecular Dynamics Exercise 5
Access to Other Training Resources

Jacobi Exercise 1

Exercise 1: Starting Out

Objectives

Getting familiar with the high-performance computing platform you will be using for the workshop.
Getting familiar with the Jacobi iteration algorithm used in all of these exercises.

You can move on when?

You have successfully compiled, submitted, and competed a run with the Jacobi program and completed a plot of the scaling of the algorithm with respect to matrix dimension.

Description

In Exercise 1, you will become familiar with the serial version of the algorithm described in Background section. A reference implementation will be provided, with your task to examine and make sure you understand it, compile it on your HPC architecture, and then submit several runs of differing matrix sizes to view the performance characteristics of the code and the processors in your machine.

The program can be downloaded at:

Since the code is one straight file, compilation is trivial:

C/C++

For Kraken: CC jacobi.cpp -o jacobi
For Ranger: pgCC jacobi.cpp -o jacobi
For Bluefire: xlC jacobi.cpp -o jacobi

FORTRAN

For Kraken: ftn jacobi.F -o jacobi
For Ranger: pgf90 jacobi.F -o jacobi
For Bluefile: xlF jacobi.F -o jacobi

For further help on compiling codes on these HPC architectures:

The program has the following syntax:

jacobi <Dimension> <NumIteration> <RowPeek> <ColPeek>

Dimension - The size of one side of the square matrix

NumIterations - The number of fixed iterations

RowPeek, ColPeek - Specify the x,y coordinates on the grid of an
element to be printed at the end, used to check correctness.

For example: sbrown@kraken-pwd4(XT5): ./jacobi 128 100 5 5
Time Iterations = 0.005294 seconds
Result SurfaceMatrix[5][5]=2.02957

Because of indexing the FORTRAN version of the code produces a different answer to the same command line, the answer will be 1.9977370057.

Instructions

Download the serial version of the code in your language of choice.
Spend some time looking over the code, if there is something you don't understand, please ask an instructor to help.
Compile the code with optimization level -O3.
Test the code on a very small matrix (e.g. the inputs 10 100 3 3 should give 22.622).
Submit the following matrix sizes for 100 iterations to the queue: 128, 256, 512, 1024, 4096.
Make a plot of matrix dimension vs. time reported to determine the scaling of the algorithm.

Questions to Ponder...

The scaling of your algorithm with matrix size should be relatively straight forward to determine, are the results what you expected. If not, can you think why?
What may limit the size of system you can do with this serial algorithm.

Extra Credit

Are there any compiler flags beyond -O3 that enhance the serial performance of the code?
Are there any programmatic enhancements that could be made to improve performance?
One could analyze this algorithm with in-depth performance tools to understand why it performance at certain sizes.

Hints

The queue submission script for this exercise should be fairly similar to the one you used for the example hello_world at the beginning of the workshop.