Skip to content

Near world record speed matrix-matrix multiply C software for distributed systems

Notifications You must be signed in to change notification settings

scottenriquez/parallel-matrix-multiply

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

parallel-matrix-multiply

This is the final project for my undergraduate high-performance computing class. The objective was to achieve near world record speeds for matrix-matrix multiplications using the OpenBLAS library, MPI, and base code provided by the brilliant Dr. Robert van de Geijn run on one of the world's largest supercomputing clusters, Stampede (TACC).

The plots folder contains visual representations of our results generated using MATLAB. The results folder contains all of our raw .out files from the jobs run on Stampede. The Stampede folder contains all of the C implementations and MATLAB code used to generate the graphs. There are two implementations of the matrix-matrix multiply and five implementations of the rank-k update. These different implementations can be specified in the makefile. To run the code, simply use the run command specified in the makefile. Be sure that the path to your OpenBLAS installation is correct and that you've updated the hostfile to match your host names.

About

Near world record speed matrix-matrix multiply C software for distributed systems

Resources

Stars

Watchers

Forks

Packages

No packages published