GitHub - mahi29/MatMul-Optimization: Optimized the speed of naive matrix multiplication algorithm to show a 50-fold increase

README

Given a naive implementation of matrix multiplication, optimized and increased performance of 50 fold (1 GFlop/s to 50 GFlop/s). This increase was gained by using SSE Instructions along with register and cache blocking. Also used OpenMP to implement parallelization, along with loop reordering, to further increase speed.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Makefile		Makefile
README.md		README.md
benchmark.c		benchmark.c
sgemm-openmp.c		sgemm-openmp.c
tester.c		tester.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Makefile

Makefile

README.md

README.md

benchmark.c

benchmark.c

sgemm-openmp.c

sgemm-openmp.c

tester.c

tester.c

Repository files navigation

About

Releases

Packages

Languages

mahi29/MatMul-Optimization

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages