Sample matrix multiply code to show affect of blocking and data alignment The code mm.c accompanies two papers at software.intel.com that discuss memory layout and performance. A simple matrix multiply is reordered and blocked to show performance improvement An exercise is included to show the impact on performance when matrices are not aligned on cacheline boundaries.
forked from drmackay/samplematrixcode
wlxiong/samplematrixcode
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Sample matrix multiply code to show affect of blocking and data alignment
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- C 100.0%