Comparison of parallel matrix multiplication methods using OpenMP, focusing on cache efficiency, runtime, and performance analysis with Intel VTune.

yigitbektasgursoy/openmp-matrix-optimization


Matrix Multiplication Optimization Project

A compact yet powerful demonstration of matrix multiplication optimizations using cache blocking, memory alignment, loop unrolling, and multi-threading (OpenMP).

Highlights

  • Naive vs. Optimized
    Compare a simple triple-nested loop (matmul_naive.c) against optimized approaches (cache-blocked, aligned, unrolled).

  • Multi-threading
    All methods support OpenMP for parallel execution and improved CPU utilization.

  • Analysis
    Profiling with Intel VTune plus custom scripts yields metrics on:

    • Execution Time
    • Speedup
    • L1/LLC Cache Miss Rates
    • CPU Utilization
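
To make the baseline concrete, here is a hedged sketch of what a kernel like matmul_naive.c typically looks like with the OpenMP parallelization the highlights describe. The function name, signature, and row-major layout are assumptions for illustration; the repository's actual code may differ:

```c
#include <stddef.h>

/* Hypothetical sketch of a naive kernel in the spirit of matmul_naive.c:
 * C = A * B for n x n row-major matrices. The outer loop is split across
 * OpenMP threads; each thread computes a disjoint band of rows of C. */
void matmul_naive(const double *A, const double *B, double *C, size_t n)
{
    #pragma omp parallel for
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            /* Innermost loop strides down a column of B, which is the
             * cache-unfriendly access pattern the optimized variants fix. */
            for (size_t k = 0; k < n; k++)
                sum += A[i * n + k] * B[k * n + j];
            C[i * n + j] = sum;
        }
    }
}
```

The column-wise walk over B is what drives the high L1/LLC miss rates measured for the naive version.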

Directory Overview

  • src/

    • Core implementations (matmul_naive.c, matmul_blocked.c, etc.)
    • test_matmul.c for validation and performance checks
  • logs/

    • Recorded performance data (cache miss rates, CPU usage)
  • graphs/

    • Plots illustrating key performance metrics (shown below)
  • scripts/

    • Automation and visualization scripts (e.g., cache_analysis_draw.py, compare_threading.py)
  • report/

    • Methodology, results, and conclusions in a concise PDF/Markdown document
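
For orientation, a hedged sketch of the cache-blocking idea behind a file like matmul_blocked.c: the three loops are tiled so each BLOCK x BLOCK working set of A, B, and C stays cache-resident. The tile size, loop order, and names are assumptions; the repository's version additionally applies memory alignment and loop unrolling, which are omitted here:

```c
#include <stddef.h>

#define BLOCK 64  /* assumed tile edge; the real code may tune this per cache level */

/* Hypothetical cache-blocked kernel: iterate over tiles, then accumulate
 * inside each tile with A's element hoisted into a register. Parallelizing
 * the ii loop keeps each thread's writes to C on disjoint rows (no races). */
void matmul_blocked(const double *A, const double *B, double *C, size_t n)
{
    for (size_t i = 0; i < n * n; i++)
        C[i] = 0.0;

    #pragma omp parallel for
    for (size_t ii = 0; ii < n; ii += BLOCK)
        for (size_t kk = 0; kk < n; kk += BLOCK)
            for (size_t jj = 0; jj < n; jj += BLOCK) {
                size_t imax = ii + BLOCK < n ? ii + BLOCK : n;
                size_t kmax = kk + BLOCK < n ? kk + BLOCK : n;
                size_t jmax = jj + BLOCK < n ? jj + BLOCK : n;
                for (size_t i = ii; i < imax; i++)
                    for (size_t k = kk; k < kmax; k++) {
                        double a = A[i * n + k];
                        /* Unit-stride sweep over a row of B and C: this is
                         * the access pattern that cuts the miss rates. */
                        for (size_t j = jj; j < jmax; j++)
                            C[i * n + j] += a * B[k * n + j];
                    }
            }
}
```

Swapping the j and k loops inside the tile (the ikj order above) is what turns B's column walk into a unit-stride row walk.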

Detailed Graphs

All of the generated PNG plots live in the graphs/ folder, separated by metric and matrix size. Each metric below is plotted for matrix sizes 1024, 2048, and 4096:

1) CPU Utilization

2) Execution Time

3) L1-dcache Miss Percentage

4) LLC-load Miss Percentage

5) Speedup

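The speedup plotted above is the usual ratio of serial to parallel wall-clock time; dividing again by the thread count gives parallel efficiency. A tiny helper (hypothetical, not taken from the repository) makes the definitions concrete:

```c
/* speedup    = T_serial / T_parallel
 * efficiency = speedup / thread count (1.0 means perfect scaling) */
double speedup(double t_serial, double t_parallel)
{
    return t_serial / t_parallel;
}

double efficiency(double t_serial, double t_parallel, int threads)
{
    return speedup(t_serial, t_parallel) / threads;
}
```

For example, a serial run of 8 s that drops to 2 s on 8 threads gives a speedup of 4x at 50% efficiency.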

Conclusion

By integrating cache blocking, memory alignment, loop unrolling, and multi-threading, we significantly reduce cache misses and boost CPU utilization. Check out the logs for raw data, graphs for visual insights, and the report folder for a comprehensive discussion of these results.
