Skip to content

Commit

Permalink
A bit more content
Browse files Browse the repository at this point in the history
  • Loading branch information
jussienko committed May 4, 2024
1 parent 5513237 commit f6378b7
Showing 1 changed file with 6 additions and 4 deletions.
10 changes: 6 additions & 4 deletions docs/03-main-memory.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,10 +37,12 @@ lang: en

# Memory speed - practice

- Latency
- Typical memory accees latency: ~80 ns
- Concurrency = latency x throughput
- Single core performance limitations
- Bus contention when many cores access the memory
- In order to achieve a bandwidth of 100 GB/s, over 100 memory requests need to be concurrently on the fly (single request = cache line = 64B)
- Single core has can sustain typically only ~10 concurrent memory requests, so single
thread can achieve only a farction of peak performance
- When many cores access the memory, there is bus contention
- Highest memory bandwidth is typically achieved with some cores idle

# Non-uniform memory access
Expand All @@ -49,7 +51,7 @@ lang: en
- A node can have multiple sockets with memory attached to each socket
- Non Uniform Memory Access (NUMA)
- All memory within a node is accessible, but latencies and bandwidths vary
- Hardware needs to maintain cahce coherency also between different NUMA nodes (ccNUMA)
- Hardware needs to maintain cache coherency also between different NUMA nodes (ccNUMA)
</div>

<div class=column>
Expand Down

0 comments on commit f6378b7

Please sign in to comment.