LLC miss #2018

KKwanhee · 2023-06-27T12:23:09Z

KKwanhee
Jun 27, 2023

I found some model data which is 'not aligned' (maybe not aligned to ggml tensor alignment) and it can't use mmap so loads every model data on main memory.

My question is, compared to 'aligned' model data which use mmap, runing with 'not aligned' model data occurs around 3x LLC miss and why is that happening? I checked the LLC miss with Intel VTune and most LLC miss occured at 'ggml_vec_dit_f16' from 'ggml_compute_forward_mul_mat'.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLC miss #2018

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

LLC miss #2018

KKwanhee Jun 27, 2023

Replies: 0 comments

KKwanhee
Jun 27, 2023