Skip to content

Ensemble model with shared memory #5418

Closed Answered by dyastremsky
NikeNano asked this question in Q&A
Discussion options

You must be logged in to vote

If you're asking whether ensemble shares memory between models, the ensemble scheduler passes pointers to the tensors between models to avoid copies. However, there is a copy at the end of the ensemble for the final output.

The backends may also make copies during execution (unrelated to the ensemble). This could be due to the dynamic batcher (if enabled, copies during gathering/scattering of inputs and outputs), pinned memory manager (if used to improve performance), and models moving tensors between host and device memory. There are also other backend-specific situations, like moving data between models pertaining to different backends could introduce copies.

CC: @GuanLuo @Tabrizian

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@NikeNano
Comment options

@dyastremsky
Comment options

@NikeNano
Comment options

Answer selected by Tabrizian
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #5416 on February 25, 2023 01:07.