-
Notifications
You must be signed in to change notification settings - Fork 58
Implement abstract per-GPU cache helper class #814
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dvrogozh, thank you for working on this! These abstractions are very much needed. The small tweaks to the implementation we talked about should be enough. |
This commit introduces abstract per-GPU cache helper class. It allows to store and reuse HW contexts which are expensive to create. Class is abstract and allows storing different object types. Co-authored-by: Nicolas Hug <[email protected]> Signed-off-by: Dmitry Rogozhkin <[email protected]>
Signed-off-by: Dmitry Rogozhkin <[email protected]>
Signed-off-by: Dmitry Rogozhkin <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dvrogozh, this looks great, thank you!
These ci failures seem unrelated. |
@dvrogozh, yup, agreed, it's a problem we know about and should start tracking. I'm going to merge. |
This commit introduces per-GPU cache helper class based on original design of the cache in CUDA device interface. New helper allows to store and reuse HW contexts which are expensive to create. Class is templated and allows storing different object types. At the moment it's applied to cache CUDA ffmpeg device contexts (
AVBufferRef
) and NPP contexts (NppStreamContext
). Class can be further reused in non-CUDA device implementations.CC: @scotts, @NicolasHug, @eromomon