Support CUDA frame in FilterGraph #3183

mthrok · 2023-03-17T21:30:29Z

This commit adds CUDA frame support to FilterGraph.

It initializes a CUDA frames context and attaches it to FilterGraph, so that CUDA frames can be processed in FilterGraph.

As a result, it enables

1. CUDA filter support such as `scale_cuda`
2. Proper retrieval of the pixel format coming out of FilterGraph when CUDA HW acceleration is enabled (currently it is reported as "cuda")

Resolves #3159
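
To illustrate the second point: with hardware frames, the frame's own format is only the opaque "cuda" marker, and the real layout is recorded on the frames context as a software format. The following is a toy model of that lookup, not the FFmpeg API. `FrameInfo`, `HwFramesInfo` and `effective_pixel_format` are hypothetical names, and "nv12" is only an example value.

```cpp
#include <optional>
#include <string>

// Toy stand-ins for the structures involved. These are NOT the real FFmpeg
// types; they only show where the two formats live.
struct HwFramesInfo {
  std::string sw_format;  // underlying layout of the GPU surface, e.g. "nv12"
};

struct FrameInfo {
  std::string format;                     // "cuda" for frames living on the GPU
  std::optional<HwFramesInfo> hw_frames;  // present only for hardware frames
};

// Report the layout a consumer cares about: for hardware frames, look
// through the opaque "cuda" format to the software format behind it.
inline std::string effective_pixel_format(const FrameInfo& frame) {
  if (frame.hw_frames) {
    return frame.hw_frames->sw_format;
  }
  return frame.format;
}
```

Software frames fall through unchanged, so callers do not need to know whether HW acceleration is on.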

facebook-github-bot · 2023-03-17T21:30:56Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-17T23:28:43Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-17T23:35:13Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-19T04:21:29Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-19T14:12:41Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-19T17:40:42Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-19T17:47:45Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-19T18:26:16Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-19T22:09:31Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-19T22:57:46Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-20T04:25:49Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-20T04:30:11Z

This pull request was exported from Phabricator. Differential Revision: D44183722

facebook-github-bot · 2023-03-20T04:35:54Z

This pull request was exported from Phabricator. Differential Revision: D44183722

Summary:
This commit adds CUDA frame support to FilterGraph.

It initializes and attaches CUDA frames context to FilterGraph, so that CUDA frames can be processed in FilterGraph.

As a result, it enables
1. CUDA filter support such as `scale_cuda`
2. Properly retrieve the pixel format coming out of FilterGraph when CUDA HW acceleration is enabled. (currently it is reported as "cuda")

Resolves pytorch#3159

Pull Request resolved: pytorch#3183
Differential Revision: D44183722
Pulled By: mthrok
fbshipit-source-id: 394c16b2d95d6a741addd17b1284c901ba6a8de6

facebook-github-bot · 2023-03-20T14:29:23Z

@mthrok merged this pull request in c5b9655.

github-actions · 2023-03-20T14:29:35Z

Hey @mthrok.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py)

Summary:
Pull Request resolved: #3186

Fix the GPU memory leak introduced in #3183.

The HW frames context is owned by AVCodecContext. The removed `av_buffer_ref` call incremented the reference count unnecessarily and prevented AVCodecContext from freeing the resource.

(Note: this ignores all push blocking failures!)

Reviewed By: nateanl
Differential Revision: D44231876
fbshipit-source-id: 9be2c33049dd02a3fa82a85271de7fb62e5b09ea
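
The leak mechanism can be reproduced in miniature with any reference-counted handle. The sketch below uses `std::shared_ptr` as an analogy for the reference-counted buffer; it is not the torchaudio or FFmpeg code. `FramePool` and `pools_alive_after_owner_dies` are hypothetical names introduced for illustration.

```cpp
#include <memory>

// Hypothetical stand-in for a GPU frame pool. It counts live instances so
// that a leak is observable.
struct FramePool {
  static inline int live = 0;
  FramePool() { ++live; }
  ~FramePool() { --live; }
};

// Returns how many pools are still alive after the owner has gone away.
// `take_extra_ref` models the superfluous reference that is never released.
inline int pools_alive_after_owner_dies(bool take_extra_ref) {
  std::shared_ptr<FramePool>* extra = nullptr;
  {
    // The owner's reference (the decoder context, in the analogy).
    auto owner = std::make_shared<FramePool>();
    if (take_extra_ref) {
      // A second reference that outlives the owner and is not dropped with it.
      extra = new std::shared_ptr<FramePool>(owner);
    }
  }  // The owner releases its reference here.
  const int alive = FramePool::live;
  delete extra;  // Demo cleanup only, so repeated calls start from zero.
  return alive;
}
```

Without the extra reference the pool is destroyed together with its owner; with it, the count never reaches zero and the pool stays allocated.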

Summary:
Pull Request resolved: #3188

Refactor the process after decoding in StreamReader.

The post-decode process consists of three parts:
1. preprocessing using FilterGraph
2. conversion to Tensor
3. storing in Buffer

The FilterGraph class is a thin wrapper around the AVFilterGraph structure from FFmpeg, and it is agnostic to media type. However, Tensor conversion and buffering each involve a number of different code paths.

Currently, the conversion process is abstracted away with a template, i.e. `template<typename Conversion> class Buffer`, and the whole process is implemented in the Sink class, which consists of `FilterGraph` and `Buffer`. The `Buffer` internally contains the conversion logic, even though conversion and buffering have nothing in common and are better kept logically separate.

The new implementation replaces the `Sink` class with the `IPostDecodeProcess` interface, which contains the three components. Each variant of the post-decode process is selected through the template arguments of the actual implementation, i.e.

```c++
template <typename Converter, typename Buffer>
class ProcessImpl : public IPostDecodeProcess
```

and stored as `unique_ptr<IPostDecodeProcess>` on `StreamProcessor`. (This is the [functionoid pattern](https://isocpp.org/wiki/faq/pointers-to-members#functionoids), which eliminates all the branching based on the media format.)

Note: This implementation was not possible in the initial version of StreamReader, as there was no way of knowing the media attributes coming out of `AVFilterGraph`. #3155 and #3183 added features to parse them properly, so we can finally make the post-processing strongly typed.

Reviewed By: hwangjeff
Differential Revision: D44242647
fbshipit-source-id: 96b8c6c72a2b8af4fa86a9b02292c65078ee265b
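
A minimal sketch of the shape described above, under stated assumptions: only `IPostDecodeProcess` and `ProcessImpl` are names from the commit message. `Frame`, `SumConverter`, `CountConverter`, `ListBuffer` and `make_process` are hypothetical, and the FilterGraph stage is left out for brevity.

```cpp
#include <cstddef>
#include <memory>
#include <string>
#include <vector>

// Hypothetical decoded frame; the real code operates on FFmpeg frames.
struct Frame {
  std::vector<int> samples;
};

// Type-erased interface held by the stream processor.
struct IPostDecodeProcess {
  virtual ~IPostDecodeProcess() = default;
  virtual void process(const Frame& frame) = 0;
  virtual std::size_t buffered() const = 0;
  virtual long last() const = 0;
};

// Hypothetical converters: each turns a frame into the value to be stored.
struct SumConverter {
  long convert(const Frame& f) const {
    long total = 0;
    for (int v : f.samples) total += v;
    return total;
  }
};
struct CountConverter {
  long convert(const Frame& f) const {
    return static_cast<long>(f.samples.size());
  }
};

// Hypothetical buffer: keeps every converted item in arrival order.
struct ListBuffer {
  std::vector<long> items;
  void push(long item) { items.push_back(item); }
};

// Concrete process: the converter/buffer pair is fixed at compile time,
// so process() contains no branching on the media format.
template <typename Converter, typename Buffer>
class ProcessImpl : public IPostDecodeProcess {
 public:
  void process(const Frame& frame) override {
    buffer_.push(converter_.convert(frame));
  }
  std::size_t buffered() const override { return buffer_.items.size(); }
  long last() const override {
    return buffer_.items.empty() ? 0 : buffer_.items.back();
  }

 private:
  Converter converter_;
  Buffer buffer_;
};

// The only format-dependent branch: picking the instantiation once at setup.
inline std::unique_ptr<IPostDecodeProcess> make_process(const std::string& kind) {
  if (kind == "sum") {
    return std::make_unique<ProcessImpl<SumConverter, ListBuffer>>();
  }
  return std::make_unique<ProcessImpl<CountConverter, ListBuffer>>();
}
```

The caller holds only the `unique_ptr<IPostDecodeProcess>`, so the per-frame path is a single virtual call regardless of which converter and buffer were chosen.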

Summary:
With the support of CUDA filters in #3183, it is now possible to change the pixel format of a CUDA frame. This commit adds conversion for the YUV444P format.

Pull Request resolved: #3199
Reviewed By: hwangjeff
Differential Revision: D44323928
Pulled By: mthrok
fbshipit-source-id: 6d9b205e7235df5f21e7d3e06166b3a169f1ae9f
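
For reference, YUV444P stores three full-resolution planes (Y, U, V), which is what makes the conversion a plain plane-by-plane copy. The sketch below shows that layout on the host side only; it is not the GPU conversion added in #3199. `pack_yuv444p` is a hypothetical helper, and the `[3, height, width]` output layout is an assumption made for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Copy three equally sized planes (Y, U, V), whose rows may be padded to
// `strides[p]` bytes, into one contiguous buffer laid out as
// [3, height, width].
inline std::vector<std::uint8_t> pack_yuv444p(
    const std::uint8_t* const planes[3],
    const int strides[3],
    int width,
    int height) {
  std::vector<std::uint8_t> out(static_cast<std::size_t>(3) * height * width);
  std::uint8_t* dst = out.data();
  for (int p = 0; p < 3; ++p) {
    for (int row = 0; row < height; ++row) {
      // Copy only the visible pixels of the row and skip the padding.
      const std::uint8_t* src =
          planes[p] + static_cast<std::size_t>(row) * strides[p];
      std::memcpy(dst, src, static_cast<std::size_t>(width));
      dst += width;
    }
  }
  return out;
}
```

Formats with subsampled chroma, such as NV12 or YUV420P, have smaller or interleaved chroma planes, so they cannot be packed with this copy alone.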

facebook-github-bot added the CLA Signed label Mar 17, 2023

mthrok force-pushed the cuda-filter-graph branch from 7766daf to 45c0a25 Compare March 17, 2023 23:28

mthrok force-pushed the cuda-filter-graph branch from 45c0a25 to 25a191b Compare March 17, 2023 23:35

mthrok mentioned this pull request Mar 19, 2023

List of feature requests received so far for StreamReader/Writer #3139

Open

8 tasks

mthrok force-pushed the cuda-filter-graph branch from b0342f5 to 0415a77 Compare March 19, 2023 17:39

mthrok force-pushed the cuda-filter-graph branch from 0415a77 to ea29cbc Compare March 19, 2023 17:47

mthrok force-pushed the cuda-filter-graph branch from 1d2b21c to 2e2fe7a Compare March 19, 2023 22:08

mthrok force-pushed the cuda-filter-graph branch from 4efb0a8 to 187a688 Compare March 20, 2023 04:25

mthrok force-pushed the cuda-filter-graph branch from 187a688 to a88ca35 Compare March 20, 2023 04:30

mthrok force-pushed the cuda-filter-graph branch from a88ca35 to 90fc533 Compare March 20, 2023 04:35

facebook-github-bot closed this in c5b9655 Mar 20, 2023

facebook-github-bot added the Merged label Mar 20, 2023

mthrok deleted the cuda-filter-graph branch March 20, 2023 15:13

mthrok mentioned this pull request Mar 20, 2023

Fix GPU memory leak on StreamReader #3186

Closed

mthrok added C++ module: IO new feature improvement labels Mar 20, 2023

mthrok mentioned this pull request Mar 21, 2023

Refactor the internal of StreamReader #3188

Closed

mthrok mentioned this pull request Mar 23, 2023

Support YUV444P in GPU decoder #3199

Closed

mthrok mentioned this pull request May 31, 2023

StreamWriter The h264_nvenc/hevc_nvenc encoder supports the YUV420P format #3388

Open
