Return RGB frames as output of GPU decoder #5191

prabhat00155 · 2022-01-11T15:05:02Z

GPU decoder currently outputs NV12 flattened frames. This PR changes changes the output type to RGB, also fixing the output shape.
Most of this work was done by @fmassa in his draft PR, I made some adjustments when calculating pyav output.
Resolves #5141 and #5145.

facebook-github-bot · 2022-01-11T15:05:09Z

💊 CI failures summary and remediations

As of commit 7c85da9 (more details on the Dr. CI page):

✅ None of the CI failures appear to be your fault 💚

1/1 broken upstream at merge base 038828e since Jan 14

🚧 1 ongoing upstream failure:

These were probably caused by upstream breakages that are not fixed yet.

unittest_prototype since Jan 14 (adf8466)
- 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

test/test_video_gpu_decoder.py

fmassa

Thanks for the PR Prabhat!

I made some comments, let me know what you think

torchvision/csrc/io/decoder/gpu/decoder.cpp

fmassa · 2022-01-17T13:48:24Z

torchvision/csrc/io/decoder/gpu/decoder.cpp

  uint8_t* frame_ptr = decoded_frame.data_ptr<uint8_t>();
+  const uint8_t* const source_arr[] = {
+      (const uint8_t* const)source_frame,
+      (const uint8_t* const)(source_frame + source_pitch * ((surface_height + 1) & ~1))};


I would like us to double-check this (surface_height + 1) & ~1) condition. Can you create some videos with odd dimensions to validate what is actually needed?

surface_height is different from luma_height which is directly related to the video dimensions.
I can revisit this when doing code refactoring.

My point is the +1) & ~1 condition. I'm not sure if it's actually necessary

luma height is aligned by 2, so the chroma offset should not be odd(chroma base address can't be odd memory location), hence the alignment.

test/test_video_gpu_decoder.py

fmassa · 2022-01-17T13:53:10Z

Also, I think a bunch of things related to output_format like in

vision/torchvision/csrc/io/decoder/gpu/decoder.h

Lines 44 to 45 in 5e56575

    
           return (video_output_format == cudaVideoSurfaceFormat_NV12 || 
        
                   video_output_format == cudaVideoSurfaceFormat_P016)

can be cleaned now, right? As we will enforce / assume that the decoder returns NV12, and then we convert it to RGB.

prabhat00155 · 2022-01-18T12:23:04Z

Also, I think a bunch of things related to output_format like in

vision/torchvision/csrc/io/decoder/gpu/decoder.h

Lines 44 to 45 in 5e56575

return (video_output_format == cudaVideoSurfaceFormat_NV12 ||

video_output_format == cudaVideoSurfaceFormat_P016)

can be cleaned now, right? As we will enforce / assume that the decoder returns NV12, and then we convert it to RGB.

Yeah, I'll clean these up in a separate PR. There are a few functions in there that may not be needed anymore.

fmassa

I'm approving to unblock, but I think we need to add tests for odd-sized videos to validate that our implementation handles things correctly

prabhat00155 · 2022-01-19T11:36:32Z

I'm approving to unblock, but I think we need to add tests for odd-sized videos to validate that our implementation handles things correctly

I tested it locally using odd-sized videos. We'll could perhaps add some odd-sized videos in torchvision for testing.

Summary: * Return RGB frames as output of GPU decoder * Move clamp to the conversion function * Cleaned up a bit * Remove utility functions from test * Use data member width directly * Fix linter error Reviewed By: jdsgomes, prabhat00155 Differential Revision: D33739378 fbshipit-source-id: cea9f49fdefd777ec27a902947531c561686c80c

Return RGB frames as output of GPU decoder

7ac67b9

prabhat00155 added enhancement module: video labels Jan 11, 2022

prabhat00155 assigned bjuncek and fmassa Jan 11, 2022

pytorch-probot bot added the ciflow/default label Jan 11, 2022

facebook-github-bot added the cla signed label Jan 11, 2022

prabhat00155 commented Jan 11, 2022

View reviewed changes

test/test_video_gpu_decoder.py Outdated Show resolved Hide resolved

prabhat00155 unassigned bjuncek and fmassa Jan 11, 2022

prabhat00155 requested review from fmassa and bjuncek January 11, 2022 15:07

prabhat00155 added 6 commits January 12, 2022 02:41

Move clamp to the conversion function

d5ef8bc

Cleaned up a bit

7a472d1

Merge branch 'master' into prabhat00155/rgb_kernel

8fb7e3c

Merge branch 'master' into prabhat00155/rgb_kernel

b595d8a

Remove utility functions from test

73ab184

Merge branch 'master' into prabhat00155/rgb_kernel

e43282e

fmassa reviewed Jan 17, 2022

View reviewed changes

Use data member width directly

ff09aac

prabhat00155 mentioned this pull request Jan 18, 2022

GPU decoder code cleanup #5205

Closed

prabhat00155 requested a review from fmassa January 18, 2022 13:29

prabhat00155 added 2 commits January 18, 2022 05:57

Fix linter error

eb63a8a

Merge branch 'master' into prabhat00155/rgb_kernel

945c39b

fmassa approved these changes Jan 19, 2022

View reviewed changes

prabhat00155 mentioned this pull request Jan 19, 2022

Find a better way to compare GPU decoder results with pyav results #5216

Open

Merge branch 'master' into prabhat00155/rgb_kernel

7c85da9

prabhat00155 merged commit f4fd193 into pytorch:main Jan 19, 2022

prabhat00155 deleted the prabhat00155/rgb_kernel branch January 19, 2022 12:28

prabhat00155 mentioned this pull request Feb 1, 2022

Removed unused member functions from GPU decoder #5327

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Return RGB frames as output of GPU decoder #5191

Return RGB frames as output of GPU decoder #5191

Uh oh!

prabhat00155 commented Jan 11, 2022 •

edited

Loading

Uh oh!

facebook-github-bot commented Jan 11, 2022 •

edited

Loading

Uh oh!

Uh oh!

fmassa left a comment

Uh oh!

Uh oh!

fmassa Jan 17, 2022

Uh oh!

prabhat00155 Jan 18, 2022 •

edited

Loading

Uh oh!

fmassa Jan 19, 2022

Uh oh!

prabhat00155 Jan 19, 2022

Uh oh!

Uh oh!

fmassa commented Jan 17, 2022

Uh oh!

prabhat00155 commented Jan 18, 2022

Uh oh!

fmassa left a comment

Uh oh!

prabhat00155 commented Jan 19, 2022

Uh oh!

Uh oh!

Return RGB frames as output of GPU decoder #5191

Return RGB frames as output of GPU decoder #5191

Uh oh!

Conversation

prabhat00155 commented Jan 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Jan 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

🚧 1 ongoing upstream failure:

Uh oh!

Uh oh!

fmassa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fmassa Jan 17, 2022

Choose a reason for hiding this comment

Uh oh!

prabhat00155 Jan 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fmassa Jan 19, 2022

Choose a reason for hiding this comment

Uh oh!

prabhat00155 Jan 19, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fmassa commented Jan 17, 2022

Uh oh!

prabhat00155 commented Jan 18, 2022

Uh oh!

fmassa left a comment

Choose a reason for hiding this comment

Uh oh!

prabhat00155 commented Jan 19, 2022

Uh oh!

Uh oh!

prabhat00155 commented Jan 11, 2022 •

edited

Loading

facebook-github-bot commented Jan 11, 2022 •

edited

Loading

prabhat00155 Jan 18, 2022 •

edited

Loading