Audio missing when using read_video() with video_reader backend #3890

prabhat00155 · 2021-05-21T12:21:55Z

Read entire video

video_path = "data/WUzgd7C1pWA.mp4"
set_video_backend('video_reader')
print(f'set backend: {get_video_backend()}')

visual, audio, info = read_video(video_path, pts_unit='pts')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)

visual, audio, info = read_video(video_path, pts_unit='sec')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)
---
Visual: torch.Size([327, 256, 340, 3]) Audio: torch.Size([0, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}
Visual: torch.Size([327, 256, 340, 3]) Audio: torch.Size([0, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}

Read video from start_pts

video_path = "data/WUzgd7C1pWA.mp4"
set_video_backend('video_reader')
print(f'set backend: {get_video_backend()}')

visual, audio, info = read_video(video_path, start_pts=1001, pts_unit='pts')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)

visual, audio, info = read_video(video_path, start_pts=0.0333667, pts_unit='sec')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)
---
set backend: video_reader
Visual: torch.Size([326, 256, 340, 3]) Audio: torch.Size([0, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}
Visual: torch.Size([326, 256, 340, 3]) Audio: torch.Size([0, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}

Read video from start_pts to end_pts

video_path = "data//WUzgd7C1pWA.mp4"
set_video_backend('video_reader')
print(f'set backend: {get_video_backend()}')

visual, audio, info = read_video(video_path, start_pts=1001, end_pts=2002, pts_unit='pts')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)

visual, audio, info = read_video(video_path, start_pts=0.0333667, end_pts=0.1001000, pts_unit='sec')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)
---
set backend: video_reader
Visual: torch.Size([2, 256, 340, 3]) Audio: torch.Size([0, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}
Visual: torch.Size([3, 256, 340, 3]) Audio: torch.Size([3072, 1]) {'video_fps': 29.970029830932617, 'audio_fps': 48000.0}

cc @bjuncek

The text was updated successfully, but these errors were encountered:

prabhat00155 added bug module: video labels May 21, 2021

prabhat00155 self-assigned this May 21, 2021

prabhat00155 mentioned this issue May 27, 2021

Fixed missing audio with video_reader backend #3934

Merged

prabhat00155 closed this as completed in #3934 Jun 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Audio missing when using read_video() with video_reader backend #3890

Audio missing when using read_video() with video_reader backend #3890

prabhat00155 commented May 21, 2021 •

edited by pytorch-probot bot

Loading

Audio missing when using read_video() with video_reader backend #3890

Audio missing when using read_video() with video_reader backend #3890

Comments

prabhat00155 commented May 21, 2021 • edited by pytorch-probot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

prabhat00155 commented May 21, 2021 •

edited by pytorch-probot bot

Loading