Skip to content

av==10.0.0 breaks CI #6790

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
pmeier opened this issue Oct 18, 2022 · 2 comments · Fixed by #6789
Closed

av==10.0.0 breaks CI #6790

pmeier opened this issue Oct 18, 2022 · 2 comments · Fixed by #6789

Comments

@pmeier
Copy link
Collaborator

pmeier commented Oct 18, 2022

The recent release of av==10.0.0 breaks at least two tests: https://app.circleci.com/pipelines/github/pytorch/vision/21245/workflows/ecbc81ff-f864-4227-a35d-b4ed5e9f57ee/jobs/1724114?invite=true#step-108-3075

=================================== FAILURES ===================================
____________________ TestVideo.test_video_clips_custom_fps _____________________
Traceback (most recent call last):
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1120, in _try_get_data
    data = self._data_queue.get(timeout=timeout)
  File "/root/project/env/lib/python3.7/multiprocessing/queues.py", line 104, in get
    if not self._poll(timeout):
  File "/root/project/env/lib/python3.7/multiprocessing/connection.py", line 257, in poll
    return self._poll(timeout)
  File "/root/project/env/lib/python3.7/multiprocessing/connection.py", line 414, in _poll
    r = wait([self], timeout)
  File "/root/project/env/lib/python3.7/multiprocessing/connection.py", line 921, in wait
    ready = selector.select(timeout)
  File "/root/project/env/lib/python3.7/selectors.py", line 415, in select
    fd_event_list = self._selector.poll(timeout)
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
    _error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 5231) is killed by signal: Segmentation fault. 

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/root/project/test/test_datasets_video_utils.py", line 63, in test_video_clips_custom_fps
    video_clips = VideoClips(video_list, num_frames, num_frames, fps, num_workers=2)
  File "/root/project/torchvision/datasets/video_utils.py", line 132, in __init__
    self._compute_frame_pts()
  File "/root/project/torchvision/datasets/video_utils.py", line 153, in _compute_frame_pts
    for batch in dl:
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 628, in __next__
    data = self._next_data()
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1316, in _next_data
    idx, data = self._get_data()
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1282, in _get_data
    success, data = self._try_get_data()
  File "/root/project/env/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1133, in _try_get_data
    raise RuntimeError('DataLoader worker (pid(s) {}) exited unexpectedly'.format(pids_str)) from e
RuntimeError: DataLoader worker (pid(s) 5231) exited unexpectedly
----------------------------- Captured stderr call -----------------------------
100.0%
ERROR: Unexpected segmentation fault encountered in worker.
ERROR: Unexpected segmentation fault encountered in worker.

I'm going to pin to < 10.

cc @seemethere

@pmeier
Copy link
Collaborator Author

pmeier commented Oct 24, 2022

#6814 includes reproduction

@bjuncek
Copy link
Contributor

bjuncek commented Feb 28, 2023

@pmeier I'll close this as #6814 contains repro, and seems to have been fixed in here.
I'll double check to see if the mentioned fix also fixes the CI locally and add the rest of the info to that issue

@bjuncek bjuncek closed this as completed Feb 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants