Skip to content

What is expected Kinetics400 dataset directory structure? #3273

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
vadimkantorov opened this issue Jan 22, 2021 · 5 comments · Fixed by #3453
Closed

What is expected Kinetics400 dataset directory structure? #3273

vadimkantorov opened this issue Jan 22, 2021 · 5 comments · Fixed by #3453

Comments

@vadimkantorov
Copy link

vadimkantorov commented Jan 22, 2021

Given that the dataset does not come with official downloader scripts and that most roll their own or hack some third-party scripts, it would be much clearer if https://pytorch.org/docs/stable/torchvision/datasets.html#kinetics-400 explained what directory structure is expected by torchvision.datasets.Kinetics400

What is the expected dataset size? and the video file extensions?

Thanks!

cc @pmeier

@vadimkantorov vadimkantorov changed the title Document expected Kinetics400 dataset directory structure What is expected Kinetics400 dataset directory structure? Jan 22, 2021
@vfdev-5
Copy link
Collaborator

vfdev-5 commented Jan 27, 2021

@vadimkantorov
Copy link
Author

This is not very helpful, since it does not mention explicitly the class-label directory organisation. It is also not clear if root_dir should contain the splits or if the root_dir is the path to a chosen split.

@vfdev-5
Copy link
Collaborator

vfdev-5 commented Jan 27, 2021

Unfortunately, I removed the dataset from my storage where I tried a minimal training of the ref example.
Maybe, Bruno could share more info about that...

@vadimkantorov
Copy link
Author

The problem is that the dataset does not have official download or dataset stats scripts or recommended directory structure. From what I found, the root_dir should be the path to the split, the split directory should contain directories for every class labels with videos inside them. But this should better be mentioned explicitly in the docs

@vfdev-5
Copy link
Collaborator

vfdev-5 commented Jan 27, 2021

I agree. I had a similar question in my case... If you'd like to improve the docs, feel free to send a PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants