Get dataset size from DVC data registry  #4682

Closed
@RafayAK

Description

Hi, I would like to request a feature to query dataset sizes from a DVC registry.

I have created a data registry by following the documentation and added a dataset to my remote storage with the dvc add my_dataset and dvc push commands.
Now, before I import the dataset from the registry, I want to make sure I have enough disk space, so something like the following would be really useful:

dvc du -hs https://github.com/my_data_registry my_dataset
  • du: short for Disk Usage, a standard Unix command that reports how much space files and directories use.
    The options h and s mean the following:
    - h: print sizes in a human-readable format (e.g. kilobytes, megabytes, gigabytes) instead of a raw block count.
    - s: print a single summarized total for each argument instead of one line per subdirectory.
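
To make the intended semantics of the h and s options concrete, here is a minimal Python sketch of what they amount to for a local directory. It is not DVC code and does not query a registry; the function names total_size and human_readable are hypothetical, and it sums apparent file sizes, whereas the real du reports allocated disk blocks, so the numbers can differ slightly.

```python
import os


def total_size(path):
    """Sum the apparent sizes (in bytes) of all regular files under path.

    Roughly what the -s option summarizes; symlinks are skipped so that
    linked files are not counted twice.
    """
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def human_readable(num_bytes):
    """Format a byte count with a 1024-based unit suffix, in the spirit of -h."""
    size = float(num_bytes)
    for unit in ("B", "K", "M", "G", "T"):
        # Stop at the first unit where the value fits, or at the largest unit.
        if size < 1024 or unit == "T":
            return f"{size:.1f}{unit}"
        size /= 1024


if __name__ == "__main__":
    # Hypothetical local directory name, taken from the example command above.
    print(human_readable(total_size("my_dataset")), "my_dataset")
```

The requested dvc du would ideally produce the same kind of single-line total, but computed from the registry's metadata rather than from files that have already been downloaded.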

If the DVC team thinks there is a better way to do this without adding a new feature, I'm all ears.

Metadata

Labels

feature request: Requesting a new feature
p2-medium: Medium priority, should be done, but less important
