Skip to content

Can't find documentation for calling dvc from python module. #2690

Closed
@JoeyCarson

Description

@JoeyCarson

One of my utilities pulls in data from various sources and builds a large hierarchy where versioning may or may not be required. Ideally, the hierarchy needs to allow for users to simply changes files in certain directories from time to time and version the appropriate subdirectory of the data.

At first I wanted to have a simple command to track changes, e.g. dvc add rootdir. But that seems to require reindexing of the entire hierarchy, which is not suitable for my use case. Ideally, data import process would dvc add the subdirectory itself when creating it. So I'd like to do that from python.

Otherwise, it would be useful if I can somehow avoid reindexing the whole hierarchy by running dvc add at the root.

As a feature request, I would ask for either some better documentation to describe how to achieve this approach, or documentation for calling dvc API from python (without running shell commands) in order to better work with this style of data organization.

Metadata

Metadata

Assignees

No one assigned

    Labels

    A: apiRelated to the dvc.apiquestionI have a question?

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions