dvc: support .dvcignore #1499
Comments
@efiop could you please clarify a bit more? Is it for users mostly or for internal DVC usage? Should we include
It is for users mostly, as a convenient way to exclude some paths/files/dirs from being cached/tracked by dvc. Good point about
Is this somehow related to #1471? I saw on the chat that the problem was that the MD5 of a directory changed because the file system created a I can see the same stuff happening for other things if you are not careful enough (e.g. vim swap files, file locking mechanisms that create dotfiles, IDE specific files / If we are introducing
I confirm that
I think an equally pressing issue is that having a large (4 million files) un-cached folder slows down dvc, as it needs to traverse the whole folder before executing any command. Adding support for .dvcignore would add the required capability to address this issue.
Please, create a ticket or a page to document the changes.
Same as .gitignore, but for dvc. It will tell dvc which paths to ignore when caching data. Would be extremely useful for ignoring auto-generated/temporary files/directories that are created as a side effect and don't carry anything useful.
First iteration should make Repo.stages() skip paths that match the patterns listed in .dvcignore when collecting stages.
Second iteration should support dvc add/run. We have a separate issue for it at #1876.
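To make the first iteration more concrete, here is a minimal sketch of how ignore patterns could prune a directory walk while collecting stage files. It is not DVC's actual implementation: the helper names (load_patterns, is_ignored, collect_stage_files) are hypothetical, patterns are matched against base names only with fnmatch rather than with full .gitignore semantics, and treating every file ending in .dvc as a stage file is an assumption made for illustration.

```python
import fnmatch
import os


def load_patterns(root):
    """Read patterns from a .dvcignore file at the repo root (hypothetical helper)."""
    path = os.path.join(root, ".dvcignore")
    if not os.path.exists(path):
        return []
    with open(path) as fobj:
        lines = (line.strip() for line in fobj)
        # Skip blank lines and comments, as .gitignore does.
        return [line for line in lines if line and not line.startswith("#")]


def is_ignored(name, patterns):
    """Return True if a base name matches any pattern (simplified matching)."""
    return any(fnmatch.fnmatch(name, pattern) for pattern in patterns)


def collect_stage_files(root):
    """Walk root and return relative paths of stage files, skipping ignored paths."""
    patterns = load_patterns(root)
    found = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune ignored directories in place so os.walk never descends into them.
        dirnames[:] = [d for d in dirnames if not is_ignored(d, patterns)]
        for fname in filenames:
            if is_ignored(fname, patterns):
                continue
            # Assumption: stage files are identified by a .dvc suffix.
            if fname.endswith(".dvc"):
                full = os.path.join(dirpath, fname)
                found.append(os.path.relpath(full, root))
    return sorted(found)
```

Pruning dirnames in place is what would address the slow traversal mentioned in the comments: an ignored directory with millions of files is never entered at all, rather than being walked and filtered afterwards.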