@@ -22,10 +22,10 @@ _middleware_ between ML projects and cloud storage. Here are its advantages:
22
22
- Reusability: reproduce and organize _ feature stores_ with a simple CLI
23
23
(` dvc get ` and ` dvc import ` commands, similar to software package management
24
24
systems like ` pip ` ).
25
- - Persistence: the DVC registry-controlled
26
- [ remote storage] ( /doc/command-reference/remote ) (e.g. an S3 bucket) improves
27
- data security. There are less chances someone can delete or rewrite a model,
28
- for example .
25
+ - Availability and persistence: the
26
+ [ remote storage] ( /doc/command-reference/remote ) configured in the DVC registry
27
+ (e.g. an S3 bucket) works as a proxy that endures your data is always
28
+ available and that it can outlive the registry itself .
29
29
- Storage optimization: track data
30
30
[ shared] ( /doc/use-cases/sharing-data-and-model-files ) by multiple projects
31
31
centralized in a single location (with the ability to create distributed
@@ -34,10 +34,11 @@ _middleware_ between ML projects and cloud storage. Here are its advantages:
34
34
- Data as code: leverage Git workflow such as commits, branching, pull requests,
35
35
reviews, and even CI/CD for your data and models lifecycle. Think "Git for
36
36
cloud storage", but without ad-hoc conventions.
37
- - Security: registries can be setup to have read-only remote storage (e.g. an
38
- HTTP location). Git versioning of
39
- [ DVC metafiles] ( /doc/user-guide/dvc-files-and-directories ) allows us to track
40
- and audit data changes.
37
+ - Data Security: there are less chances someone can delete or rewrite datasets
38
+ or models, preserving data integrity. Registries can even be setup to use
39
+ read-only remote storage (e.g. an HTTP location). Additionally, versioning
40
+ [ DVC metafiles] ( /doc/user-guide/dvc-files-and-directories ) with Git enables
41
+ following and auditing data changes.
41
42
42
43
## Building registries
43
44
0 commit comments