Data Version Control

The easy-to-use Git extension that brings data version control to data scientists.

Apply data version control to your data science workflows with minimal overhead.
Free and open source

We’re thrilled to welcome the DVC Community to the lakeFS family. DVC will continue as an independent open source tool and Git extension for data scientists – everything you love about DVC stays the same.

lakeFS has been the enterprise standard for data version control, serving Fortune 100 companies and organizations like NASA and Volvo with scalable infrastructure for petabyte-scale data and production AI. Check out lakeFS.

Together, we look forward to being the trusted resource for teams at every scale across the entire data version control ecosystem.

Get Started with DVC

Connect storage to repo

Keep large data and model files alongside code and share via your cloud storage.
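The idea of keeping large files "alongside" code can be pictured as a small pointer file in the repository plus a content-addressed copy of the data in external storage. The sketch below is only an illustration of that idea using the Python standard library. The helper names `track_file` and `restore_file`, the `.pointer.json` suffix, the SHA-256 hash, and the cache layout are all assumptions made for this example, not DVC's actual file format or API.

```python
import hashlib
import json
import shutil
import tempfile
from pathlib import Path


def track_file(data_path: Path, storage_dir: Path) -> Path:
    """Copy a file into content-addressed storage and write a small pointer file.

    Hypothetical helper: only the pointer file would be committed to Git,
    while the large file itself lives in `storage_dir` (standing in for a
    cloud bucket).
    """
    digest = hashlib.sha256(data_path.read_bytes()).hexdigest()
    # Illustrative layout: first two hex characters form a subdirectory.
    blob_path = storage_dir / digest[:2] / digest[2:]
    blob_path.parent.mkdir(parents=True, exist_ok=True)
    if not blob_path.exists():
        shutil.copy2(data_path, blob_path)
    pointer_path = data_path.with_name(data_path.name + ".pointer.json")
    pointer_path.write_text(
        json.dumps({"sha256": digest, "path": data_path.name})
    )
    return pointer_path


def restore_file(pointer_path: Path, storage_dir: Path) -> Path:
    """Recreate the original file next to its pointer, using the stored hash."""
    meta = json.loads(pointer_path.read_text())
    digest = meta["sha256"]
    target = pointer_path.with_name(meta["path"])
    shutil.copy2(storage_dir / digest[:2] / digest[2:], target)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        model = root / "model.bin"
        model.write_bytes(b"weights" * 1000)
        pointer = track_file(model, root / "storage")
        model.unlink()  # simulate a fresh clone that only has the pointer
        print(restore_file(pointer, root / "storage").stat().st_size)
```

Because the pointer holds only a hash and a file name, it stays tiny no matter how large the tracked file is, which is what makes it practical to version in Git.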

Configure steps as you go

Declare dependencies and outputs at each step to build reproducible end-to-end pipelines.
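To make the "declare dependencies and outputs" idea concrete, here is a minimal sketch of a pipeline runner that orders stages by their declared dependencies and re-runs a stage only when its inputs have changed. It is a toy model under stated assumptions: the `run_pipeline` and `fingerprint` functions, the in-memory `artifacts` dictionary, and the `lock` dictionary are hypothetical names for this example and do not reflect how DVC itself is implemented.

```python
import hashlib
from graphlib import TopologicalSorter  # Python 3.9+


def fingerprint(values):
    """Hash the declared inputs of a stage so changes can be detected."""
    h = hashlib.sha256()
    for value in values:
        h.update(repr(value).encode())
    return h.hexdigest()


def run_pipeline(stages, artifacts, lock):
    """Run stages in dependency order, skipping those whose inputs are unchanged.

    stages:    name -> {"deps": [...], "outs": [...], "run": callable}
    artifacts: name -> value (stands in for files on disk)
    lock:      name -> fingerprint of the inputs used in the last run
    Returns the names of the stages that were actually executed.
    """
    producer = {out: name for name, s in stages.items() for out in s["outs"]}
    graph = {
        name: {producer[d] for d in s["deps"] if d in producer}
        for name, s in stages.items()
    }
    executed = []
    for name in TopologicalSorter(graph).static_order():
        stage = stages[name]
        inputs = [artifacts[d] for d in stage["deps"]]
        fp = fingerprint(inputs)
        if lock.get(name) == fp and all(o in artifacts for o in stage["outs"]):
            continue  # inputs unchanged and outputs present: reuse them
        results = stage["run"](*inputs)  # must return one value per output
        artifacts.update(zip(stage["outs"], results))
        lock[name] = fp
        executed.append(name)
    return executed


# Two hypothetical stages: clean the raw data, then "train" by averaging it.
STAGES = {
    "train": {
        "deps": ["clean"],
        "outs": ["model"],
        "run": lambda clean: (sum(clean) / len(clean),),
    },
    "prepare": {
        "deps": ["raw"],
        "outs": ["clean"],
        "run": lambda raw: ([x for x in raw if x is not None],),
    },
}

if __name__ == "__main__":
    artifacts, lock = {"raw": [1, None, 3]}, {}
    print(run_pipeline(STAGES, artifacts, lock))  # first run executes everything
    print(run_pipeline(STAGES, artifacts, lock))  # second run has nothing to do
```

The design point is that reproducibility comes from the declarations: because each stage names what it reads and writes, the runner can derive both the execution order and the minimal set of stages to repeat.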

Track experiments in Git

Track experiments in your repo, compare results, and restore entire experiment states across teams.
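Comparing experiments comes down to recording each run's parameters and metrics and then diffing them. The following is a small sketch of that comparison step only. The `Experiment` class and the `diff_experiments` and `best_experiment` functions are hypothetical names invented for this example, and the parameter and metric values are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Experiment:
    """One recorded run: a name plus the params used and metrics produced."""
    name: str
    params: dict
    metrics: dict


def diff_experiments(base, other):
    """Return {key: (base_value, other_value)} for every value that differs."""
    changes = {}
    for section in ("params", "metrics"):
        a, b = getattr(base, section), getattr(other, section)
        for key in sorted(set(a) | set(b)):
            if a.get(key) != b.get(key):
                changes[f"{section}.{key}"] = (a.get(key), b.get(key))
    return changes


def best_experiment(experiments, metric, higher_is_better=True):
    """Pick the experiment with the best value for the given metric."""
    pick = max if higher_is_better else min
    return pick(experiments, key=lambda e: e.metrics[metric])


if __name__ == "__main__":
    # Illustrative values only.
    baseline = Experiment("baseline", {"lr": 0.1, "epochs": 5}, {"accuracy": 0.81})
    tuned = Experiment("tuned", {"lr": 0.01, "epochs": 5}, {"accuracy": 0.86})
    for key, (old, new) in diff_experiments(baseline, tuned).items():
        print(f"{key}: {old} -> {new}")
    print(best_experiment([baseline, tuned], "accuracy").name)
```

Restoring an "entire experiment state" would additionally require the code, data, and parameters behind each run to be versioned together, which this sketch does not attempt.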


Empowering thousands of users and customers from startups to Fortune 500 companies
