Data Version Control
– and much more –
for the GenAI era

Free and open source, forever.

Manage and version images, audio, video, and text files in storage and organize your ML modeling process into a reproducible workflow.

GenAI data chain

Coming soon

Data and model versioning

---

Explore and enrich annotated datasets with custom embeddings, auto-labeling, and bias removal at billion-file scale — without modifying your data.

Connect to versioned data sources and code with pipelines, track experiments, register models — all based on GitOps principles.

Get Started with

DVC^X and DVC: Better Together

Build the datasets you need without modifying your data sources. Create pipelines that connect your versioned datasets, code, and models together for effective experiment tracking the GitOps way.

Get Started with

Get started

DVC For VS CodeGet VS Code Extension

Data Version Control
– and much more –
for the GenAI era

GenAI data chain

Data and model versioning

Get Started with

Filter a billion samples in seconds

Create datasets from queries

DVC^X and DVC: Better Together

Get Started with

Connect storage to repo

Configure steps as you go

Track experiments in Git

Empowering thousands of users and customers from startups to Fortune 500 companies

Data Version Control– and much more –for the GenAI era

GenAI data chain

Data and model versioning

Get Started with

Filter a billion samples in seconds

Create datasets from queries

DVCX and DVC: Better Together

Get Started with

Connect storage to repo

Configure steps as you go

Track experiments in Git

Empowering thousands of users and customers from startups to Fortune 500 companies

Data Version Control
– and much more –
for the GenAI era

DVC^X and DVC: Better Together