Returns the contents of a tracked file.
def read(path: str, repo: str = None, rev: str = None, remote: str = None, mode: str = "r", encoding: str = None)
import dvc.api modelpkl = dvc.api.read( 'model.pkl', repo='https://github.com/iterative/example-get-started', mode='rb')
This function wraps
dvc.api.open(), for a simple way to return the complete
contents of a file tracked in a DVC project. The file can be
tracked by DVC (as an output) or by Git.
This is similar to the
dvc getcommand in our CLI.
The type returned depends on the
modeused. For more details, please refer to Python's
open()built-in, which is used under the hood.
path (required) - location and file name of the target to read, relative
to the root of the project (
repo - specifies the location of the DVC project. It can be a URL or a file
system path. Both HTTP and SSH protocols are supported for online Git repos
[user@]server:project.git). Default: The current project is used
(the current working directory tree is walked up to find it).
remote - name of the DVC remote to look for
the target data. Default: The
default remote of
repo is used if a
remote argument is not given. For local projects, the cache is
tried before the default remote.
mode - specifies the mode in which the file is opened. Defaults to
(read). Mirrors the namesake parameter in builtin
to decode the file contents to a string. This should only be used in text
mode. Defaults to
"utf-8". Mirrors the namesake parameter in builtin
dvc.exceptions.FileMissingError - file in
path is missing from
path cannot be found in
dvc.exceptions.NoRemoteError - no
remote is found.
Any file tracked in a DVC project (and stored remotely) can be loaded directly in your Python code with this API. For example, let's say that you want to load and unserialize a binary model from a repo on GitHub:
import pickle import dvc.api model = pickle.loads( dvc.api.read( 'model.pkl', repo='https://github.com/iterative/example-get-started' mode='rb' ) )
'rb'mode here for compatibility with