Commit e2aa2f6a authored by Mitar's avatar Mitar Committed by Swaroop Vattam
Browse files

Updating README

parent b742dfcc
......@@ -23,7 +23,7 @@ This corpus is organized into seed datasets and training datasets.
Download datasets using [git LFS](https://git-lfs.github.com/):
```
$ git lfs clone git@gitlab.datadrivendiscovery.org:d3m/datasets.git
$ git clone --recursive git@datasets.datadrivendiscovery.org:d3m/datasets.git
```
Note, use `git lfs clone` instead of `git clone` because it
......@@ -45,13 +45,19 @@ It is possible to download only part of the repository. First clone
without downloading files managed by git LFS:
```
$ git lfs clone git@gitlab.datadrivendiscovery.org:d3m/datasets.git -X "*"
$ GIT_LFS_SKIP_SMUDGE=1 git clone --recursive git@datasets.datadrivendiscovery.org:d3m/datasets.git
```
This will download and checkout all files smaller than 100 KB.
This will download and checkout all files smaller than 100 KB,
including all the history.
Now to download all files of one dataset, run inside cloned repository:
After cloning, you can, e.g., download all files of just one dataset.
Run inside the cloned repository:
```
$ git lfs pull -I seed_datasets_current/185_baseball/
```
Another way to download only part of the repository is to not
clone also all git submodules with `--recursive`, but do only
those you are interested in.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment