start: complete setup for data pipelines #3998

Closed
wants to merge 6 commits into from
Changes from 4 commits
11 changes: 8 additions & 3 deletions content/docs/start/data-management/data-pipelines.md
@@ -32,7 +32,7 @@ into a [stage](/doc/command-reference/stage):

### ⚙️ Expand to download example code.

- Get the sample code like this:
+ Get the sample code, initialize DVC, and save everything to Git:

```dvc
$ wget https://code.dvc.org/get-started/code.zip
@@ -47,6 +47,12 @@ $ tree
├── prepare.py
├── requirements.txt
└── train.py

$ git init # if needed
$ dvc init
$ git add .
$ git commit -a -m "Create evaluation stage"
$ dvc pull data/data.xml
```

Now let's install the requirements:
@@ -57,10 +63,9 @@ Now let's install the requirements:

```dvc
$ pip install -r src/requirements.txt
$ git commit
```

Please also add or commit the source code directory with Git at this point.

Comment on lines -62 to -63
Contributor Author

Note: I'm leaving this sentence out. It's also repetitive, since "This should be a good time to commit the changes with Git" already appears at the end of https://dvc.org/doc/start/data-management/data-pipelines#dependency-graphs-dag (after `stage add`).
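
The `git init # if needed` step in the diff above leaves the check to the reader. A minimal sketch of how that check could be made explicit is below. `ensure_git_repo` is a hypothetical helper name introduced here for illustration; it is not part of DVC, Git, or this PR.

```shell
# Sketch only: make the "git init # if needed" step explicit.
# ensure_git_repo is a hypothetical helper, not a DVC or Git command.
ensure_git_repo() {
  # git rev-parse fails when the current directory is not in a Git repository
  if git rev-parse --is-inside-work-tree >/dev/null 2>&1; then
    echo "existing"
  else
    # Create a new repository in the current directory
    git init --quiet && echo "initialized"
  fi
}
```

Calling `ensure_git_repo` before `dvc init` would keep the Git step safe to re-run, whether or not the directory is already a repository.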

</details>

```dvc