Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get started consolidation #5153

Closed
1 of 9 tasks
dberenbaum opened this issue Feb 23, 2024 · 4 comments
Closed
1 of 9 tasks

Get started consolidation #5153

dberenbaum opened this issue Feb 23, 2024 · 4 comments
Assignees
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: start Content of /doc/start

Comments

@dberenbaum
Copy link
Contributor

dberenbaum commented Feb 23, 2024

There are some common themes I notice in feedback on our docs:

  • Unclear where to start
  • Too many options
  • Info feels scattered
  • Too much effort/takes too long to onboard

We have simplified the experience for features like experiment tracking and model registry, but users now have to choose a path before knowing anything about DVC, and these entrypoints feel lacking without understanding the fundamentals of data versioning.

I think we should streamline back to a single entrypoint where we introduce data versioning. When we had a single path, I recall users were more complimentary of our docs.

We can keep most of our existing get started docs as secondary entrypoints. Users who want to start there should still be able to do so independently.

Action items:

Related:

@dberenbaum dberenbaum self-assigned this Feb 23, 2024
@dberenbaum dberenbaum added A: docs Area: user documentation (gatsby-theme-iterative) p1-important Active priorities to deal within next sprints C: start Content of /doc/start labels Feb 23, 2024
@tapadipti
Copy link
Contributor

Add action point:

  • Studio docs should be used only for Studio-specific things like team management, billing, on-premises hosting, etc. The rest of the Studio docs should be consolidated with the dvc docs.

@dberenbaum You probably meant this in your Optimize secondary entrypoints action items, but I think it'll be better to mention this specifically, so that we don't miss it later.

@dberenbaum
Copy link
Contributor Author

@tapadipti I was trying to limit the scope of this issue, so I only included get started items. After I started #5133, I realized we might need a higher-level plan and to prioritize which changes will be most impactful.

I wanted to discuss issues like how to consolidate Studio docs in either #4381 or in a new issue. Can we keep that discussion separate?

@tibor-mach
Copy link
Contributor

I agree with the consolidation but I would include more than data management. The way I see this is that this first entry point should be a very quick tour of the main DVC features and concepts - so I'd pick data versioning, experiments, model registry...and mention other stuff in passing, linking to specific guides as you suggest (e.g. for DVC pipelines).

If all people see is data management but they are mostly concerned e.g. with properly versioning artifacts, they might lose interest. They might also assume that what we show them in the quickstart guide is most of what DVC offers (which I think is a reasonable assumption).

Also, I would not expect people to really be onboarded in the sense that the quickstart should not be enough to actually start working on your own stuff. But they should be on board in the sense that they that DVC can actually help them (and so is worth their time) and they know which guide(s) to read to address those issues.

@dberenbaum
Copy link
Contributor Author

Thanks @tibor-mach. Also adding a link to your slack discussion and proposal for context.

We have an intro to what dvc does on the front page of dvc.org today:

Screen.Recording.2024-02-27.at.7.50.22.AM.mov

...and in the readme.

We definitely need users to understand what DVC is for before showing them how to use it. Do we need to revise these, or add something similar (like your notebook) elsewhere, like in https://dvc.org/doc? Is the interactivity of a notebook important if we are not yet showing how to use it? What do you think about revising the existing notebook rather than creating another repo and notebook (consolidation rather than proliferation)?

I expected that by the time the users reach getting started, they are looking for how to use DVC. I have been more focused on reducing choices and guiding users on where and how to start. IMHO users struggle doing anything in our ecosystem until they understand how DVC versions data. We always end up referring back to how to track data with DVC, how to set up remote storage, etc.

@dberenbaum dberenbaum removed the p1-important Active priorities to deal within next sprints label Mar 1, 2024
@dberenbaum dberenbaum closed this as not planned Won't fix, can't repro, duplicate, stale Apr 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: start Content of /doc/start
Projects
No open projects
Status: In Progress
Development

No branches or pull requests

3 participants