Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outline of docs structure #4381

Open
dberenbaum opened this issue Mar 11, 2023 · 9 comments
Open

Outline of docs structure #4381

dberenbaum opened this issue Mar 11, 2023 · 9 comments
Labels
A: docs Area: user documentation (gatsby-theme-iterative) p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. type: discussion Requires active participation to reach a conclusion.

Comments

@dberenbaum
Copy link
Collaborator

dberenbaum commented Mar 11, 2023

Opening for discussion. We recently finished moving all the remote info from the command reference to distinct user guides. I think it's way more organized now, but I worry that it's too deeply nested inside user-guide->data-management->remote storage.

We know that this is some of the most frequently visited info, and so is user-guide->project structure->dvc.yaml files. Can we create a top-level reference section that includes remotes, project structure, CLI, and Python references? I think it would help make these pages easier to find and create a cleaner separation between more narrative guides and technical reference material.

Proposed structure would look like:

  • User guide
    • Data management
    • Pipelines
    • Experiment management
      ...
  • Reference
    • Project structure
    • Remote config
    • Commands
    • Python API
@dberenbaum dberenbaum added the type: discussion Requires active participation to reach a conclusion. label Mar 11, 2023
@shcheklein
Copy link
Member

Sounds good to me, @dberenbaum !

@shcheklein
Copy link
Member

Project structure should become more like a Project file or something? I don't like this name, but can't come up with something better ... dvc.yaml reference?

Remote config - I wonder if we should do one Config reference - include remotes there (w/o additional level, all things about remotes and config on the same level).

@dberenbaum
Copy link
Collaborator Author

Project structure should become more like a Project file or something? I don't like this name, but can't come up with something better ... dvc.yaml reference?

Agreed, I think we need to make it better than project structure, which sounds more like we are giving suggestions on how to organize your project. Right now, it includes more than dvc.yaml but also any other DVC-managed files.

Remote config - I wonder if we should do one Config reference - include remotes there (w/o additional level, all things about remotes and config on the same level).

Makes sense. Would we want subpages for each remote like we now have in the guide? Then it becomes as nested as it is now, so now I'm second guessing this 😅 .

@shcheklein
Copy link
Member

I think we can keep it:

Reference
   Config
       SSH
       Google Drive
       ...
       Cache 
       [Other config ....]

wdyt?

@dberenbaum
Copy link
Collaborator Author

It could work, or it might be too busy.

Let's put this as a p2 for now. We have already moved this info around a lot. I'd rather focus on get started and new content focused on basic workflows than on moving around the existing content.

@dberenbaum dberenbaum added the p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. label Mar 13, 2023
@dberenbaum
Copy link
Collaborator Author

Another benefit would be to have somewhere to mention environment variables

@dberenbaum dberenbaum added p1-important Active priorities to deal within next sprints and removed p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. labels Jul 18, 2023
@dberenbaum
Copy link
Collaborator Author

Bumping the importance here but also want to consider other changes to consolidate the dvc/mlops docs (dvc, dvclive, studio, vs code)

@dberenbaum dberenbaum added this to DVC Jul 18, 2023
@github-project-automation github-project-automation bot moved this to Backlog in DVC Jul 18, 2023
@dberenbaum dberenbaum removed the status in DVC Aug 22, 2023
@dberenbaum dberenbaum removed this from DVC Aug 22, 2023
@dberenbaum dberenbaum self-assigned this Oct 16, 2023
@dberenbaum dberenbaum added the A: docs Area: user documentation (gatsby-theme-iterative) label Oct 16, 2023
@dberenbaum dberenbaum added p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. and removed p1-important Active priorities to deal within next sprints labels Jan 5, 2024
@dberenbaum
Copy link
Collaborator Author

dberenbaum commented Feb 9, 2024

Prioritizing Integrations

Integrations is a section with its own importance in docs in B , thing that has proven important when including a tool in a fast-track scenario. In DVC, I have to go to DVCLive > ML Frameworks

𝙲𝙾𝙽𝙲𝙻𝚄𝚂𝙸𝙾𝙽: 𝚝𝚑𝚎 𝚌𝚘𝚗𝚝𝚛𝚒𝚋𝚞𝚝𝚘𝚛 𝚛𝚎𝚌𝚘𝚖𝚖𝚎𝚗𝚍𝚜 𝚝𝚘 𝚝𝚑𝚎 𝚖𝚊𝚒𝚗𝚝𝚊𝚒𝚗𝚎𝚛 𝚝𝚘 𝚛𝚎𝚌𝚘𝚐𝚗𝚒𝚣𝚎 𝚒𝚗𝚝𝚎𝚐𝚛𝚊𝚝𝚒𝚘𝚗𝚜 𝚊𝚜 𝚙𝚊𝚛𝚝 𝚘𝚏 𝚝𝚑𝚎 𝚏𝚊𝚜𝚝-𝚝𝚛𝚊𝚌𝚔 𝚜𝚌𝚎𝚗𝚊𝚛𝚒𝚘, 𝚊𝚗𝚍 𝚙𝚛𝚒𝚘𝚛𝚒𝚝𝚒𝚣𝚎 𝚊𝚌𝚌𝚎𝚜𝚜 𝚊𝚌𝚌𝚘𝚛𝚍𝚒𝚗𝚐𝚕𝚢.

Originally posted by @SoyGema in #4919 (comment)

Consider moving integrations to a top-level section if we do this reorg

@dberenbaum
Copy link
Collaborator Author

Combining with the ideas in #5153, we can use this ticket to discuss the overall structure of the docs. Here's a proposal of an ideal state to work towards:

  • Get started
    • Data pipelines
    • Experiment tracking
    • Model registry
  • Use cases (still not sure where to put these; should they be outside docs? before get started?)
    • Versioning data and models
    • CI/CD for machine learning (generalize beyond CI/CD to automated ML workflows? update to reference studio cloud compute?)
    • Fast and secure data caching hub (analytics show its unpopular; drop/rename/replace in the future with use case about using studio for secure data access?)
    • Experiment tracking
    • Model registry
    • Data registry
  • User guide
    • Data
    • Pipelines
    • Experiments
    • Models
  • Reference
    • Studio UI
    • Studio REST API
    • DVC Configuration
    • DVC Commands
    • DVC Python API
    • DVCLive Python API
    • GTO Commands

@dberenbaum dberenbaum changed the title Top-level reference section Outline of docs structure Feb 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: docs Area: user documentation (gatsby-theme-iterative) p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. type: discussion Requires active participation to reach a conclusion.
Projects
None yet
Development

No branches or pull requests

2 participants