Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider adding a script to automatically clean old files #24

Closed
bparks13 opened this issue Aug 13, 2024 · 12 comments · Fixed by #53
Closed

Consider adding a script to automatically clean old files #24

bparks13 opened this issue Aug 13, 2024 · 12 comments · Fixed by #53
Assignees

Comments

@bparks13
Copy link
Member

When using Sphinx, there was an option in the make command to clean a directory to remove stale files that are not correctly being served anymore.

There is no readily available option for this in docfx that I have found, but we should be able to create a script that can automatically remove files from the _site directory, remove any existing SVG files that might have an old filename attached to them, and delete the files from the api directory in the case that the submodule has been updated and file names have changed.

There might be other things we can automate with this to help keep the working directory clean, but I think this covers the majority of cases.

@cjsha
Copy link
Member

cjsha commented Aug 13, 2024

super down for this. This annoys me pretty often

@cjsha cjsha self-assigned this Aug 19, 2024
@jonnew jonnew added this to the Infrastructure milestone Aug 19, 2024
@bparks13
Copy link
Member Author

Based on proposed changes in #28, we will have to be careful not to remove any *.md files in the api folder.

@cjsha
Copy link
Member

cjsha commented Aug 19, 2024

ok yeah. I think I can put them in a different directory without changing the URL. We should do that with the automatically exported SVGs as well, right? I just assume it's simpler to delete directories without sorting file extensions. I'll make it a .bat or .ps1 script unless there are any objections.

@bparks13
Copy link
Member Author

The SVGs are fairly simple, since we can assume that the only filetype to be deleted in the workflows directories are *.svg files. Plus, we really don't want to delete all of the *.bonsai files in those directories!

I more so wanted to make sure we were excluding the correct files from other directories (i.e., don't delete .gitignore and *.md files). Moving these markdown files out makes it easier. I have no preference on the script type, as long as we highlight it in the README so everyone understands how to use it.

@cjsha
Copy link
Member

cjsha commented Aug 27, 2024

I want to extend this issue to making a script for linkchecking and wanted to know what you think and if you have any input @bparks13

@cjsha cjsha mentioned this issue Aug 27, 2024
@bparks13
Copy link
Member Author

@cjsha are you thinking about making two scripts, one for linkchecking and the other for cleaning? I think that's a really good idea actually, but it might require that we package lychee in some way, or provide specific instructions on how to set it up for use with a script. In general I am a fan of scripting things and automating them to a degree.

@cjsha
Copy link
Member

cjsha commented Aug 27, 2024

To do a proper linkcheck, the script should start from a clean build anyway, so I was thinking I would just put both in one script and depending on what the user wants to do they could run ./docfx-util.ps1 -clean or ./docfx-util.ps1 -linkcheck <lychee/path>. Running the command with both ./docfx-util.ps1 -clean -linkcheck <lychee/path> arguments is equivalent to running it with just ./docfx-util.ps1 -linkcheck <lychee/path>. And we can add more functionality in the future as we see fit. I agree it would be nice to package it someway so we don't have to specify the path, but for now, what do you think?

@bparks13
Copy link
Member Author

Hmm, I can see the benefit of packing a utility script all together.

If we go this route, I would recommend that we keep the utilities totally separate and only run when the explicit flag is given. So then here is my proposal:

  • -clean: deletes old files and SVGs as discussed previously
  • -linkcheck: With no arguments this simply runs .\build.ps1 --logLevel Warning --warningsAsErrors and dotnet DocLinkChecker -v -f .github/workflows/DocLinkChecker.config
    • To me, this is the first level of link checking, since it is mostly internal to the docfx system
  • -linkcheck /path/to/lychee.exe: Runs the first level of link checking, and also runs the lychee executable using the same commands as found in the actions file
  • -full or -full /path/to/lychee: This would accomplish what you suggest as the -linkcheck flag, which is to say this would run the clean command, then do a full rebuild and check all links

As I write this out I wonder if there is some way to read the actions file and directly pull out the commands from there? We don't want to get into a situation where the two scripts become desynchronized, because then local builds could succeed when the actions fail, or vice versa.

@bparks13
Copy link
Member Author

Also, to stick with convention, any words we use as flags should be preceded by --, and if we wanted to add short-form flags those would be -. So we could have --clean and -c.

@cjsha
Copy link
Member

cjsha commented Aug 27, 2024

As I write this out I wonder if there is some way to read the actions file and directly pull out the commands from there? We don't want to get into a situation where the two scripts become desynchronized, because then local builds could succeed when the actions fail, or vice versa.

Ah great idea. I did a brief search, and I came across the suggestion to put all the linkcheck commands in the powershell script and to run the powershell script during the GitHub Actions build process (instead of trying to pull the commands into the powershell script from the build.yml). This seems like the most simple way to keep these things synchronized.

Also, to stick with convention, any words we use as flags should be preceded by --, and if we wanted to add short-form flags those would be -. So we could have --clean and -c.

I had the same thought but it seems Powershell scripts have -parameterKey <parameter-value>/-parameterKey:<parameter-value> syntax already built-in, and I didn't want to look much into getting around that while we have other past due milestones/issues.

  • -clean: deletes old files and SVGs as discussed previously
  • -linkcheck: With no arguments this simply runs .\build.ps1 --logLevel Warning --warningsAsErrors and dotnet DocLinkChecker -v -f .github/workflows/DocLinkChecker.config
    • To me, this is the first level of link checking, since it is mostly internal to the docfx system
  • -linkcheck /path/to/lychee.exe: Runs the first level of link checking, and also runs the lychee executable using the same commands as found in the actions file
  • -full or -full /path/to/lychee: This would accomplish what you suggest as the -linkcheck flag, which is to say this would run the clean command, then do a full rebuild and check all links

This is fine. It is annoying to export all .svgs every time if I just wanna do a lychee linkcheck.

Do we prefer -linkcheck or -checklinks? I realize I have the -checklinks implemented, but I wrote -linkcheck all over my previous comment.

@cjsha
Copy link
Member

cjsha commented Aug 27, 2024

I'm also not immediately sure there's be a way to run -linkcheck without additional arguments so I might end up separating the 2nd and 3rd bullet points into -linkcheck and -lychee <path/to/lyche>

@bparks13
Copy link
Member Author

put all the linkcheck commands in the powershell script and to run the powershell script during the GitHub Actions build process

That sounds great to me, let me know whenever this is finalized and merged, then I'll work on integrating the script into the actions. I think that makes the most sense

I had the same thought but it seems Powershell scripts have -parameterKey /-parameterKey: syntax already built-in

Got it, I'm not as familiar with Powershell but if that's already there then we can utilize it

This is fine. It is annoying to export all .svgs every time if I just wanna do a lychee linkcheck.

What would be an intermediate then? If we are already splitting functionality, maybe we can word it in a way so that the lychee check is separate from the other checks? Also I'm fine with -checklinks.

Maybe we split it up so that we have:

  • -clean: see above
  • -checklinks: builds and runs DocLinkChecker
  • -lychee /path/to/lychee.exe: only runs lychee
  • -full /path/to/lychee.exe or -all or something like that, I don't love either of these but in that vein

Obviously -full (or whatever we decide on) will run everything, but if we only want to run the -lychee command we can.

cjsha added a commit that referenced this issue Aug 27, 2024
- Add a powershell script to clean files and check links
- Add note in README regarding the bullet point above
- Fix typos in README
Misc changes not related to above issues:
- index.md configured with landing page metadata in docfx.json and
  removed relevant metadata from the file itself
- docfx.json files files entry refactored to be more succinct
@cjsha cjsha closed this as completed in #53 Sep 5, 2024
cjsha added a commit that referenced this issue Sep 5, 2024
…ly (#53)

* Resolve issues #24 & #43

- Add a powershell script to clean files and check links
- Add stuff in README regarding the bullet point above including a dedicated section
- Fix typos in README

Miscellaneous:
- docfx.json files entry refactored to be more succinct
- index.md configured with landing page metadata in docfx.json and
  removed relevant metadata from the file itself
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants