Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
121 changes: 121 additions & 0 deletions hack/diff-toc-vs-template.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,121 @@
#!/usr/bin/env bash

# Copyright 2025 The Kubernetes Authors.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

set -o errexit
set -o nounset
set -o pipefail

# keep in sync with hack/verify-toc.sh
# TODO: dedupe
TOOL_VERSION=v1.1.0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds like the best is to dedupe now, we have 2 other instances of that in verify-toc.sh and update-toc.sh. Maybe something like https://github.com/kubernetes/kubernetes/blob/master/hack/lib/init.sh and we'll just do source "${KUBE_ROOT}/hack/lib/init.sh" here instead?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This would also allow us to get rid of the next few lines all the way to ROOT env setting, wdyt?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds like the best is to dedupe now, we have 2 other instances of that in verify-toc.sh and update-toc.sh. Maybe something like https://github.com/kubernetes/kubernetes/blob/master/hack/lib/init.sh and we'll just do source "${KUBE_ROOT}/hack/lib/init.sh" here instead?

We can track it with a go tools module, just didn't seem that important for this PR.

It's unlikely that we will have a critical patch in mdtoc and we don't actually care if these scripts have the same output because we're not writing back the output in this one, just using it to list the headings from two files with the self-same version of mdtoc.


# cd to the root path
ROOT="$(cd "$(dirname "${BASH_SOURCE[0]}")/.." && pwd -P)"
cd "${ROOT}"

# create a temporary directory
TMP_DIR=$(mktemp -d)
# cleanup
exitHandler() (
echo "Cleaning up..."
rm -rf "${TMP_DIR}"
)
trap exitHandler EXIT
# Perform go install in a temp dir as we are not tracking this version in a go
# module.
# If we do the go install in the repo, it will create/update go.mod and go.sum.
cd "${TMP_DIR}"
GO111MODULE=on GOBIN="${TMP_DIR}" go install "sigs.k8s.io/mdtoc@${TOOL_VERSION}"
export PATH="${TMP_DIR}:${PATH}"
cd "${ROOT}"

# Identify KEP files changed by the PR:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(from a release team perspective) it would be great to expand this to work for files that are already merged (in addition to open PRs)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, we just need a clean documented override for the base commit detection logic.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was focused on the idea of getting this to a CI check so it reports "this PR edits Y files which have Z missing sections" (which would most commonly be updates to PRR since the KEP was started)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the latest iteration you can set BASE_COMMIT=<commit hash> hack/diff-toc-vs-template.sh as one way to include more commits instead of the auto-detected set

# default from prow env if unset from args
# https://docs.prow.k8s.io/docs/jobs/#job-environment-variables
# TODO: handle batch PR testing
target=HEAD
base="${BASE_COMMIT:-}"
if [[ -z "${target:-}" && -n "${PULL_PULL_SHA:-}" ]]; then
target="${PULL_PULL_SHA}"
fi
# target must be a something that git can resolve to a commit.
# "git rev-parse --verify" checks that and prints a detailed
# error.
if [[ -n "${target}" ]]; then
target="$(git rev-parse --verify "${target}")"
fi
if [[ -z "${base}" && -n "${PULL_BASE_SHA:-}" && -n "${PULL_PULL_SHA:-}" ]]; then
if ! base="$(git merge-base "${PULL_BASE_SHA}" "${PULL_PULL_SHA}")"; then
echo >&2 "Failed to detect base revision correctly with prow environment variables."
exit 1
fi
elif [[ -z "${base}" ]]; then
# origin is the default remote, but we encourage our contributors
Copy link
Member Author

@BenTheElder BenTheElder Nov 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ported this commit back to the original script in kubernetes/kubernetes#135380

# to have both origin (their fork) and upstream, if upstream is present
# then prefer upstream
# if they have called it something else, there's no good way to be sure ...
remote='origin'
if git remote | grep -q 'upstream'; then
remote='upstream'
fi
default_branch="$(git rev-parse --abbrev-ref "${remote}"/HEAD | cut -d/ -f2)"
if ! base="$(git merge-base "${remote}/${default_branch}" "${target:-HEAD}")"; then
echo >&2 "Could not determine default base revision. -r must be used explicitly."
exit 1
fi
fi
base="$(git rev-parse --verify "${base}")"

echo "base: $base target: $target"

readonly template_readme='keps/NNNN-kep-template/README.md'

# get TOC for template
readonly mdtoc_options=(
# make sure to include all headings for this purpose even if we
# wouldn't surface them in the checked-in toc in update-toc.sh
'--max-depth' '100'
)
template_toc=$(mdtoc "${mdtoc_options[@]}" "${template_readme}")

result=0
# get KEP README files changed in the diff
kep_readmes=()
while IFS= read -r changed_file
do
# make sure to ignore the template kep itself, we don't want to self-diff
if [[ "${changed_file}" == "keps"*"README.md" ]] && [[ "${changed_file}" != "${template_readme}" ]]; then
kep_readmes+=("${changed_file}")
fi
done < <(git diff-tree --no-commit-id --name-only -r "${base}".."${target}")

for kep_readme in "${kep_readmes[@]}"; do
kep_toc=$(mdtoc "${mdtoc_options[@]}" "${kep_readme}")
echo >&2 "Diffing table of contents for $kep_readme:"
# diff only removals versus the template
# we don't care about _additional_ headings in the KEP
# we also don't care if (Optional) headings are missing
git diff <(echo "${template_toc}" ) <(echo "${kep_toc}" ) \
| grep -E '^-' \
| grep -v '(Optional)' \
|| result=-1
done


echo >&2 "Checked: ${kep_readmes[@]}"
echo >&2 "Result: ${result}"
exit "${result}"

8 changes: 4 additions & 4 deletions keps/NNNN-kep-template/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -91,8 +91,8 @@ tags, and then generate with `hack/update-toc.sh`.
- [Non-Goals](#non-goals)
- [Proposal](#proposal)
- [User Stories (Optional)](#user-stories-optional)
- [Story 1](#story-1)
- [Story 2](#story-2)
- [Story 1 (Optional)](#story-1-optional)
- [Story 2 (Optional)](#story-2-optional)
- [Notes/Constraints/Caveats (Optional)](#notesconstraintscaveats-optional)
- [Risks and Mitigations](#risks-and-mitigations)
- [Design Details](#design-details)
Expand Down Expand Up @@ -218,9 +218,9 @@ the system. The goal here is to make this feel real for users without getting
bogged down.
-->

#### Story 1
#### Story 1 (Optional)

#### Story 2
#### Story 2 (Optional)

### Notes/Constraints/Caveats (Optional)

Expand Down