Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about PyPi releases #700

Open
MarieRoald opened this issue Feb 16, 2024 · 1 comment
Open

Question about PyPi releases #700

MarieRoald opened this issue Feb 16, 2024 · 1 comment

Comments

@MarieRoald
Copy link

Hello! I'm trying to understand the releases on PyPi. PyPi lists two releases: 3.1.2 and 3.1.1 (https://pypi.org/project/whisperx/), both published on February 6th this year. But here on GitHub, the latest release was 3.1.1 (May 13th 2023). Are the versions on PyPi official releases by the WhisperX team?

@gillens
Copy link
Contributor

gillens commented Feb 21, 2024

I have no idea myself but I just diffed the PyPi 3.1.2 release with this repo. At first I thought that the 3.1.2 release on there was simply the latest commit on the main branch when they uploaded it, but they seem to have changed some small things. It's based off commit 06e30b2 from Jan 1 2024. From there, they include these currently-open PRs:

And these other changes:

  • change version to 3.1.2 and readme to be PyPi-friendly
  • revert commit 5dfbfcb
  • use pyannote.audio 3.1.1 instead of 3.1.0 (this is now merged)

I assume the upload is unofficial as the PyPi maintainer does not have commits to this repository, at least under their linked GitHub account.

Used this script to find the commit with the smallest diff to the extracted PyPi tar:

Script

#!/usr/bin/env bash

REPO_PATH="/tmp/whisperX"
PYPI_PATH="/tmp/whisperx-3.1.2"

cd "$REPO_PATH"

BEST_MATCH=""
MIN_DIFFS=1000000 # Arbitrarily large number

# Iterate over commits
for commit in $(git rev-list --all --max-count=50); do
  # Check out the commit
  git checkout $commit &> /dev/null

  # Compare the commit against the PyPi package
  DIFFS=$(diff -urN --exclude=".git" --exclude="build" "$PYPI_PATH" "$REPO_PATH" | wc -l)

  echo "Commit $commit has $DIFFS differences"

  # Update the best match if this commit has fewer differences
  if [ $DIFFS -lt $MIN_DIFFS ]; then
    BEST_MATCH=$commit
    MIN_DIFFS=$DIFFS
  fi
done

echo "Best match: $BEST_MATCH with $MIN_DIFFS differences"

git checkout $BEST_MATCH

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants