-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bugfix: django.db.utils.IntegrityError: UNIQUE constraint failed: core_snapshot.timestamp #412
Comments
Happens for me as well. Archivebox version: I experimented a bit and managed to consistently reproduce. I suspect the urls that have a suffix in the timestamp are causing it.
|
Very helpful @karlicoss! This is high on our priority list of things to fix. I'll check in with an update once we've started working on this. I suspect it's a relatively simple bug in the timestamp deduping code, most of the work will be QA and testing to make sure we don't introduce any regressions while we fix it. For context, timestamp deduping has been one of the most brittle parts of ArchiveBox in the past years, and we already have plans to remove the need for it in a refactoring in the next major version. |
I unfortunately ran into this issue as well. From my testing, I agree with @karlicoss and his assessment that it is related to the timestamp suffixes. I am trying to pin it down further than that, I'll reply if I figure anything out. Thanks |
Nothing works for me anymore, sadly... Attempting to add any link, whether completely new or already archived, results in |
Hello @coisnepe @jrruethe @karlicoss @drpfenderson & everyone else, would you mind testing my |
@apkallum - Using your build, it gets a bit further. Modifies a few entries, and then gives following error:
EDIT: To be clear, this is using EDIT 2: Oops. Realized I had switched to Python 3.8 for another project and forgot to update-alternatives. Running |
Give the latest pip install --upgrade archivebox
# or if you use docker
docker pull nikisweeting/archivebox |
Used
Went to archive directory to run
Note: I'm not sure if you need the entire traceback each time, since most of it is identical, but figured more is better when hunting down bugs. Apologies if it's too much. |
Deployed the latest Docker image and it seems to have fixed the issue. Thanks so much! |
@drpfenderson let me know if you're still having any issues and we can reopen the ticket. |
@pirate Updated to newest.
same error, exactly, as my last log.
The rest of the log is exactly the same as well, line references and all. EDIT: I thought maybe I could try nuking it, starting from scratch. No dice, same error. I tried with docker and docker-compose as well, after removing the original package from pip. Same error in both, but with python3.8 instead. |
For what it is worth, v0.4.21 fixed the issue I was having regarding |
With the changes present in the |
Note I've added a new DB/filesystem troubleshooting area to the wiki that may help people arriving here from Google: https://github.com/ArchiveBox/ArchiveBox/wiki/Upgrading-or-Merging-Archives#database-troubleshooting Contributions/suggestions welcome there. |
Describe the bug
Y'all helped me with upgrading my super old archive to the django branch before official 0.4.9 release. I recently upgraded to the newest version, so I could start adding links. archivebox said I had to re-init.
archivebox init
gives me following error, and will not let me add new links.Full log/error below.
Steps to reproduce
git checkout master
to switch from django branch.git pull origin master
to pull new release.pip install -e .
(also tried withpip uninstall archivebox && pip install .
)archivebox init
.Screenshots or log output
Software versions
The text was updated successfully, but these errors were encountered: