-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documentation Clarification Needed #12
Comments
For dumps, you need to decompress the zstd files. The scripts should be ran outside of the docker container. If you used the default postgres installation specified in the README, you shouldn't have to change the scripts. The database should be running on
Could you elaborate on this? Thank you for the feedback, I agree the docs need to be improved. |
I had incorrectly assumed the postgres setup was stored in the container but it seems it was stored in the working directory, so every time I recreated the dockers, the password was stuck at my altered one. I wiped that and started again and looks like the scripts will work now. apt install zstd and zstd --memory=2048MB -d submissions.txt are what I found on debian/ubuntu.
I just meant that postgres container didn't seem to have access to the scripts but doesn't really matter, since I was supposed to run the scripts outside the containers anyway. I'm stuck at the index.py script now, however. I'm using 2009-03 from the full dump, both submissions and comments decompressed and imported. However, the index.py script seems to not recognize any subreddits. I've mainly been trying "iphone" since basedbin.org shows entries from those dates.
As an aside, is there a way to simply import all subreddits to index.py? This was my initial intention, so even if I got this working, doing it one-by-one even if I found a list online would be rather tedious. |
It's saying that the submissions table doesn't exist? That's very odd. Can you verify the following?
You an do that with I noticed you said you wiped/reset your database after running the container. The container runs a set of scripts that sets up the postgres container on startup. You would need to delete and rerun your container if you haven't done so already |
I haven't made a script that does that but you could in theory make a script that downloads and decompresses all the dumps into a directory and then runs the load scripts on each file. |
I'm having some trouble with this too, except no errors. The docker container is running using docker compose with no errors. The scripts ran without errors. I'm able to view the frontend at localhost:8080. But the subreddit data isn't there. When running from inside the container EDIT: EDIT 2: EDIT 3: EDIT 4: Uncaught (in promise) ReferenceError: setThreads is not defined EDIT 5: |
This comment was marked as outdated.
This comment was marked as outdated.
Line 30 in 5b81895
8080:80 to 80:80
|
This works, thanks! |
I'm running into a number of errors attempting to get this running. If any of them are real errors and not my own mistakes, then I will create separate issues for them. For now, I am assuming this is my own misunderstanding of the instructions, hence this issue requesting better documentation.
I can't for the life of me figure this second one out. I tried searching the codebase for references to the submissions zst files and couldn't find anything.
For number 3, I found I needed to get the first script running:
python3
python3-pip
pip install pyscopg2-binary
I ran the first script on reddit/submissions/2023-09.zst. I am unsure if that's what I'm supposed to do. Anyway, I tried running it from both docker exec inside the redarc container and from outside the container. Either way, I would get some sort of connection error. Wrong password or connection refused, depending on... I don't know. Oddly, it seems to be attempting to connect to localhost? That's not where the postgres db is. And the working directory is only in the redarc container. Maybe I'm misunderstanding this.
The web frontend does load. But obviously as above, there's no subreddits listed.
Cheers and thanks for the excellent frontend.
The text was updated successfully, but these errors were encountered: