Can it keep track of what it has already downloaded? #3
It seems the Reddit API uses [...]. To do this, I'd probably need to save a metadata file locally that stores the IDs of the latest data. Subsequent runs of the tool would first read the metadata file, if it exists, and use the IDs to fetch only the latest content. It would also need a new CLI option to enable this feature. Options might be:
Any ideas?
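The metadata-file idea above could be sketched roughly as follows. This is only an illustration in Python, not Orca's actual implementation: the file layout, the function names (`load_last_seen`, `save_last_seen`, `select_new`, `run_once`), and the assumption that items arrive newest-first as dicts with an `id` key are all hypothetical.

```python
import json
from pathlib import Path


def load_last_seen(path):
    """Return the saved IDs per content kind, or an empty dict on the first run."""
    p = Path(path)
    if not p.exists():
        return {}
    return json.loads(p.read_text(encoding="utf-8"))


def save_last_seen(path, last_seen):
    """Persist the latest IDs so the next run can pick up where this one stopped."""
    Path(path).write_text(json.dumps(last_seen, indent=2), encoding="utf-8")


def select_new(items, last_id):
    """Return the items that are newer than last_id.

    Assumes `items` is ordered newest-first. If last_id is None or is not
    found (for example, the item was deleted), every item is returned.
    """
    new_items = []
    for item in items:
        if item["id"] == last_id:
            break
        new_items.append(item)
    return new_items


def run_once(metadata_path, kind, fetch):
    """Fetch items of one kind and return only those not seen on a previous run.

    `fetch` is a hypothetical callable that returns a newest-first list of dicts.
    """
    last_seen = load_last_seen(metadata_path)
    new_items = select_new(fetch(), last_seen.get(kind))
    if new_items:
        # Remember the newest ID so the next run starts after it.
        last_seen[kind] = new_items[0]["id"]
        save_last_seen(metadata_path, last_seen)
    return new_items
```

Calling `run_once` from a cron job would then return everything on the first run and only newer items afterwards. One limitation of this sketch: if the saved ID no longer appears in the fetched list, it falls back to returning everything.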
I like either "cron" or "latest". You could also use "-r, --resume".
I went with an [...]. Orca now generates an [...].
So now just run the below command in a cron job to get only the latest data since the last download.
Make sure to copy or process the data files written to the file system before running Orca multiple times.
This should be able to track what it last downloaded and then download only what came after that.
That way, it could be "cron"-able and used as a regular backup of your data from Reddit.