Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Efficient periodical backup. #205

Open
indrakaw opened this issue Jan 16, 2019 · 4 comments
Open

Efficient periodical backup. #205

indrakaw opened this issue Jan 16, 2019 · 4 comments

Comments

@indrakaw
Copy link

indrakaw commented Jan 16, 2019

Please take a look:

https://drive.google.com/drive/folders/16fMyKcBfo5mtLr-jbZZEj4AovM1SaAB9

The case is I'm backing up a Tumblr blog periodically, not incremental. Eg.

tumblr_backup.py -j -I i --save-video-tumblr --save-audio -p ${T_YEAR} -O ${T_NAME}-${T_YEAR} ${T_NAME}

This will produce something like staff-2004/, staff-2005/, etc.

The problem is, It starts from begin to the latest. Eg, I just want to download a blog on period 2006, but it will scan from 2004 to 2016, then download that period. Imagine, the blog has 100,000+ posts and is 8 years old.

@bbolli
Copy link
Owner

bbolli commented Jul 4, 2019

There's nothing I can do about this. The API only allows sequential access, so scanning is needed.

@bbolli bbolli closed this as completed Jul 4, 2019
@cebtenzzre
Copy link
Collaborator

Sequential access only? Isn't there a before parameter, and a potentially arbitrary offset parameter (for binary searching)?

@indrakaw
Copy link
Author

indrakaw commented Jul 10, 2019

@Cebtenzzreep That what I meant. It would faster than way.

Imagine if you have to wait for tumblr_backup to index whole posts on a blog that over 8 years old. Downloading posts from specified year would be a pain. Especially if what you were doing is backing up the posts sperated by years: you have to do start over, start over again, and over again.

@indrakaw
Copy link
Author

I requested the same thing, and they done it.
mikf/gallery-dl#337

I haven't archived the goal since gallery-dl has limited API than tumblr_backup.py, I have no idea why?

@cebtenzzre cebtenzzre reopened this Aug 19, 2019
cebtenzzre added a commit to cebtenzzre/tumblr-utils that referenced this issue Sep 24, 2020
cebtenzzre added a commit to cebtenzzre/tumblr-utils that referenced this issue Sep 29, 2020
cebtenzzre added a commit to cebtenzzre/tumblr-utils that referenced this issue Oct 2, 2020
cebtenzzre added a commit to cebtenzzre/tumblr-utils that referenced this issue Nov 25, 2020
cebtenzzre added a commit to cebtenzzre/tumblr-utils that referenced this issue Jan 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants