-
-
Notifications
You must be signed in to change notification settings - Fork 975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Large Twitter Galleries Not Fully Downloading #2226
Comments
This is a limit on twitter's end, unfortunately. |
I personally ended up having to make a PowerShell script that would cycle through two week intervals since the user's registration date using Twitter search queries, with one day of overlap in both from: and until: so nothing gets skipped over (with Even still, I don't think that will get everything, because Twitter is dumb like that. It caps out retrieving a user's timeline at around 3200 tweets (and that includes retweets). |
What usually works for me is using a search query to download: |
twitter shadow hide all nsfw from search feature so i think there's no way to download all, you must use |
From my experience, it seems people usually don't flag stuff as "sensitive content". I've seen that with both art and irl stuff. I've been using the api to get the total tweet count of 100% nsfw accounts and compared it against how much I could get from the search results and it's usually most of the tweets. I tried searching a nsfw tag in the browser and tweets marked by Twitter as "sensitive content" (ones that require you to verify you want to see it) still popped up occasionally. |
it's been blocked since 2020 |
Bumping this because I'm trying to consolidate my various twitter archives. I have a lot of content that was downloaded from twMediaDownloader and I planned to merge this content with gallery-dl using twitter-click-and-save to minimize file duplication by hardlinking across the drive. My example case is casulcasulcasul with the following settings:
As this account seems to fall under 1000 media tweets, I try Out of the 970 media files twMediaDownloader calculated (using dryrun), gallery-dl using the above command downloaded 944, seemingly omitting anything earlier than November 2019. Using I don't quite know what/where the issue is, but the two tweets gallery-dl seems to miss are this one and this one. Manually grabbing those tweets with gallery-dl downloads just fine. EDIT: I guess theoretically, you could use twMediaDownloader to generate a list of media tweets and use the .csv file it provides as input for gallery-dl. 🤔 EDIT2: Some issues are stating to include |
Are you sure You can use both Or you can download latest artifact and simply paste |
I will scream it from the rooftops:
Addding |
@cglmrfreeman use
|
Huh, that one worked. I don't think I've ever seen anyone suggest that before. It's always "copy the twitter search url" I will def be using this from now on, thanks! |
You probably should have put the link in double quotes But as mikf said plain spaces are fine too. In double quotes as well. |
No, The function that parses query parameters does not "support" them, meaning it just returns |
I see. Well, it still works with twitter specifically. Pluses in a query string are just ignored by twitter (or treated as spaces). |
Oh, so the "NotFoundError"s are a bug introduced with 77bdd8f. This commit splits search queries by whitespace only, and throws an error because there is no user named |
Ah I only recently started using gallery-dl for twitter archiving and I definitely updated after that, so that might explain it. |
I see x2, I'm on latest stable ver, so I didn't notice. I thought you would leave the behavior for search as it was. I guess you should also consider that there can be multiple |
... and do not raise exception if searched user does not exist
For smaller galleries |
If you mean retweets you should add |
I don't mean retweets. |
Yeah, there's 2 posts which don't appear in the search at all. Even without filters. |
Popping back in here to say after fairly extensive testing, |
usually username and username/media, but pretty sure if their twitter have so many retweet and media, you can't get all, tries some 5-10k tweet to see, that's twitter limit |
I tried to download every MP4 from this gallery (NSFW), and it only went as far back as this tweet (also NSFW). After that tweet, it just stopped and acted as though it had downloaded the entire gallery, meaning that any older tweets, such as this one, were excluded.
If I don't use the link for the media tab, it stops at an even more recent tweet (NSFW).
For reference, the command I ran was
gallery-dl "https://twitter.com/furui_1111/media" --filter "extension in ('mp4')"
The text was updated successfully, but these errors were encountered: