Adds video upload #1414

fitnr · 2020-08-11T23:08:51Z

This squashes the changes in #929 down to a single commit, incorporating comments from @Harmon758.

Some of this work is due to @jamesandres and @Choko256.

This is currently in draft because I have poor internet for the next week and testing video upload isn't feasible. We've waited five years, so another week should be fine. 😄

* And safe import urllib in both py2/3
* rename methods to match API
* DRY max sizes, put in class in case twitter changes them
* use single media_upload for image and video, send to standard or chunked upload based on mime type and size
* finalize needs ModelParser, not RawParser

Add minimum chunk size of 16K, keeps number of chunks under 999

Harmon758 · 2020-08-12T10:03:39Z

I think it would be preferable to keep the original commits if possible, to keep credit where it's due.

fitnr · 2020-08-14T13:14:39Z

How about I squash them all into a single commit with multiple authors?

Harmon758 · 2020-08-14T13:42:03Z

Is there a reason it needs to be a single commit?

fitnr · 2020-08-14T14:15:52Z

Yes. The main branch has changed so much since this was introduced that rebasing each commit is tiresome.

Harmon758 · 2020-08-14T14:42:28Z

Wouldn't it be possible to use the original commits already in your video-upload2 branch and make a merge commit that merges the main branch and resolves any conflicts? I think if you really wanted to, you could even just copy all the code as it is in this commit and paste it as part of the merge commit. Although, it'd be preferable for the additional changes in this commit to be separate from changes for conflict resolution as well. Another reason for this is that it'd be easier to specifically review the changes you made in addition to what's already in #929 and the additional commits you have in video-upload2 than to re-review the entire thing.

fitnr · 2020-08-22T22:37:55Z

I've force-pushed a new version that has the complete history.

Harmon758

Since the merge commit itself had all the changes, I went ahead and re-reviewed the entire PR.

Although some of the issues I pointed out in my initial review of #929 were addressed, there seem to have been some regressions as well. Did you end up testing this PR yet?

My initial review that the interaction between upload_chunked and _chunk_media is messy also still stands.
_chunk_media should be split into three separate methods, corresponding to INIT, APPEND, and FINALIZE.
They don't need to be public methods, but each should be a bound API method so that the logic in upload_chunked can be refactored.
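A rough sketch of the split suggested above, assuming a hypothetical `send(params, data=None)` callable that stands in for the bound API call to the upload endpoint. None of the function names below come from this PR; the command names and the `total_bytes`, `media_type`, `media_id`, and `segment_index` parameters follow Twitter's chunked media upload endpoint.

```python
import io


def upload_init(send, total_bytes, media_type):
    """INIT: announce the upload and get a media_id back."""
    response = send({
        "command": "INIT",
        "total_bytes": total_bytes,
        "media_type": media_type,
    })
    return response["media_id"]


def upload_append(send, media_id, segment_index, chunk):
    """APPEND: send one chunk, identified by its segment index."""
    send({
        "command": "APPEND",
        "media_id": media_id,
        "segment_index": segment_index,
    }, data=chunk)


def upload_finalize(send, media_id):
    """FINALIZE: tell the server that every chunk has been sent."""
    return send({"command": "FINALIZE", "media_id": media_id})


def upload_chunked(send, fp, total_bytes, media_type, chunk_size):
    """Drive the three steps; this function holds only the loop."""
    media_id = upload_init(send, total_bytes, media_type)
    segment_index = 0
    while True:
        chunk = fp.read(chunk_size)
        if not chunk:
            break
        upload_append(send, media_id, segment_index, chunk)
        segment_index += 1
    return upload_finalize(send, media_id)


# Hypothetical stand-in transport: records calls instead of using the network.
calls = []


def fake_send(params, data=None):
    calls.append((params["command"], params.get("segment_index")))
    return {"media_id": 1}


result = upload_chunked(fake_send, io.BytesIO(b"x" * 10), 10, "video/mp4", 4)
```

With each command in its own function, the loop in `upload_chunked` no longer needs to know how any individual request is built, which is the refactoring the comment asks for.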

Also, I'm not sure how much of fitnr@4a3e2cf made it into this PR. Are there missing fixes/improvements from that commit?

Harmon758 · 2020-08-22T22:50:34Z

tweepy/api.py

@@ -5,14 +5,24 @@
 import imghdr
 import mimetypes
 import os
-


This isn't necessary. Separating standard library imports from third party imports is consistent with PEP 8.

Harmon758 · 2020-08-22T23:19:55Z

tweepy/api.py

-            max_size = 14649
+        size = os.path.getsize(filename)
+
+        if file_type == 'gif' or file_type in CHUNKED_TYPES:


The first part of this or is redundant; 'gif' is in CHUNKED_TYPES.

I'm not sure what this if-else statement is for. The only cases where the conditional is false are when the file type is not supported, or when imghdr.what is unable to determine the file type from the header of a valid image file type (e.g. #1411) and the fallback mimetypes.guess_type determines the file type instead.
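To illustrate the point about the redundant comparison, here is a minimal sketch. It assumes a hypothetical `CHUNKED_TYPES` tuple (the actual contents in tweepy/api.py may differ) and uses only `mimetypes.guess_type` for detection rather than the imghdr-first logic in the PR; the helper names are invented for this example.

```python
import mimetypes

# Hypothetical contents; the tuple in the PR may differ.
CHUNKED_TYPES = ("gif", "mp4")


def guess_file_type(filename):
    """Return the subtype of the guessed MIME type, or None if unknown."""
    mime_type, _ = mimetypes.guess_type(filename)
    if mime_type is None:
        return None
    return mime_type.split("/")[1]


def needs_chunked_upload(filename):
    # A single membership test also covers 'gif', because it is in
    # CHUNKED_TYPES; no separate `file_type == 'gif'` check is needed.
    return guess_file_type(filename) in CHUNKED_TYPES
```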

tweepy/api.py

Harmon758

As @Maradonna90 pointed out, these references weren't updated from #929.

tweepy/api.py

fitnr · 2020-09-06T16:37:10Z

Thank you @Harmon758 and @Maradonna90 for your help reviewing. I've incorporated your suggested changes.
In particular:

* New global constants are used for tracking the MIN, MAX and DEFAULT chunk sizes. All of these constants are stored in KiB, and multiplied by 1024 when comparing to bytes.
* Enhancement: the api.media_upload method will use chunked upload for images that exceed the standard upload size limit.
* Repeated logic around checking file types has been removed/simplified.
* I added two public domain sample files (gif and mp4) and three methods (and cassettes) around the media upload endpoint.
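The KiB-based constants described above could work roughly as follows. This is a sketch under assumptions: the constant and function names are hypothetical, the 16 KiB minimum and 1 MiB default are taken from the commit messages in this PR, and the 5 MiB maximum is an assumed value, not one confirmed in this thread.

```python
import math

# Hypothetical names and values, in KiB. 16 and 1024 come from the
# commit messages in this PR; 5 * 1024 is an assumed upper bound.
MIN_CHUNK_SIZE_KIB = 16
DEFAULT_CHUNK_SIZE_KIB = 1024
MAX_CHUNK_SIZE_KIB = 5 * 1024


def clamp_chunk_size(chunk_size_kib=DEFAULT_CHUNK_SIZE_KIB):
    """Clamp a requested chunk size to the allowed range; return bytes."""
    clamped = max(MIN_CHUNK_SIZE_KIB, min(chunk_size_kib, MAX_CHUNK_SIZE_KIB))
    return clamped * 1024


def chunk_count(file_size_bytes, chunk_size_kib=DEFAULT_CHUNK_SIZE_KIB):
    """Number of APPEND calls needed for a file of the given size."""
    return math.ceil(file_size_bytes / clamp_chunk_size(chunk_size_kib))
```

Keeping the constants in KiB and converting in one place means the multiplication by 1024 happens only inside `clamp_chunk_size`, so callers never mix units.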

savetz · 2020-09-08T14:47:50Z

I am eager for the video upload feature, and respectfully ask that it be added to tweepy as soon as possible. Thank you all for your work on this project.

Maradonna90 · 2020-09-08T17:17:28Z

If I upload a bigger video and try to post a tweet with it, I can get error code 324 ("Not valid video"). This happens because the upload hasn't finished yet.

I checked the Twitter API and found a method that checks the upload progress of media. I wrote a small function in my project to use it.

from tweepy.binder import bind_api


def get_media_upload_status(api, *args, **kwargs):
    """:reference: https://developer.twitter.com/en/docs/twitter-api/v1/media/upload-media/api-reference/get-media-upload-status
    :allowed_param: 'command', 'media_id'
    """
    # Calls the upload endpoint; pass command='STATUS' and the media_id.
    return bind_api(
        api=api,
        path='/media/upload.json',
        payload_type='media',
        allowed_param=['command', 'media_id'],
        upload_api=True,
        require_auth=True
    )(*args, **kwargs)

Potentially, the media_upload method should only return the media_id once the upload has finished. The above function could be used within such a routine.

P.S.: An obvious quick fix is to use time.sleep(), but I think this is not intuitive for users and might cause significant slowdowns in the process (e.g. waiting for 30 s although the upload finished after 5).

fitnr · 2020-09-09T23:46:44Z

I've added a commit with @Maradonna90's get_media_upload_status; it's much more efficient than time.sleep(10).

Harmon758

I'm going to push this back to v4.0. This would be the only major feature to be added to v3.10, and with it being the last version to support Python 2.7 (and probably Python 3.5), I think it'd be better to have it be released with v4.0, where there are a lot of other new features planned, rather than including it in a version that I'd like to be as mature and stable as possible on release. I'd like to begin development of v4.0 as soon as possible, and if any bugs with v3.10 are found after there are commits to the master branch intended for v4.0, I'd probably have to make a separate v3.10.x branch to backport any fixes to release.

I also think there are still a lot of improvements to be made. For example, I'm still of my initial opinion that _chunk_media should be refactored into three separate methods.

Regardless, after v3.10 is released, I'll probably merge this into a new separate video-support branch and make a draft PR to merge that branch into the master branch. That will increase visibility and ease of access for anyone wanting to test the feature out before it's released. That way, I'll also be able to fix and improve any remaining issues that I find without having to go through another review (and potentially further ones). Of course, at that point, any PRs to that branch would be welcome if anyone else, @fitnr included, wants to make additional improvements.

Harmon758 · 2020-12-28T03:22:37Z

I've gone ahead and made a branch, https://github.com/tweepy/tweepy/tree/video-upload, and merged this into it.
I've also drafted PR #1486 merging it into the master branch.
I'll be making further improvements in that branch and PR, but as I said before, feel free to PR to that branch if anyone else wants to make additional improvements as well.

@vegit0 @savetz See https://tweepy.readthedocs.io/en/latest/install.html and https://pip.pypa.io/en/stable/reference/pip_install/#git.

LuisMayo · 2021-01-31T15:00:24Z

The [{'code': 324, 'message': 'Not valid video'}] error keeps happening
Shouldn't media_upload wait for it before returning?

Harmon758 · 2021-01-31T16:23:38Z

Are you using the video-upload branch / PR #1486? This PR has been superseded by that one.

Regardless, videos need to meet certain specifications for Twitter's API.

> Shouldn't media_upload wait for it before returning?

Wait for what?

LuisMayo · 2021-01-31T16:43:32Z

I'm using the "video-upload" branch.

Ok let me explain.
When I'm calling

 uploaded_media = api.media_upload(output_filename, media_category='TWEET_VIDEO')

I expect the function to return when the state of the upload is no longer "pending".
Instead, I think it's returning right after the "finalize" call. However, after calling finalize you still have to wait until Twitter has finished processing the video, as explained in the docs:

> it may also be necessary to use a STATUS command and wait for it to return success before proceeding to Tweet creation.

I feel that, just as the API handles the whole process from start to finalize, it should also wait for STATUS to stop being "pending" instead of leaving that to be handled outside this library.

I have currently fixed it on my own code using a while loop and waiting for the proper state like this:

import time

uploaded_media = api.media_upload(output_filename, media_category='TWEET_VIDEO')
# Poll the STATUS command until processing is no longer pending.
while uploaded_media.processing_info['state'] == 'pending':
    time.sleep(uploaded_media.processing_info['check_after_secs'])
    uploaded_media = api.get_media_upload_status(uploaded_media.media_id_string)
api.update_status('@' + tweet.author.screen_name + ' ', in_reply_to_status_id=tweet.id_str, media_ids=[uploaded_media.media_id_string])

I hope it's clear now.
Thanks

Harmon758 · 2021-01-31T16:59:10Z

Ah, I see. Thanks for the feedback.
I think I'll probably add a kwarg to allow waiting for the async finalize process to finish.
I'll look into it later and let you know in #1486.
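One way such waiting could look, as a sketch only. It assumes a hypothetical `get_status(media_id)` callable that returns the `processing_info` dictionary from a STATUS call, and the name `wait_for_processing` is invented here. The `state` values other than `pending` (`in_progress`, `succeeded`, `failed`) are assumed from Twitter's media upload documentation and are not confirmed in this thread.

```python
import time


def wait_for_processing(get_status, media_id, sleep=time.sleep, max_checks=60):
    """Poll until processing leaves the pending/in_progress states."""
    for _ in range(max_checks):
        info = get_status(media_id)
        # No processing_info means there is nothing to wait for.
        if info is None or info["state"] == "succeeded":
            return True
        if info["state"] == "failed":
            return False
        # Still pending or in progress: wait as long as the server asks.
        sleep(info.get("check_after_secs", 1))
    raise TimeoutError("media processing did not finish in time")


# Hypothetical stand-in status source for demonstration; no network access.
states = iter([
    {"state": "pending", "check_after_secs": 2},
    {"state": "in_progress", "check_after_secs": 3},
    {"state": "succeeded"},
])
waits = []
ok = wait_for_processing(lambda media_id: next(states), 123, sleep=waits.append)
```

Unlike a loop that only checks for `pending`, this version keeps waiting through `in_progress` and stops early on `failed`, and the `max_checks` bound keeps it from polling forever.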

Harmon758 · 2021-02-21T05:31:09Z

The video-upload branch / pull request #1486 should be complete now.
Any feedback or review would be appreciated.

Michael Chacaton and others added 9 commits May 5, 2017 09:50

Video Upload

4643d70

fix file type sanity check in upload

63bea16

update default chunk size to 1MB from 4K

84ba912

Add minimum chunk size of 16K, keeps number of chunks under 999

allow media_upload with file objects

26a0e08

check error on posting data

c7af7fb

check error

95c93e6

Setting media_category when needed for bulk upload of larger files

9df1077

Fix to allow proper passing of media_category key

3fccf68

Merge branch 'video_upload2' into video-upload-3

4c862fe

fitnr force-pushed the video-upload-3 branch from f5bb916 to 4c862fe August 22, 2020 22:31

fitnr marked this pull request as ready for review August 22, 2020 22:37

Harmon758 self-assigned this Aug 22, 2020

Harmon758 self-requested a review August 22, 2020 22:47

Harmon758 added the Feature and Improvement labels Aug 22, 2020

Harmon758 requested changes Aug 22, 2020

Harmon758 added the Need Follow-Up label Aug 22, 2020

Harmon758 linked an issue Aug 23, 2020 that may be closed by this pull request

Support video upload #640

Closed

Maradonna90 reviewed Aug 31, 2020

Harmon758 requested changes Aug 31, 2020

fitnr added 2 commits September 6, 2020 12:31

Refactor flow around chunked uploads

e14b872

add media upload tests

84abf93

fitnr requested a review from Harmon758 September 9, 2020 17:31

Add media_upload_status method

5dfe5ca

Harmon758 reviewed Dec 25, 2020

Harmon758 removed the Need Follow-Up label Dec 25, 2020

Harmon758 added this to the 4.0 milestone Dec 25, 2020

This was referenced Dec 25, 2020

Support video upload #640

Closed

Video upload (take 2) #929

Closed

Merge branch 'master' into video-upload-3

ca633c4

Harmon758 changed the base branch from master to video-upload December 28, 2020 02:51

Harmon758 merged commit 7487c20 into tweepy:video-upload Dec 28, 2020

Harmon758 mentioned this pull request Dec 28, 2020

Rework media uploading #1486

Merged

Harmon758 added the Documentation and Tests labels Dec 28, 2020

fitnr deleted the video-upload-3 branch February 21, 2021 13:44
