Added mimetype to file data sent to model #39

ajbozarth · 2018-08-16T01:03:07Z

This is a follow up to IBM/MAX-Image-Caption-Generator#9 which added file extension verification using mimetype

Closes #38

This is a follow up to IBM/MAX-Image-Caption-Generator#9 which added file extension verification using mimetype

ptitzler · 2018-08-16T01:58:33Z

app.py

@@ -120,7 +121,8 @@ def valid_file_ext(filename):

 # Runs ML on given image
 def run_ml(img_path):
-    img_file = {'image': open(img_path, 'rb')}
+    mime_type = mimetypes.guess_type(img_path)[0]


Have you tested what happens when no mime-type can be guessed?

I tested with the three valid extensions, any other extension would be rejected before getting to this line. Given this library guesses mime type by the file extension (IIUC), that should be enough. I originally used magic to do this (more robust, not a guess), but it requires installs outside pip, which I didn't want to add.

ptitzler · 2018-08-16T02:00:12Z

app.py

@@ -120,7 +121,8 @@ def valid_file_ext(filename):

 # Runs ML on given image
 def run_ml(img_path):
-    img_file = {'image': open(img_path, 'rb')}
+    mime_type = mimetypes.guess_type(img_path)[0]
+    img_file = {'image': (img_path, open(img_path, 'rb'), mime_type)}


If I'm not mistaken the file handle returned by open is never closed ...

Nice catch, odd that no one ever caught that before, not that it matter much given this file is opened in a thread. I'll close it.

MLnick · 2018-08-16T12:24:47Z

I see the Python 2.7 build on Travis failed to produce output. Do we know the issue?

MLnick · 2018-08-16T12:26:21Z

Changes LGTM

stevemar · 2018-08-16T13:51:15Z

@MLnick the travis job failed in a weird way, i'm restarting it:

No output has been received in the last 10m0s, this potentially indicates a stalled build or something wrong with the build itself.
Check the details on how to adjust your build configuration on: https://docs.travis-ci.com/user/common-build-problems/#Build-times-out-because-no-output-was-received
The build has been terminated

stevemar · 2018-08-16T13:55:19Z

app.py

-    img_file = {'image': open(img_path, 'rb')}
-    r = requests.post(url=ml_endpoint, files=img_file)
+    mime_type = mimetypes.guess_type(img_path)[0]
+    img_file = open(img_path, 'rb')


If you use with it'll be closed at the end of the block. See the last paragraph of https://docs.python.org/2/tutorial/inputoutput.html#methods-of-file-objects

with open(img_path, 'rb') as img_file: file_form = {'image': (img_path, img_file, mime_type)} r = requests.post(url=ml_endpoint, files=file_form) cap_json = r.json()

ajbozarth · 2018-08-16T15:11:57Z

Updated to use with per @stevemart comment

stevemar · 2018-08-16T15:22:20Z

app.py

@@ -120,8 +120,10 @@ def valid_file_ext(filename):

 # Runs ML on given image
 def run_ml(img_path):
-    img_file = {'image': open(img_path, 'rb')}
-    r = requests.post(url=ml_endpoint, files=img_file)
+    mime_type = mimetypes.guess_type(img_path)[0]


reading the docs, it's fine to assume [0] here. cool.

https://docs.python.org/2/library/mimetypes.html#mimetypes.guess_type

yeah I looked into that to make sure and the return is always a "pair"

stevemar

this LGTM

stevemar · 2018-08-16T15:23:58Z

app.py

-    r = requests.post(url=ml_endpoint, files=img_file)
+    mime_type = mimetypes.guess_type(img_path)[0]
+    with open(img_path, 'rb') as img_file:
+        file_form = {'image': (img_path, img_file, mime_type)}


i'm not sure if requests will like mime_type set to None if it can't guess the mimetype correctly.

the current default is None which is what causes the error. And IIUC mimetypes.guess_type() uses the file extension in the filename to guess, and jpg, jpeg, and png all work and thats all we need.

you could handle the case where someone submits a file with no extension :) but we're bikeshedding now :)

valid extensions are already checked elsewhere in both the python code that calls this function and on the html form client-side, so I think we're good

Added mimetype to file data sent to model

2b9ee92

This is a follow up to IBM/MAX-Image-Caption-Generator#9 which added file extension verification using mimetype

ajbozarth added the bug Something isn't working label Aug 16, 2018

ajbozarth self-assigned this Aug 16, 2018

ajbozarth requested a review from MLnick August 16, 2018 01:03

ptitzler reviewed Aug 16, 2018

View reviewed changes

Closed file

7e990a3

stevemar reviewed Aug 16, 2018

View reviewed changes

updated to use with

ea098c7

stevemar reviewed Aug 16, 2018

View reviewed changes

stevemar approved these changes Aug 16, 2018

View reviewed changes

ajbozarth merged commit a3ea78b into IBM:master Aug 16, 2018

ajbozarth deleted the mime branch August 16, 2018 18:38

ajbozarth mentioned this pull request Aug 16, 2018

Add mime type inference when uploaded images are missing mime type IBM/MAX-Image-Caption-Generator#13

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added mimetype to file data sent to model #39

Added mimetype to file data sent to model #39

ajbozarth commented Aug 16, 2018

ptitzler Aug 16, 2018

ajbozarth Aug 16, 2018

ptitzler Aug 16, 2018

ajbozarth Aug 16, 2018

MLnick commented Aug 16, 2018

MLnick commented Aug 16, 2018

stevemar commented Aug 16, 2018

stevemar Aug 16, 2018

ajbozarth commented Aug 16, 2018

stevemar Aug 16, 2018

ajbozarth Aug 16, 2018

stevemar left a comment

stevemar Aug 16, 2018

ajbozarth Aug 16, 2018

stevemar Aug 16, 2018

ajbozarth Aug 16, 2018

Added mimetype to file data sent to model #39

Added mimetype to file data sent to model #39

Conversation

ajbozarth commented Aug 16, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MLnick commented Aug 16, 2018

MLnick commented Aug 16, 2018

stevemar commented Aug 16, 2018

Choose a reason for hiding this comment

ajbozarth commented Aug 16, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stevemar left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment