Fixing decode error #405

epifanio · 2019-09-16T07:29:04Z

Fixes an issue with UTF-8 strings when

decoding = self.charset or self.default_body_encoding

selects the wrong encoding.

Adds a try/except statement for the case where body.decode fails because the selected encoding is wrong.

Replaces the bare except with a specific exception.

stevepiercy

Initial triage. Still needs further review from maintainers.

.gitignore

stevepiercy · 2019-09-16T08:48:25Z

src/webob/response.py

+        try:
+            return body.decode(decoding, self.unicode_errors)
+        except UnicodeDecodeError:
+            return body.decode("UTF-8", self.unicode_errors)


This causes test coverage to fail. 100% test coverage is required in PRs.

I am not familiar with coverage testing and am trying to learn. I had a look at the code in test_response.py,
which seems to be the part of the test suite that covers the function I modified.
Do you have any hints on what needs to be added in order to fix the coverage issue?
Thanks!

You need to add a test that covers the lines that are currently uncovered, specifically the except block.

@epifanio to save time, you can also run tests locally before committing and pushing to this PR by installing tox and using tox -e py27,py36,coverage. Let me know if you need help with that.

I got tox -e py27,py36,coverage working. From the test code, I see that some clues may come from the following lines: https://github.com/Pylons/webob/blob/master/tests/test_response.py#L758-L787. It is my understanding that I should add a new test method like def test_text_get_wrong_encoding() to cover the case where a wrong encoding is passed to the _text_get method. Is that correct?

You're on the right path. I would narrow it down by running a specific test in debug mode, and step through, to see which lines are executed. Saves guesswork.

As for its name, naming is hard. Maybe incorporate the method name, e.g., text_get_body_unicode_decode_error or something similar? Although longer, it hints at what the test covers.
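For illustration, a test that exercises the except block might look roughly like the sketch below. It assumes Python 3 and uses a hypothetical stand-in class, `PatchedResponse`, that only mirrors the try/except from this PR's diff; it is not WebOb's real `Response`, and a real test would construct a `Response` in tests/test_response.py instead.

```python
class PatchedResponse:
    """Hypothetical stand-in that mirrors the patched text getter from this PR."""

    default_body_encoding = "UTF-8"
    unicode_errors = "strict"

    def __init__(self, body, charset=None):
        self.body = body
        self.charset = charset

    @property
    def text(self):
        decoding = self.charset or self.default_body_encoding
        try:
            return self.body.decode(decoding, self.unicode_errors)
        except UnicodeDecodeError:
            # This is the branch the coverage report flags as not executed.
            return self.body.decode("UTF-8", self.unicode_errors)


def test_text_get_body_unicode_decode_error():
    # UTF-8 bytes that are not valid ASCII, paired with a wrong declared charset,
    # force the first decode to fail and the except block to run.
    res = PatchedResponse(body="\u00e9".encode("UTF-8"), charset="ascii")
    assert res.text == "\u00e9"
```

The key point is the input: the body has to be undecodable in the declared charset, otherwise the except block is never reached and coverage stays below 100%.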


IDE specific instructions should go in a personal `.gitignore` file.

digitalresistor · 2019-10-02T06:46:57Z

.gitignore

@@ -18,3 +18,4 @@ WebOb.egg-info/
 pytest*.xml
 coverage*.xml
 .pytest_cache/
+


Please don't update the .gitignore. I would ask you to rebase your changes to avoid touching this file at all.

digitalresistor · 2019-10-02T06:48:38Z

src/webob/response.py

+        try:
+            return body.decode(decoding, self.unicode_errors)
+        except UnicodeDecodeError:
+            return body.decode("UTF-8", self.unicode_errors)


We should not fall back to attempting to decode with UTF-8 if the remote client has sent a charset.

digitalresistor · 2019-10-02T06:50:28Z

I am sorry, but I am going to reject this change. If the remote client has sent a charset, we shouldn't attempt to guess and fall back to decoding UTF-8. If the remote has not set a charset, we are correctly using the self.default_body_encoding.
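The behavior described above can be sketched as follows. This is a simplified illustration, not WebOb's actual code: `decode_body` is a hypothetical helper, and its parameter names and the "UTF-8" default are assumptions made for the example. A declared charset is always honored, the default applies only when no charset was sent, and a mismatch surfaces as an error instead of a guess.

```python
def decode_body(body, charset=None, default_body_encoding="UTF-8", errors="strict"):
    """Hypothetical helper: decode with the declared charset, never guess."""
    decoding = charset or default_body_encoding
    return body.decode(decoding, errors)


payload = "caf\u00e9".encode("UTF-8")

# No charset sent: the default body encoding is used.
text_without_charset = decode_body(payload)

# Charset sent but wrong for the payload: the error propagates to the caller.
try:
    decode_body(payload, charset="ascii")
    mismatch_raised = False
except UnicodeDecodeError:
    mismatch_raised = True
```

With strict error handling, the caller learns that the declared charset and the bytes disagree, rather than receiving text produced by a silent fallback.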

epifanio added 3 commits July 4, 2019 11:53

adding pycharm project files to gitignore

f87db39

Update response.py

22c8c80

adding a try / except statement when the body.decode fails in detecting the right encoding

Update response.py

059649d

fixing bare except

stevepiercy requested changes Sep 16, 2019


epifanio added 3 commits September 19, 2019 13:03

Update .gitignore

b0cd835

IDE specific instructions should go in a personal `.gitignore` file.

fix wrong exception

76680cc

Update response.py

066c06c

digitalresistor requested changes Oct 2, 2019


digitalresistor closed this Oct 2, 2019


