Py3 #287

sheagcraig · 2018-10-04T15:24:37Z

Once more, with feeling.

sheagcraig · 2018-11-08T02:11:16Z

There's a couple of unicode vs. legacy python "strings" stuff to unravel in here still.

We have a server.text_utils.safe_bytes function that is used in a a dozen places to take an input text type and ensure that the resulting output is a bytestring object (legacy python "string"). Django natively handles unicode, and as long as the database is configured to handle it (it is: docker/setup_db.sh, you can just push unicode into the database. So one of the next steps here is to clean up all of the coercion from bytes to unicode and vice versa for all Django and database code. This will almost certainly be tedious. I think it's pretty common to just speculatively try adding .decode() or .encode() to text in python 2 until things work when debugging "strange character" issues (guilty...), so they can grow like weeds throughout a codebase. The resolution here is to grep through all sal code looking for decode, encode, casting to str, bytes, or unicode, and probably any time the string 'utf-8' gets bandied about.
One exception is URL construction. URLs have to be in ASCII, so we have to make sure all URLs are encoded before we are done with them. For example, links for applications in the app inventory could potentially have accents in them due to app name or version (nëthäck).
Despite all this, postgres cannot store a null character in a TextField, which is the database field type used for the majority of Sal's stored data. This is a limitation in C, which treats a null character (python chr(0) == b'\x00' == '\x00'). Why do we care? Well, plists can include data of a variety of types: int, date, float, string, and data. Python's plistlib deserializes the plist <data> into a bystestring, and it can most certainly contain a null character. For example, the manufacturer data returned from the battery plugin, which is one of many pieces of data returned, can (and does) include null characters, causing postgres to FREAK OUT! This data is not being analyzed, and frankly, is not really needed. But the battery plugin is just an example; what needs to be done is to protect Sal from trying to write those rows in the first place. I'm going to just replace null characters with an empty string or maybe a smiling poo emoji so that the database can save the rest of the data.

sheagcraig · 2018-11-08T14:43:00Z

Another potential breakage point is CSV generation. Python happily handles unicode in csv. Google sheets handles it just fine. Excel 2019 just treats it as garbage. I'm sure there's a way to get Excel to handle it, but I'm not feeling inclined to figure it out.

Unless someone feels very strongly about encoding all text going out from Sal into utf-8 encoded bytestrings for csvs, I'm going to leave the data as unicode and let the csv writer handle it as is.

sheagcraig · 2018-11-08T19:13:02Z

clburlison · 2018-11-26T17:10:56Z

Based on the docker images 3ef8d9fa9dcc (should be the last commit 4b51c76) I am having issues with the client checkin.

https://gist.github.com/clburlison/d5730d6c4e4edc9193ea9981d18d877a

sheagcraig · 2018-11-26T17:19:39Z

Thanks @clburlison!

I think that's probably the #298 issue in another form. I'm hoping to merge in the fix for that soon so we can rebase the py3 branch off of it. I'll let you know once I do so, so you can give it another go.

sheagcraig · 2018-12-06T20:11:51Z

There's a crazy circeci /pip issue going on that wasn't here before:
ModuleNotFoundError: No module named 'pip._internal'

Not sure what to do about that one...

This fixes all of the print statements and relative import issues.

The profiles squash still referenced the migrations it was replacing, despite them being removed.

I can't get these to work with postgres.

This is to block poorly formed report data from blowing up the checkin entirely.

sheagcraig changed the base branch from master to version-4 November 30, 2018 14:45

sheagcraig added the V4 To be added to version 4 label Nov 30, 2018

Add feature request issue template.

d893687

sheagcraig added 21 commits December 7, 2018 09:59

Update python packages so it will set up with python3.

cf965f4

Fix urllib package imports. Update django-bootstrap3.

a143819

Fix all py3 syntax errors from ./manage.py runserver

3a522e5

This fixes all of the print statements and relative import issues.

Remove unused imports to prepare for future django updates.

d9cb999

Remove squashed server migrations. Fix profiles squash.

6480387

The profiles squash still referenced the migrations it was replacing, despite them being removed.

Rename xrange to range for python3.

47ddb1a

Remove encoding in inventory for py3.

5f891a1

Update usage of plistlib for py3 API.

834c7fd

Update sole use of iteritems to py3 syntax.

7d1235d

Update circleci config to use py3.

fce4351

Update all references to python binary to python3.

378047e

Fix import of system_settings.

770ef10

Fix metaclass usage for py3.

e469532

Rmove py2 source code encoding cruft.

6deb14a

Fix some remaining print statements, use new io instead of stringIO.

7d939f3

Remove from future imports; the future is **NOW**.

744fba9

Update py2 unicode handling to py3.

aa39b28

Fix missing paren, imports, spelling.

93756f6

Fix urllib api changes.

c3b252b

Fix exception handling in management command. Clean up.

6a58c7f

Fix imports in munkiinfo plugin.

31b5f34

sheagcraig and others added 22 commits December 7, 2018 10:01

Make plist content for Catalog a BinaryField, and cleanup sub views.

c37d2fa

Remove unused copy of inventory sub; clean up to use py3 text utils.

11f7764

Remove unused imports.

8ca7716

Factor out plist parsing from profiles views.

9126542

Migrate Machine.report to a BinaryFIeld and update report method.

1d44230

Clean up key_auth setting checking logic.

9cfbfb7

Factor checkin machine-from-serial code and test it.

d9aefb4

Fix linting errors.

c62e380

Pull back the BinaryFields.

2a51bb8

I can't get these to work with postgres.

Update __unicode__ model methods to __str__.

27d7b87

Test checkin more, factor it apart into smaller chunks.

34bd989

Add potential exception type to submission data decoder.

e4efe66

Pull out report getting fromo checkin view.

7f504dc

Do a catch on ValueError in case the save fails.

9a0f473

This is to block poorly formed report data from blowing up the checkin entirely.

Fix some renaming mistakes. Simplify os family determination.

c717bf2

Reorganize tests into helper tests and checkin tests. Add a bunch more.

bf04dc8

Fix regression in plugin script retrieval.

494b152

Refactor checkin into more subfunctions. Test.

e446467

Remove redundant loggin import.

c285de5

Bump python version ini Dockerfile.

08306d2

Update python version used in circle testing.

303020e

Add safety check for report StartTime.

df58856

sheagcraig force-pushed the py3 branch from 2bbce0e to df58856 Compare December 7, 2018 15:04

sheagcraig added 4 commits December 7, 2018 10:09

Unjack up the rebase I tried to do.

fe81712

Fix flake8 errors.

08a4636

Continue fixing flake8 issues.

8ad622b

Fix syntax error.

515f827

sheagcraig merged commit b4b6200 into version-4 Dec 7, 2018

sheagcraig deleted the py3 branch December 7, 2018 16:04

grahamgilbert mentioned this pull request Jan 24, 2019

Change search_field and search_models to allow for longer search fields. #311

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Py3 #287

Py3 #287

sheagcraig commented Oct 4, 2018 •

edited

sheagcraig commented Nov 8, 2018

sheagcraig commented Nov 8, 2018

sheagcraig commented Nov 8, 2018 •

edited

clburlison commented Nov 26, 2018

sheagcraig commented Nov 26, 2018

sheagcraig commented Dec 6, 2018

Py3 #287

Py3 #287

Conversation

sheagcraig commented Oct 4, 2018 • edited

sheagcraig commented Nov 8, 2018

sheagcraig commented Nov 8, 2018

sheagcraig commented Nov 8, 2018 • edited

Text TODOs

Submission TODOs

clburlison commented Nov 26, 2018

sheagcraig commented Nov 26, 2018

sheagcraig commented Dec 6, 2018

sheagcraig commented Oct 4, 2018 •

edited

sheagcraig commented Nov 8, 2018 •

edited