Fix %notebook magic, etc. nbformat unicode tests and fixes by minrk · Pull Request #1480 · ipython/ipython

minrk · 2012-03-08T22:51:56Z

use io.open for better unicode handling
json.writes always gives unicode, so that current.writes can be trusted to give the same interface
set up a base TestCase for nbformat tests, to consolidate code and better test both file formats
add tests for reading/writing to files
allow name as a kwarg to new_notebook, to avoid unnecessary breakage of the previous API
remove the fallback to xml, which would hide corrupt notebook files behind a nonsensical 'xml unsupported' message

takluyver · 2012-03-08T23:21:12Z

The default encoding is platform dependent (not perhaps the best design, but that's the way things are), so we should explicitly specify encoding='utf-8' for reading & writing JSON files.

Ah, good call. Should we be using io.open in utils.py3compat, and just use that?

I think it's simpler to use io.open directly - it makes it obvious that we're not doing any extra magic. Also, it's identical to the Python 3 built-in open(), so there's less to change when we eventually drop py3compat.
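A minimal sketch of the explicit-encoding point, assuming Python 3 (where `io.open` is the same object as the built-in `open`). The file name and the notebook-like dictionary are made up for illustration and are not taken from the PR.

```python
import io
import json
import os
import tempfile

# Hypothetical notebook-like payload containing non-ASCII text.
data = {"name": "héllo", "cells": ["print('ünïcödé')"]}

path = os.path.join(tempfile.mkdtemp(), "example.ipynb")

# Pass the encoding explicitly instead of relying on the platform default.
with io.open(path, "w", encoding="utf-8") as f:
    f.write(json.dumps(data, ensure_ascii=False))

# Read it back with the same explicit encoding.
with io.open(path, "r", encoding="utf-8") as f:
    loaded = json.loads(f.read())

# The bytes on disk are UTF-8 regardless of the locale.
with io.open(path, "rb") as f:
    raw = f.read()
```

Because both calls name the encoding, the round trip should behave the same on a machine whose default encoding is not UTF-8.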

okay, makes sense. Should we just remove the open stuff from py3compat, then?

Seems like we should either:

a) remove open from py3compat
b) on py2, use open = io.open, rather than the custom class we have currently

Yes, I'd side with removing it. I didn't know about io.open when I wrote py3compat.open (codecs.open is more often mentioned, but doesn't have universal newlines). I'll get onto it in the next couple of days unless you want to do it as part of this.

Okay, let's do that separately.
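The universal-newlines difference mentioned above can be sketched as follows, assuming Python 3. The temporary file and its contents are illustrative only; the warning filter is there because newer Python versions may flag `codecs.open` as deprecated.

```python
import codecs
import io
import os
import tempfile
import warnings

path = os.path.join(tempfile.mkdtemp(), "crlf.txt")

# Write Windows-style line endings as raw bytes.
with open(path, "wb") as f:
    f.write(b"one\r\ntwo\r\n")

# io.open in text mode applies universal newlines by default (newline=None),
# so "\r\n" is translated to "\n" on read.
with io.open(path, "r", encoding="utf-8") as f:
    via_io = f.read()

# codecs.open opens the underlying file in binary mode, so "\r\n" is kept.
with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    with codecs.open(path, "r", encoding="utf-8") as f:
        via_codecs = f.read()
```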

takluyver · 2012-03-09T13:09:31Z

I think this can open Python files, which aren't necessarily UTF-8, although most are. I was preparing code somewhere to sniff the encoding magic comment and open a Python file accordingly, so let's just make a note to hook that up to this when I add that in.
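For reference, a sketch of sniffing the encoding declaration with the Python 3 standard library. This is not the code referred to above (which is not shown in this thread); `tokenize.detect_encoding` is a stdlib function, and the sample bytes are invented for illustration.

```python
import io
import tokenize

# Hypothetical source file contents: Latin-1 bytes with a coding declaration.
raw = b"# -*- coding: latin-1 -*-\nname = 'h\xe9llo'\n"

# detect_encoding reads at most two lines, looking for a BOM or coding comment,
# and falls back to UTF-8 when neither is present.
encoding, _first_lines = tokenize.detect_encoding(io.BytesIO(raw).readline)

# Decode the whole file using the declared encoding.
text = raw.decode(encoding)
```

On Python 3, `tokenize.open(path)` performs the same detection and returns a text-mode file object in one step.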

minrk · 2012-04-02T18:48:14Z

This PR now closes the issue reported in #1545

fperez · 2012-04-14T09:45:37Z

On quick read this looks good, and tests pass, but I'm too tired to finish the review now; will continue tomorrow. @takluyver, do you have any remaining concerns with this one? If not, I'll give it a pass tomorrow and assuming all looks OK I'll merge it.

takluyver · 2012-04-14T17:46:12Z

I think all four of these should probably be unicode literals - i.e. u"u = {u}'héllo'". 8-bit strings with non-ascii characters are a bit of a headache.

When I make these unicode literals, I get:

AttributeError: 'unicode' object attribute '__doc__' is read-only

Should I leave them as str literals, then?

Oops, that's a bug in py3compat. It should be basestring rather than str on this line: https://github.com/ipython/ipython/blob/master/IPython/utils/py3compat.py#L35 . You can just make the change in this branch.

Gotcha, will do, thanks.

Thanks, that fixed it.
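A sketch of why the type check matters, loosely modeled on the helper named in this thread. The body below is a reconstruction under assumptions, not a copy of IPython's code, and `modify_str_or_docstring` and `shout` are hypothetical names. On Python 2, `isinstance(u"...", str)` is False, so a unicode argument would be treated as a function and the code would try to assign to its `__doc__`.

```python
import functools

try:
    string_types = (basestring,)  # Python 2: covers both str and unicode
except NameError:
    string_types = (str,)         # Python 3: str is the only text type


def modify_str_or_docstring(str_change_func):
    """Let a string-transforming function also act as a docstring decorator."""
    @functools.wraps(str_change_func)
    def wrapper(func_or_str):
        if isinstance(func_or_str, string_types):
            func, doc = None, func_or_str
        else:
            func, doc = func_or_str, func_or_str.__doc__
        doc = str_change_func(doc)
        if func is not None:
            func.__doc__ = doc   # only attempted for non-string arguments
            return func
        return doc
    return wrapper


@modify_str_or_docstring
def shout(text):
    return text.upper()


def documented():
    """some docs"""


plain = shout("abc")            # string in, string out
decorated = shout(documented)   # function in, same function out
```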

takluyver · 2012-04-14T17:57:27Z

Besides that comment, I think it's all OK.

fperez · 2012-04-14T20:29:25Z

Thanks @takluyver; I'll then wait for @minrk to have a chance to fix this before proceeding further.

fperez · 2012-04-14T21:09:48Z

Any particular reason to not follow the naming pattern for test files of having them all be called test_*? I know nose will recognize this as well, but since we already have a pattern for those filenames, I think it would be best to follow it. That makes it easier to know that all files not named test_X.py are simply auxiliary tools for the test suite.

Yes, because it shouldn't (and doesn't) pick up this one. It's a base class for real tests to inherit from, and won't work without subclassing. It's the same as clienttest in IPython.parallel.

Ah, OK. I got confused b/c there was a TestCase subclass in there, which I thought was usable by itself. In a similar situation, the pattern I use is to call that class FooTestBase, and not subclass TestCase, so that it's clear that this is an incomplete class, meant to be used as a mixin. Then, the real test class will do class FooTestCase(FooTestBase, TestCase). It's slightly more verbose, but it makes the code more self-documenting, I think.

That makes good sense, and I will make the change to avoid future similar confusion.
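The mixin pattern described above can be sketched like this. The class names and the JSON round-trip check are illustrative assumptions, not the PR's actual `NBFormatTest` code.

```python
import json
import unittest


class RoundTripTestBase(object):
    """Incomplete on purpose: concrete test classes supply dumps/loads."""

    sample = {"name": "héllo"}

    def test_roundtrip(self):
        self.assertEqual(self.loads(self.dumps(self.sample)), self.sample)


class JSONRoundTripTestCase(RoundTripTestBase, unittest.TestCase):
    """The real test case: mixin first, TestCase second."""

    def dumps(self, obj):
        return json.dumps(obj)

    def loads(self, text):
        return json.loads(text)


# Run only the concrete class; the mixin is not collected as a test by itself.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(JSONRoundTripTestCase)
result = unittest.TestResult()
suite.run(result)
```

Since the base class does not inherit from `TestCase`, a test runner has no reason to try to run it without a subclass that fills in the missing methods.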

fperez · 2012-04-14T21:13:05Z

OK, I'm good with this going in once @takluyver's minor fix is made.

minrk · 2012-04-14T23:06:18Z

Okay, now using unicode literals (and @takluyver's tiny fix to py3compat required for that to work), and NBFormatTest is a mixin as recommended by @fperez.

Good to go?

takluyver · 2012-04-14T23:12:17Z

OK by me.

Fix %notebook magic, etc. nbformat unicode tests and fixes.

* json.writes always gives unicode, so that `current.writes` can be trusted to give the same interface
* setup base TestCase for nbformat tests, to consolidate code, and better test both file formats
* add tests for reading/writing to files
* allow `name` as kwarg to new_notebook to avoid unnecessary breakage of previous API.
* remove fallback to xml, which would hide corrupt notebook files behind a nonsensical 'xml unsupported' message.

Closes #1545, #1487.

minrk added 5 commits March 8, 2012 14:29

allow name as kwarg to new_notebook

501b1b0

avoids unnecessary breakage of backwards-compatible interface when creating new notebook.

nbjson.writes always returns unicode

5902725

add NBFormatTestCase base class, to consolidate nbformat testing

7185014

unicode-related fixes in rwbase, nbformat tests

3b6df9f

test and fix %notebook magic

3d8ee25

takluyver reviewed Mar 8, 2012

specify utf8 when calling io.open

76bb87c

takluyver reviewed Mar 9, 2012

takluyver mentioned this pull request Mar 10, 2012

%notebook fails in qtconsole #1487

Closed

preserve trailing newlines in ipynb

0895a9b

Principally important for distinguishing a sequence of `print n,` messages from a sequence of `print n` messages.
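A minimal sketch of the trailing-newline problem, assuming the text is stored as a list of lines. The helper names `split_lines` and `join_lines` are hypothetical and are not the functions in the PR.

```python
def split_lines(text):
    # keepends=True keeps each line's terminator, including the final one.
    return text.splitlines(True)


def join_lines(lines):
    # Lines already carry their own terminators, so join with nothing.
    return "".join(lines)


with_newline = "1\n2\n"
without_newline = "1\n2"

# Lossy variant: dropping terminators and re-joining with "\n"
# makes the two inputs indistinguishable.
lossy = "\n".join(with_newline.splitlines())

# Lossless variant: both inputs survive the round trip unchanged.
kept_with = join_lines(split_lines(with_newline))
kept_without = join_lines(split_lines(without_newline))
```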

minrk mentioned this pull request Apr 2, 2012

trailing newline not preserved in splitline ipynb #1545

Closed

takluyver reviewed Apr 14, 2012

fperez reviewed Apr 14, 2012

minrk added 3 commits April 14, 2012 15:38

unicode literals in notebook magic tests

befb89e

per review by @takluyver

NBFormatTest is now a mixin, rather than a base class

ef2fac7

py3compat._modify_str_or_docstring should check against basestring

2708316

fperez merged commit c4cf940 into ipython:master Apr 15, 2012
