DM-18203: Set encoding for reading text catalogs. #158

jdswinbank · 2019-03-27T22:12:48Z

No description provided.

timj · 2019-03-27T22:23:29Z

tests/test_readTextCatalog.py

+        config = ReadTextCatalogTask.ConfigClass()
+        config.encoding = "ascii"
+        task = ReadTextCatalogTask(config=config)
+        # This will generate a ResourceWarning due to a NumPy bug.


If you want to make the test output clean consider using a warnings.catch_warnings with a simplefilter to disable warning output for that command.
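For reference, the suggestion above might look like the following minimal sketch. `noisy_call` is a hypothetical stand-in for whatever call emits the warning, not part of the test under review; only the standard-library `warnings` calls are real.

```python
import warnings


def noisy_call():
    """Hypothetical stand-in for a call that emits a ResourceWarning."""
    warnings.warn("unclosed file (illustrative)", ResourceWarning)
    return 42


def call_quietly():
    """Run noisy_call() with ResourceWarning output suppressed."""
    with warnings.catch_warnings():
        # Filter changes made here are undone when the block exits.
        warnings.simplefilter("ignore", ResourceWarning)
        return noisy_call()


result = call_quietly()
```

The filter applies only while the `with` block is active, so a warning emitted after the block exits is not affected.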

jdswinbank

I don't think you can, at least not trivially. I tried this, but the warning isn't generated in this code; it is raised deep in the guts of Python. It comes from /Users/jds/Projects/LSST/stack/python/miniconda3-4.5.4/envs/lsst-scipipe-fcd27eb/lib/python3.6/traceback.py:216 sometime after this test case has finished, so a context manager in the test can't catch it.

I considered putting in a filter to drop all ResourceWarnings from traceback.py:216, but that seems less than optimal. Other suggestions welcome!
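A filter of the kind described here could be sketched with `warnings.filterwarnings`, which accepts a module regular expression and a line number. The example is illustrative only: the warning is simulated with `warnings.warn_explicit`, and the module name `traceback` and line 216 are simply taken from the path quoted above. The helper names `install_filter` and `emit` are hypothetical.

```python
import warnings


def install_filter():
    """Ignore ResourceWarnings attributed to line 216 of the traceback module."""
    warnings.filterwarnings(
        "ignore",
        category=ResourceWarning,
        module=r"traceback$",  # regex matched against the module name
        lineno=216,            # must match the reported line exactly
    )


def emit(lineno):
    """Simulate a ResourceWarning attributed to traceback.py at `lineno`."""
    warnings.warn_explicit(
        "unclosed file (illustrative)",
        ResourceWarning,
        filename="traceback.py",
        lineno=lineno,
        module="traceback",
        registry={},
    )


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")  # record everything by default
    install_filter()                 # inserted in front, so it is checked first
    emit(216)                        # dropped by the targeted filter
    emit(217)                        # still recorded

linenos = [w.lineno for w in caught]
```

Because the line number is tied to one particular version of `traceback.py`, a filter like this would probably need revisiting after a Python upgrade, which may be part of why it seems less than optimal.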

timj

Maybe it's fixed in the new numpy that we will be using from tomorrow...

jdswinbank

Pretty sure it's still broken in the latest version. :(

parejkoj

Thanks for filing a numpy ticket on this and linking it here. However, should we have a Jira ticket to remove this comment once numpy is fixed? It might be confusing in "future numpy is fixed" land...

parejkoj

I think utf-8 is backwards compatible with ASCII, so I'm not sure that we really need that to be configurable. Otherwise, this is fine.
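The compatibility claim can be checked with a short sketch using only built-in string methods. The sample catalog text is made up for illustration.

```python
# Pure-ASCII bytes decode to the same text under both codecs.
ascii_bytes = "# ra, dec, flux\n1.5, -2.0, 10.0\n".encode("ascii")
same_under_both = ascii_bytes.decode("ascii") == ascii_bytes.decode("utf-8")

# The reverse does not hold: UTF-8 text with a non-ASCII character
# (here a Greek alpha) cannot be decoded as ASCII.
utf8_bytes = "\u03b1 Cen, 219.9, -60.8\n".encode("utf-8")
try:
    utf8_bytes.decode("ascii")
    ascii_can_read_utf8 = True
except UnicodeDecodeError:
    ascii_can_read_utf8 = False
```

So a reader defaulting to UTF-8 handles every ASCII file, while an ASCII reader rejects any file containing non-ASCII characters.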

parejkoj · 2019-03-28T23:15:57Z

python/lsst/meas/algorithms/readTextCatalogTask.py

+    encoding = pexConfig.Field(
+        dtype=str,
+        default="utf-8",
+        doc="Encoding of text reference file."


Do we really need to make this configurable? Can't we just make it utf-8 forever and forget about it?
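As a point of comparison, here is a sketch of one situation where a fixed UTF-8 reader would fail: a file written in Latin-1 that contains non-ASCII characters. Whether such catalogs exist in practice is not stated in this thread. `read_lines` and the file contents are hypothetical and stand in for the task's actual reader.

```python
import os
import tempfile


def read_lines(path, encoding="utf-8"):
    """Hypothetical reader: return the lines of a text file in `encoding`."""
    with open(path, encoding=encoding) as f:
        return f.read().splitlines()


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "catalog.txt")
    # Write a small catalog in Latin-1 with non-ASCII characters in a name.
    with open(path, "w", encoding="latin-1") as f:
        f.write("name, ra, dec\n\u00c5ngstr\u00f6m-1, 10.0, -5.0\n")

    # The default UTF-8 reader rejects the Latin-1 bytes.
    try:
        read_lines(path)
        utf8_ok = True
    except UnicodeDecodeError:
        utf8_ok = False

    # Passing the matching encoding reads the file as intended.
    rows = read_lines(path, encoding="latin-1")
```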

parejkoj · 2019-03-28T23:32:30Z

tests/test_readTextCatalog.py

+        config = ReadTextCatalogTask.ConfigClass()
+        config.encoding = "ascii"
+        task = ReadTextCatalogTask(config=config)
+        # This will generate a ResourceWarning due to a NumPy bug.


Thanks for filing a numpy ticket on this and linking it here. However, should we have a Jira ticket to remove this comment once numpy is fixed? It might be confusing in "future numpy is fixed" land...

parejkoj · 2019-03-29T01:01:38Z

Your new commit is fine. Squash it.

timj reviewed Mar 27, 2019

jdswinbank force-pushed the tickets/DM-18203 branch 2 times, most recently from df2264f to 2c19f89 Compare March 28, 2019 21:06

parejkoj reviewed Mar 28, 2019

jdswinbank and others added 2 commits April 2, 2019 16:13

Set encoding for reading text catalogs.

a436cd6

Correct example in docstring.

ea4250c

Commas are needed to separate fields in the header.

jdswinbank force-pushed the tickets/DM-18203 branch from 0f72a82 to ea4250c Compare April 2, 2019 23:15

jdswinbank merged commit ea4250c into master Apr 3, 2019
