Code refactor, setup.py, auto-formatting #7

sai-prasanna · 2019-03-16T13:29:04Z

I have created a setup.py to make the project pip-installable. If you opt to accept these changes, we can publish to PyPI under the name errant.
Added types everywhere, plus a way to type check. If we add CI in the future, we can automate the check on pull requests.
Created an Edit type and an Error type to use throughout instead of tuples.
Moved scripts to commands. Now we can invoke errant parallel_to_m2 etc. to get the job done.
Best practice for formatting code uniformly is to not do it by hand, so I have added the black auto-formatter and a simple way to invoke it to format the code.
Used the Levenshtein package for character edits. In my initial tests it gave a large speed-up, but I have yet to check it on a diverse set of sentences.
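
As a rough illustration of the "Edit type instead of tuples" point, here is a minimal sketch. The field names, the `describe` helper, and the example sentence are assumptions made for illustration, not the definitions used in this PR. Note that the class-style `NamedTuple` syntax needs Python 3.6 or later.

```python
from typing import List, NamedTuple


class Edit(NamedTuple):
    """Hypothetical edit record; the field names are illustrative only."""

    orig_start: int  # token offset where the edit starts in the original
    orig_end: int    # token offset where the edit ends in the original
    cor_start: int   # token offset where the edit starts in the correction
    cor_end: int     # token offset where the edit ends in the correction
    error_type: str  # error category label


def describe(edit: Edit, orig: List[str], cor: List[str]) -> str:
    """Render an edit as 'original tokens -> corrected tokens (type)'."""
    before = " ".join(orig[edit.orig_start:edit.orig_end])
    after = " ".join(cor[edit.cor_start:edit.cor_end])
    return "{} -> {} ({})".format(before, after, edit.error_type)


orig = "This are a sentence .".split()
cor = "This is a sentence .".split()
edit = Edit(1, 2, 1, 2, "R:VERB:SVA")
print(describe(edit, orig, cor))
```

Named fields keep call sites readable (`edit.error_type` rather than `edit[4]`), while the object still unpacks and indexes like the tuple it replaces.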

chrisjbryant · 2019-03-17T15:54:30Z

Thanks again for all of this. I don't want to release it during the shared task as it might confuse people, but will certainly take a look and do some testing afterwards, probably April. Hope you can wait!

sai-prasanna · 2019-03-18T16:49:40Z

@chrisjbryant Cool. By the way, spaCy 2.1 was released today https://explosion.ai/blog/spacy-v2-1 , so I guess we can release a v1.0.1 with spaCy 2.1.

chrisjbryant · 2019-03-25T15:14:51Z

FYI, spaCy 2.1.2 finally changed the POS mappings, so I'd definitely have to revisit the rules to be compatible with spaCy 2.1. Cf. explosion/spaCy#3455.

sai-prasanna · 2019-04-04T12:15:09Z

@chrisjbryant A gentle reminder. If there is anything to be done on my side, I will do it.

chrisjbryant · 2019-04-04T13:40:14Z

Yep, it's on my to-do list! The shared task finished last week and we're releasing the official results tomorrow so I still have a bit to do.
I'm also slightly concerned about releasing a new version so quickly since it will probably produce slightly different scores to the shared task. I'll test your version next week, but it would be good to have an option to use the official shared task setup as well as the newer, more updated version.

sai-prasanna · 2019-04-04T14:13:24Z

Awesome, our team has a submission in the shared task 😀. We can probably release multiple versions: 1.0.0 with the official scorer and 1.1.0 with the updated spaCy. Or, if you can adapt the setup.py and the scaffolding code to the existing code without all these other changes, that can be 1.0.0.

chrisjbryant · 2019-04-10T13:11:36Z

Awkward question: how easy is it to get rid of all the explicit typing?
I read that typing is only supported by python 3.6 and later, but we only have 3.5 on our main server and can't update it in case it breaks other people's projects. I suspect other people might find themselves in a similar position, so would really like the code to be compatible with python >=3.4 to give us a reasonable degree of backwards compatibility (3.4 was released in 2014).

sai-prasanna · 2019-04-14T17:33:09Z

I guess they can be removed with a simple regex or some find and replace. I will check whether there is an automated way to remove them. I thought people generally used virtualenvs, but yes, for backward compatibility we can move the types into comments so that the type checker still functions.
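
For reference, a minimal sketch of what "types in comments" could look like, using the PEP 484 type-comment syntax that checkers such as mypy can read. The `token_spans` function is a made-up example, not code from this PR, and on interpreters older than 3.5 the `typing` import assumes the backport package is installed.

```python
from typing import List, Tuple


def token_spans(tokens):
    # type: (List[str]) -> List[Tuple[int, int]]
    """Return (start, end) character offsets for tokens joined by single spaces."""
    spans = []  # type: List[Tuple[int, int]]
    start = 0
    for tok in tokens:
        spans.append((start, start + len(tok)))
        start += len(tok) + 1  # skip the token and the following space
    return spans


print(token_spans(["This", "is", "fine"]))
```

The signature and variable annotations live entirely in comments, so the file parses on interpreters that do not understand annotation syntax.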

sam-writer · 2019-09-23T20:07:16Z

Sorry to resurrect this, but we'd love to help make ERRANT pip-installable, ideally as a module, so it can be called from within Python.

I was under the same impression about version restrictions, but it turns out that older versions of Python can support typing, similar to how from __future__ import print_function allows you to use print() in Python 2 code.

This Stack Overflow answer discusses how you can support type syntax with Python 3.0-3.4 (3.5+ already have typing support). It seems to be just a matter of adding a dependency on typing, "a backport of the standard library typing module to Python versions older than 3.5." See also a GitHub issue in the Python org which seems to agree.
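
A minimal sketch of how such a conditional dependency might be declared, assuming a setuptools and pip recent enough to understand PEP 508 environment markers. `INSTALL_REQUIRES` and `needs_typing_backport` are hypothetical names for illustration, not part of this PR.

```python
import sys

# Hypothetical list that could be passed to setuptools.setup(install_requires=...).
INSTALL_REQUIRES = [
    # PEP 508 environment marker: install the backport only where the
    # standard library does not already ship a typing module.
    'typing; python_version < "3.5"',
]


def needs_typing_backport(version_info=sys.version_info):
    """Return True when the interpreter predates the stdlib typing module."""
    return tuple(version_info[:2]) < (3, 5)


print(needs_typing_backport())
```

The helper only mirrors, in plain Python, the condition the marker expresses; the installer evaluates the marker itself.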

So to make progress on this PR, I propose:

1. Use Docker to verify that the code runs on 3.4, 3.5, 3.6 and 3.7 (possibly with the addition of a dependency on typing) by, for example, running the demo and confirming that it produces a file called test.m2 which is identical to out.m2.
2. Revisit the rules to confirm they work with spaCy 2.1.x POS mappings (including this to be thorough, without being sure what it means).

@chrisjbryant , if you can tell me more about 2, I am happy to help. Also, if 1 were completed as described, would you feel that there is adequate backwards compatibility?
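
The "identical to out.m2" check from the first proposal could be sketched as below and run inside each Python version's container after the demo. The default file names come from the comment above; the `outputs_match` helper is an assumption, not an existing ERRANT function.

```python
import filecmp


def outputs_match(produced="test.m2", reference="out.m2"):
    """Return True when the two files have identical contents."""
    # shallow=False forces a content comparison instead of trusting
    # the os.stat() signatures (size, modification time).
    return filecmp.cmp(produced, reference, shallow=False)
```

A wrapper script could exit non-zero when `outputs_match()` is false, so that a failing Python version is easy to spot.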

chrisjbryant · 2019-12-10T01:37:17Z

It took 9 months, but I finally found the time to learn setup.py and refactor everything (inspired by this pull request)!
Sorry it took so long, but I also took the opportunity to learn about python packaging and OOP.
I hope you don't mind I also acknowledged your help in the changelog. @sai-prasanna
:)

sai-prasanna and others added 8 commits March 13, 2019 19:56

Full Refactor initial commit

f3dec4a

Rename readme.md to README.md

f7799f3

Fix types, add manifest.in for ptbmaps and dictionary

29c197e

Fix replication problems 1

86ca6e6

Use Levenshtien in categorizer

fa5234d

Fix m2 to m2 command and add more typing

e673c44

Add auto-formatting scripts

d716ca8

Add install requirements file and package minimum versions

8692560

Fix alignment problem after mypy refactor

ded24d8

chrisjbryant closed this Dec 10, 2019

chrisjbryant reopened this Dec 10, 2019

chrisjbryant closed this Dec 10, 2019
