Conversation

@matgrioni
Collaborator

No description provided.

matgrioni merged commit 1f3badd into dev Dec 5, 2025
5 checks passed
matgrioni deleted the matgrioni/last-prerelease-changes branch December 5, 2025 21:08
matgrioni added a commit that referenced this pull request Dec 5, 2025
* Initial Implementation of Structural Parsing (#148)

* Step 1 via cc. Change parsing logic into a class.

* Move parsing out of Token class.

* Lint fixes.

* Move everything up to 3.12 and small start on structural schema logic.

* Some wip.

* Attempt before reformat.

* Update formatting logic properly.

* Pretty good progress I'd say.

* MFer, it works.

* Still need to do more hardening around handling and surfacing errors, but it works for ConlluToken unit and integration tests.

* Getting a move on.

* Fix version number and add more to CHANGELOG.

* Working through it.

* More implementation improvements.

* Handle more errors and inconsistencies, improve error handling distinctions, etc.

* Annoying string reformatting.

* More improvements.

* Fix all tests, make an opt-out for all the file hashing, and add a __repr__ for the generated classes.

* Work on restructuring modules.

* Properly annotate the public API, and fix ABC issue on Conllable so that the type can be piped in from the top level.

* Revert load API to avoid a breaking change, change __init__ imports, and add a custom schema descriptor.

* Small linting changes.

* Change minimum required version to 3.13 and start moving around code for parsing consolidation.

* Get mypy to shut it.

* Update loader.

* Change tests back.

* Move back and consider consolidated and time to bring up quality before next checkpoint.

* Add new command.

* Update versions again.

* Big linting changes.

* Rename to better match concepts and fix syntax error.

* Remove legacy parsing.

* Almost there.

* More safe name generation.

* Move up Python version to take advantage of template strings and increase safety.

* Change things to t-strings for safety and nail down the design finally.

* Try to address linting errors.

* Maybe Python 3.14 was a mistake :).

* Update documentation build version.

* Add tests, small optimization, and fix line length and bug.

* Resign to the fact that Parse and FormatError should only really be thrown for Token parsing.

* Continue being a boss.

* Fix linting and test coverage.

* Fix formatting.

* Change Sentence to be data holder (#149)

* WIP.

* Add more types.

* Use Parser Logic Instead of Load (#150)

* I have more work cut out for me than I thought.

* Forgot these.

* Tidy up things.

* Solve linting issues.

* Type Generic Sentence (#151)

* Getting something on the board.

* Final changes.

* Fix comment and tests.

* Update test coverage.

* Linting fixes.

* Fix up comments.

* Fix typo.

* Serializer instead of Conllable (#152)

* WIP need to shift focus.

* A lot of changes.

* Bring back tests.

* Keeping that coverage at 98%.

* Working through linting issues.

* More linting fixes.

* More linting.

* Create via changes. (#153)

* Create via changes.

* Yeah buddy.

* Code Touchups (#154)

* Address whitespace parsing.

* Continued moving around.

* Yeah buddy.

* More stuff for linting.

* Refactor inclusions.

* Change unit to unit test meaning.

* More stuff.

* lints and types.

* It still passes.

* Import Changes (#155)

* Rename things.

* Move to module level import.

* Change internal schema method to be private.

* Add small helper to avoid using compile directly.

* Final lint changes.

* Refactorings for pre-release (#156)

* Update via claude.

* First change.

* Claude is a lifesaver.

* Update examples.

* Create an example.

* Change to dev version to try to release.

* Allow for workflow dispatch.

* Update allowed version identifiers.

* Update this again.

* One more fix to versioning?

* Who knows at this point.

* Support variable width columns (#157)

* Various cleanups.

* Correct tests and better support for collapsed delimiters.

* Final touches on something.

* Fix linting.

* fix version.

* Create Unified Format Interface (#158)

* Working tests again.

* Rename to make things easier.

* Improve example.

* Add examples to linting.

* Cache Workflow Data Dependencies (#159)

* Update triggers and add cache.

* Add key since we need it.

* Small test.

* More fine-grained caching logic.

* Smarter hashing for real this time.

* Quotes are good.

* Be better!

* Use trusted publishing. (#160)

* Update definition.

* Upload and download artifacts for inter-job use.

* Pyproject.toml over setup.py (#161)

* It's always a good day when there are fewer files at the end.

* Add the new files for builds.

* Change config location.

* Update version parsing.

* Unpollute namespace.

* Remove needless utils.

* Remove old references to util.

* actually fix build issue.

* actually fix build issue.

* Add at least some info about changes.

* Fix workflow.

* Documentation Update and then a realization for more features (#163)

* Great WIP on documentation and also realizing more of what I actually need to support.

* Update tests to go over both formats (for the most part, need to do more).

* Joya.

* Some doc changes, prob need several iterations of going through.

* Small updates.

* Actually remove slots.

* This should actually work.

* Change TokenSchema for @Tokenspec (#164)

* Take in updates.

* Small bug fix.

* Fix test and naming.

* Rename to avoid potential collisions, pyc isn't that unique.

* Improve test.

* Beautiful.

* On our way.

* Add comparison package for analysis.

* Annoying, but there's progress.

* I think that properly fixes all my issues.

* Fix some small things.

* Change defaults.

* Fixes on usage and naming.

* WIP.

* Add initial cache and improve tuple creation for optimizations.

* Small tweaks, not worth the time unfortunately.

* Final progress on benchmark.

* Fix internal naming to be more consistent.

* Better guarantees on codegen.

* Linting fixes.

* Round Trip Mapping For Singleton and Full Benchmark (#165)

* Handle the roundtrip issue with mapping and compact pairs.

* Small ergonomic improvement.

* Final changes and full benchmark suite.

* Go back to old algorithm, since it seems faster.

* Linter pass.

* Generic Sentence Feature (#166)

* Beautiful.

* More awesomeness.

* Rename.

* Finish comment.

* Comment and script updates.

* Add Final Features (#167)

* Have to update documentation a lot, help claude!

* Fix linting.

* Final changes.

* Fix test.

* Change version back.

* Add performance information and add exclusions to benchmark.

* Last prerelease changes (#169)

* Remove CSV data from git diffs, improve documentation, and add 2.17.

* Update documentation as well as I can.

* I think that properly covers things.

* Fix automodule.

* I think I have the tentative date!

* Fix format.

* Update version.

2 participants