Last prerelease changes #169
Merged
matgrioni added a commit that referenced this pull request on Dec 5, 2025
* Initial Implementation of Structural Parsing (#148)
* Step 1 via cc. Change parsing logic into a class.
* Move parsing out of Token class.
* Lint fixes.
* Move everything up to 3.12 and small start on structural schema logic.
* Some wip.
* Attempt before reformat.
* Update formatting logic properly.
* Pretty good progress I'd say.
* MFer, it works.
* Still need to do more hardening with handling and surfacing errors, but it works for ConlluToken unit and int tests.
* Getting a move on.
* Fix version number and add more to CHANGELOG.
* Working through it.
* More implementation improvements.
* Handle more errors and inconsistencies, improve error handling distinctions, etc.
* Annoying string reformatting.
* More improvements.
* Fix all tests, make an opt-out for all the file hashing, and add a __repr__ for the generated classes.
* Work on restructuring modules.
* Proper annotation of public API, and fix ABC issue on Conllable so that type will be able to be piped in from top level.
* Revert load API to avoid breaking change, and change __init__ imports, and add custom schema descriptor.
* Small linting changes.
* Change minimum required version to 3.13 and start moving around code for parsing consolidation.
* Get mypy to shut-it.
* Update loader.
* Change tests back.
* Move back and consider consolidated and time to bring up quality before next checkpoint.
* Add new command.
* Update versions again.
* Big linting changes.
* Rename to better match concepts and fix syntax error.
* Remove legacy parsing.
* Almost there.
* More safe name generation.
* Move up python version to take advantage of template strings and increase safety.
* Change things to t-strings for safety and nail down the design finally.
* Try to address linting errors.
* Maybe python 3.14 was a mistake :).
* Update documentation build version.
* Add tests, small optimization, and fix line length and bug.
* Resign to the fact that Parse and FormatError should only really be thrown for Token parsing.
* Continue being a boss.
* Fix linting and test coverage.
* Fix formatting.
* Change Sentence to be data holder (#149)
* WIP.
* Add more types.
* Use Parser Logic Instead of Load (#150)
* I have more work cut out for me than I thought.
* Forgot these.
* Tidy up things.
* Solve linting issues.
* Type Generic Sentence (#151)
* Getting something on the board.
* Final changes.
* Fix comment and tests.
* Update test coverage.
* Linting fixes.
* Fix up comments.
* Fix typo.
* Serializer instead of Conllable (#152)
* WIP need to shift focus.
* A lot of changes.
* Bring back tests.
* Keeping that coverage at 98do.
* Working through linting issues.
* More linting fixes.
* More linting.
* Create via changes. (#153)
* Create via changes.
* Yeah buddy.
* Code Touchups (#154)
* Address whitespac parsing.
* Continued moving around.
* Yeah buddy.
* More stuff for linting.
* Refactor inclusions.
* Change unit to unit test meaning.
* More stuff.
* lints and types.
* It still passes.
* Import Changes (#155)
* Rename things.
* Move to module level import.
* Change internal schema method to be private.
* Add small helper to avoid using compile directly.
* Final lint changes.
* Refactorings for pre-release (#156)
* Update via claude.
* First change.
* Claude is a lifesaver.
* Update examples.
* Create an example.
* Change to dev version to try to release.
* Allow for workflow dispatch.
* Update allowed version identifiers.
* Update this again.
* One more fix to versioning?
* Who knows at this point.
* Support variable width columns (#157)
* Various cleanups.
* Correct tests and better support for collapsed delimiters.
* Final touches on something.
* Fix linting.
* fix version.
* Create Unified Format Interface (#158)
* Working tests again.
* Rename to make things easier.
* Improve example.
* Add examples to linting.
* Cache Workflow Data Dependencies (#159)
* Update triggers and add cache.
* Add key since we need it.
* Small test.
* More fine-grained caching logic.
* Smarter hashing for real this time.
* Quotes are good.
* Be better!
* Use trusted publishing. (#160)
* Update definition.
* Upload and download artifacts for inter-job use.
* Pyproject.toml over setup.py (#161)
* It's always a good day when there are less files at the end.
* Add the new files for builds.
* Change config location.
* Update version parsing.
* Unpollute namespace.
* Remove needless utils.
* Remove old references to util.
* actually fix build issue.
* actually fix build issue.
* Add at least some info about changes.
* Fix workflow.
* Documentation Update and then a realization for more features (#163)
* Great WIP on documentation and also realizing more of what I actually need to support.
* Update tests to go over both formats (for the most part, need to do more).
* Joya.
* Some doc changes, prob need several iterations of going through.
* Small updates.
* Actually remove slots.
* This should actually work.
* Change TokenSchema for @Tokenspec (#164)
* Take in updates.
* Small bug fix.
* Fix test and naming.
* Rename to avoid potential collisions, pyc isn't that unique.
* Improve test.
* Beautiful.
* On our way.
* Add comparison package for analysis
* Annoying, but there's progress.
* I think that properly fixes all my issues.
* Fix some small things.
* Change defaults.
* Fixes on usage and naming.
* WIP.
* Add initial cache and improve tuple creation for optimizations.
* Small tweaks, not worth the time unfortunately.
* Final progress on benchmark.
* Fix internal naming to be more consistent.
* Better guarantees on codegen.
* Linting fixes.
* Round Trip Mapping For Singleton and Full Benchmark (#165)
* Handle the roundtrip issue with mapping and compact pairs.
* Small ergonomic improvement.
* Final changes and full benchmark suite.
* Go back to old algorithm, since it seems faster.
* Linter pass.
* Generic Sentence Feature (#166)
* Beautiful.
* More awesomeness.
* Rename.
* Finish comment.
* Comment and script updates.
* Add Final Features (#167)
* Have to update documentation a lot, help claude!
* Fix linting.
* Final changes.
* Fix test.
* Change version back.
* Add performance information and add exclusions to benchmark.
* Last prerelease changes (#169)
* Remove csv data from git diffs, improve documentation, and add 2.17
* Update documentation as well as I can.
* I think that properly covers things.
* Fix automodule.
* I think I have the tentative date!
* Fix format.
* Update version.
No description provided.