Experimental JSON CAS support #168

Closed
reckart opened this issue Aug 13, 2021 · 0 comments · Fixed by #169

reckart commented Aug 13, 2021

Describe the solution you'd like
Experimental support for the JSON CAS format being developed under https://issues.apache.org/jira/browse/UIMA-6266

Cf. apache/uima-uimaj#137

@reckart reckart self-assigned this Aug 13, 2021
reckart added a commit that referenced this issue Aug 13, 2021
- Added very basic JSON CAS support
- No support for type systems yet
- No support for lenient loading
- Remove Cas:NULL via type name instead of simply purging the FS with ID 0 (which may not be a Cas:NULL FS)
- Added various constants for type names and feature names in the Cas class (analogous to the Apache UIMA Java SDK impl)
- WIP
reckart added a commit that referenced this issue Aug 13, 2021
- Fixed bad PyDoc comment
reckart added a commit that referenced this issue Aug 13, 2021
- Fixed linter error because type hint was referring to a dynamically created type
reckart added a commit that referenced this issue Aug 13, 2021
- Roll back change of Sofa.sofaArray range type from uima.cas.ByteArray back to uima.cas.TOP which is indeed the range type also used in the Apache UIMA Java SDK - despite only uima.cas.ByteArray being acceptable...
reckart added a commit that referenced this issue Aug 13, 2021
- Added generator for random CASes
- Added JSON tests using random CAS generator
- Added support for (de)serializing type system information in the JSON format
- Move the type/feature name constants from Cas to typesystem.py
reckart added a commit that referenced this issue Aug 13, 2021
- Added another generator for random CASes
- Added more tests
- Commented out all testing of arrays in the new generator, since array handling in cassis seems to have a few conceptual problems which need to be looked at first
reckart added a commit that referenced this issue Aug 13, 2021
- Revert change to stripping the null FS
- Changed reference data so that IDs start at 1 and not at 0 leaving 0 reserved for the null FS
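The ID convention in the last commit can be pictured with a minimal sketch. It assumes a hypothetical `FsIdAllocator` helper and `resolve` function, neither of which is part of cassis; the only thing taken from the commit message is that 0 stays reserved for the null FS and real IDs start at 1.

```python
import itertools

NULL_FS_ID = 0  # assumption: 0 is reserved for the null FS and never assigned


class FsIdAllocator:
    """Hypothetical helper that hands out feature structure IDs starting at 1."""

    def __init__(self) -> None:
        self._counter = itertools.count(start=NULL_FS_ID + 1)

    def next_id(self) -> int:
        return next(self._counter)


def resolve(fs_by_id: dict, ref: int):
    """Resolve a reference; the reserved ID 0 maps to None (the null FS)."""
    if ref == NULL_FS_ID:
        return None
    return fs_by_id[ref]
```

With this layout, a reference of 0 never collides with a real feature structure, so the null FS does not have to be stripped by type name or by position.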
@jcklie jcklie added this to the 0.6.0 milestone Aug 14, 2021
@jcklie jcklie linked a pull request Aug 14, 2021 that will close this issue
2 tasks
reckart added a commit that referenced this issue Aug 19, 2021
- Fix array support
- Enable array tests
reckart added a commit that referenced this issue Aug 20, 2021
- Change view members field name
reckart added a commit that referenced this issue Aug 25, 2021
- Formatting
- Removed unused import
reckart added a commit that referenced this issue Aug 25, 2021
- Adjust to changes from #190
reckart added a commit that referenced this issue Aug 25, 2021
- Adjust to changes from #190
reckart added a commit that referenced this issue Aug 25, 2021
- Tune performance when serializing JSON to string instead of writing to disk
- Added rudimentary performance "test"
reckart added a commit that referenced this issue Sep 10, 2021
- Better check whether adding a TextIOWrapper is necessary during serialization
- Fixed bad access to element type name
- Formatting
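As an illustration of the `TextIOWrapper` check mentioned above, here is a minimal sketch using only the standard library. The function name `write_json` and the rule "anything that is not a text stream is treated as binary" are assumptions for this example, not the actual cassis implementation.

```python
import io
import json


def write_json(data, sink) -> None:
    """Write data as JSON to a text or binary file-like object."""
    if isinstance(sink, io.TextIOBase):
        # Text sinks (e.g. io.StringIO) accept str directly; no wrapper needed.
        json.dump(data, sink, ensure_ascii=False)
        return

    # Assumption: anything else is a binary sink, so encode via a wrapper.
    wrapper = io.TextIOWrapper(sink, encoding="utf-8", write_through=True)
    try:
        json.dump(data, wrapper, ensure_ascii=False)
        wrapper.flush()
    finally:
        # Detach so the caller's sink is not closed when the wrapper is collected.
        wrapper.detach()
```

Skipping the wrapper for text sinks avoids an unnecessary encode step when serializing to a string, which is presumably the kind of case the performance commit had in mind.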
reckart added a commit that referenced this issue Sep 20, 2021
- Better test if using a TextIOWrapper is really necessary
reckart added a commit that referenced this issue Sep 20, 2021
- Better check whether adding a TextIOWrapper is necessary during serialization
- Fixed bad access to element type name
- Formatting
reckart added a commit that referenced this issue Sep 20, 2021
- Better test if using a TextIOWrapper is really necessary
reckart added a commit that referenced this issue Sep 20, 2021
- Work around issues with cas_to_comparable_text and FSArrays
reckart added a commit that referenced this issue Sep 27, 2021
…ithub.com/dkpro/dkpro-cassis into feature/168-Experimental-JSON-CAS-support

* 'feature/168-Experimental-JSON-CAS-support' of https://github.com/dkpro/dkpro-cassis:
  No issue. Formatting.
  #215 - Ability to exclude types from cas_to_comparable_text
  #212 - Allow loading/saving XMI/typesystems from/to Path
  #211 - Serializing an FSArray without any elements breaks
  #212 - Allow loading/saving XMI/typesystems from/to Path
  #168 - Experimental JSON CAS support
  #168 - Experimental JSON CAS support
  #168 - Experimental JSON CAS support
  #209 - Parsing an array that was serialized using multipleReferencesAllowed=true fails
  - Do not execute performance "tests" when running make test
  - Update JSON reference data with new data from UIMA Java SDK, including CAS examples using emojis and other Unicode characters
  - Enabled character offset conversion on import/export in JSON (de)serializer
  #209 - Parsing an array that was serialized using multipleReferencesAllowed=true fails
reckart added a commit that referenced this issue Sep 27, 2021
- Support for floating point special values in JSON
- Support for not serializing the full type system in JSON but only the minimal or none at all
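Standard JSON has no literals for NaN or infinity, so a serializer needs some convention for them. The sketch below shows one possible approach; the string spellings and the helper names `encode_float` and `decode_float` are assumptions for illustration and are not taken from this issue or from the JSON CAS format itself.

```python
import json
import math

# Assumption: these string spellings are illustrative, not taken from the issue.
_SPECIALS = {"NaN": math.nan, "Infinity": math.inf, "-Infinity": -math.inf}


def encode_float(value: float):
    """Return a JSON-safe representation of a float."""
    if math.isnan(value):
        return "NaN"
    if math.isinf(value):
        return "Infinity" if value > 0 else "-Infinity"
    return value


def decode_float(value) -> float:
    """Invert encode_float for values read back from JSON."""
    if isinstance(value, str):
        return _SPECIALS[value]
    return float(value)
```

Passing `allow_nan=False` to `json.dumps` is a useful guard here: it raises a `ValueError` if an unencoded special value slips through instead of emitting non-standard JSON.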
@reckart reckart modified the milestones: 0.6.0, 0.7.0 Sep 27, 2021
reckart added a commit that referenced this issue Dec 12, 2021
- Added mention about non-final status in README file
reckart added a commit that referenced this issue Dec 12, 2021