Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dom bridge lexbor #32

Closed
wants to merge 7 commits into from
Closed

Dom bridge lexbor #32

wants to merge 7 commits into from

Conversation

nielsdos
Copy link
Owner

@nielsdos nielsdos commented Jul 18, 2023

TODO:

  • more tests
  • loadHTML5File
  • error reporting
  • custom memory allocators
  • Windows support
  • Hook up getLineNo? Set column too ?
  • Recover flag testen en eventueel error reporting adhv dat aanpassen is only sensible for XML, HTML auto-recovers by default
  • Measure increase in dom.so size
  • Perform some benchmarks of parsing on top X websites?
  • Minder lelijke error handling hacks?
  • codecov ignore file?
  • Hook up external entity loader Actually, let's keep this for a secondary RFC vote because I don't want to waste time on it if it isn't wanted.
  • README.REDIST.BINS
  • encoding/decoding
  • interaction ->encoding test
  • serializer
  • edge cases encoding (long documents, euc-jp finish)
  • serializer tests met verschillende root types
  • ifdefs
  • escape tests, nbsp tests, ...
  • new column handling
  • Encoding finalising test?
  • Rebase and clean history
  • element namespace handling (xhtml, svg, mathml)
  • default register xhtml namespace in xpath? NO_DEFAULT_NS migration
  • optimize jumping in utf-8 parsing?
  • Also apply jumping in Lexbor, but make sure the end pointer ends at the predetermined place
  • Put benchmark stuff in gist

Try it out instructions: https://gist.github.com/nielsdos/139099c54ddc4a43bc66f4c0b7ef02fc

@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 4 times, most recently from 3a007eb to a4aed3d Compare August 14, 2023 18:21
@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 2 times, most recently from 9ee6d17 to aa656ee Compare August 15, 2023 21:04
@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 2 times, most recently from ee6a50c to c19232f Compare August 20, 2023 21:35
@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 4 times, most recently from 2c39651 to 042b862 Compare August 26, 2023 11:33
@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 7 times, most recently from 9ed70c3 to 7c8b59a Compare August 31, 2023 20:35
@nielsdos nielsdos force-pushed the dom-bridge-lexbor branch 2 times, most recently from f875250 to 5c22671 Compare September 2, 2023 15:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
1 participant