Parsing Shenanigans

This repo is me experimenting with creating optimized parsers for .NET's new memory and I/O APIs:

Span
Memory
ReadOnlySequence
Pipelines (soon)

This consists of a pull parser (think XmlReader) for CSV that is more complex than an IndexOf but small enough to be easily re-written and optimized for each model. The parser is in-situ: it returns data from the existing buffers rather than copying. Each parser has not been fully tested for all inputs, so we should not place a huge importance, yet, on results of the benchmarks.

Best-case parsing

The first benchmark exercises the most optimal case for parsing: the entire document exists in a single complete memory segment. This tests the best-case overhead of each abstraction. Current results:

Method	Mean	Error	StdDev	Rank
Span	2.231 s	0.0087 s	0.0077 s	1
Memory	3.606 s	0.0393 s	0.0368 s	2
Sequence	16.760 s	0.0340 s	0.0301 s	3
SequenceReader	18.960 s	0.0456 s	0.0426 s	4

Stream parsing

The second benchmark exercises a more realistic scenario, where data is being streamed over network in chunks. Here the parsers must be able to move across memory segments, make as much progress as possible with data available, and be able to resume once more data comes through.

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
parsecore		parsecore
.gitignore		.gitignore
README.md		README.md
parsecore.sln		parsecore.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parsing Shenanigans

Best-case parsing

Stream parsing

About

Releases

Packages

Languages

scalablecory/parsecore

Folders and files

Latest commit

History

Repository files navigation

Parsing Shenanigans

Best-case parsing

Stream parsing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages