zsvlib

Zero runtime-allocation csv handling library in Zig

Work is (indefinitely) deferred on this lib until shortcomings of C implementation in https://github.com/liquidaty/zsv become evident with motivating use cases. I'll probably implement something much simpler inspirated by nix-index as project database table. However, I'll be busy with a test runner REPL that can handle custom logic and allows user to specify dependencies in the near future.

use cases

use case 1: tool to compute+dump typed description for necessary file sizes
use case 2: tool to compute+dump typed description for necessary memory sizes
use case 2: tool to combine multiple typed descriptions for necessary memory sizes
use case 3: comptime layout generation for memory per csv line or whole file
use case 4: library to use optionally SIMD for parsing data
use case 5: library to write data to schemas.

Opt-in of allocations with size must be possible.

non-goal(for now): csv variations compatibility.
non-goal: support fancy or many types
non-goal: conversion to other formats
non-goal: dealing with other string encoding than ascii and utf8
non-goal: interaction besides the csv header generation
non-goal(for now): updating offset to peek and write to file positions

instructions

# for benchmarks fetch zsv
git clone https://github.com/liquidaty/zsv
# todo

planned interfaces

todo

status

wip.

todos

obsoletion plan

no plan for obsolation

notes

encoding examples for csv input without schema files

"string1"string2" is 1 continuous string number123 is another continuous string 65535 is a number, which is for simplicity of type Int (arbitrary precision integers) or at request u16 (depending on what min and max values of same range are). TODO: parsing Int? TODO: field delimiter at comptime vs runtime TODO: string symbols? TODO: Is there "a best default" to choose at comptime? perf?

other implementations

no realistic data handling https://github.com/geofflangdale/simdcsv
extensible, in C and MIT: https://github.com/liquidaty/zsv
simple: https://github.com/dbro/csvquote

tricks

https://nullprogram.com/blog/2021/12/04/ bit hacks for perf https://wunkolo.github.io/post/2020/05/pclmulqdq-tricks/ "Speculative Distributed CSV Data Parsing for Big Data Analytics" by Ge et al. https://www.microsoft.com/en-us/research/publication/speculative-distributed-csv-data-parsing-for-big-data-analytics/ "Instant Loading for Main Memory Databases" by Mühlbauar et al. https://www.semanticscholar.org/paper/Instant-Loading-for-Main-Memory-Databases-M%C3%BChlbauer-R%C3%B6diger/a1b067fc941d6727169ec18a882080fa1f074595?p2df

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.zig		build.zig

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

zsvlib

use cases

instructions

planned interfaces

status

todos

obsoletion plan

notes

encoding examples for csv input without schema files

other implementations

tricks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

zsvlib

use cases

instructions

planned interfaces

status

todos

obsoletion plan

notes

encoding examples for csv input without schema files

other implementations

tricks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages