feat: Implement icebug-disk format#429
Conversation
69f32c5 to
4c5f9bf
Compare
|
How's this different from: |
updating description. Give me a minute |
|
Improvements:
However, the main motivation behind adding new classes is having separate entities for icebug-format because it may diverge from existing columnar implementations. As discussed earlier, we are planning to use/replace exisiting parquet or arrow classes for normal tables or delegate to duckDB anyway |
|
With this implementation, we can have a working db as an output from non-icedisk-to-icedisk-converter or user can pass the The main issue is path though. If the user moves the db or path, he/she might need to trigger Couple of clarifications:
|
|
The improvements are nice. But we can't have two implementations with 2x the bugs. Need to remove one and justify the other with benchmarks/data comparing the improvement. DuckDB and Lance could be additional ColumnarBaseTable implementations. They don't remove the need for parquet. |
It already creates a metadata table thusly: Yes, this is a good place to add
We can specify that The distinction between schema.cypher and catalog entry are mechanical. I don't think we should spend a lot of time specifying that or we'll be duplicating a significant chunk of docs.ladybugdb.com. However, getting other Graph databases and analytical packages to adopt icebug is an explicit goal. So I'd try to strike a balance between over-specifying (as opposed to just refer to reference implementation) and adding ladybug specific assumptions that would rub the non-ladybug people the wrong way. |
|
dataset addition PR: LadybugDB/dataset#1 |
I will run a benchmark tmrw and attach the results But, what about support for normal parquet tables? We can just remove them from docs until we change the existing classes to normal tables |
I will update the tool and add it in this repo tmrw |
Ladybug-Memory/icebug-format#2 (comment) If the issue is |
Not really about about the company. But since the cli tool necessarily converts to ladybugDB' icebug-format impl, It should live under LadybugDB org Here's my propposal:
|
|
The cli tool should be generic and usable with another database that wants to implement icebug-format. Previously people thought of specs as detailed docs that someone else would read to clean room implement an idea. This existed in the era of closed source software. An emerging consensus (probably in the last few weeks of social media) now claims that "AI slop" or a working prototype is a better idea than writing specs. While that idea has problems, it's likely how things are going to work in the near future. People are going use |
Understood. I thought it's gonnna be used only for ladybug |
Closes #469