refactor(core): simplify and future-proof database schema #34

ayuhito · 2024-06-22T00:40:23Z

Overview

This is a major refactor to remove a lot of the complexity focused on optimising storage. As DuckDB uses FSST compression on strings, a lot of the manual enum logic we had to encourage better bitpacking wasn't necessary and thus made the code more complicated. That had also made filtering much more difficult for these enums as we could not run our string search algorithms on them. This change was also applied to countries, in which we now use full country names instead of country codes.

Languages

Additionally, to partially address #10 and #11, I've renamed a bunch of the columns in advance so we don't have an additional migration when we add these features post-release. This also changed how we processed languages, only storing the base language and not the dialect (until we address the mentioned issues).

ayuhito added 6 commits June 21, 2024 00:46

fix(core): use base language tag instead of dialect

1acf7a8

feat: early prep for migrations

4fe40d3

feat: simply languages structure

7b89f15

fix: remove fixed filter complexity from dashboard

1b7de8b

fix: sql binder errors

82ffb81

test: update golden fixture

67205ac

ayuhito marked this pull request as ready for review June 22, 2024 09:25

ayuhito merged commit 81a8678 into main Jun 22, 2024
5 checks passed

ayuhito deleted the fix/lang branch June 22, 2024 09:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(core): simplify and future-proof database schema #34

refactor(core): simplify and future-proof database schema #34

ayuhito commented Jun 22, 2024

refactor(core): simplify and future-proof database schema #34

refactor(core): simplify and future-proof database schema #34

Conversation

ayuhito commented Jun 22, 2024

Overview

Languages