Feature: Switch from CSV to sqlite3 #89
Comments
- Update Paths model to have optional repository_names
- Update Paths model to have optional repository_name field
- Create data directory for sqlite db if necessary
The referrer tables do not have a timestamp. As such, we need to merge them and include the date the code was run (taken from the filename). For my case, I am checking whether a simple merge produces duplicates. In most cases the duplicate entries are identical records, except for the following: `2021-03-17,Evolution-of-Galaxies,github.com,6,1` (the first entry gave `4,1`).
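The duplicate check described above could be sketched roughly as follows. This is only an illustration, not the project's code: the file names, the `YYYY-MM-DD` naming convention, the tuple layout, and the function names `run_date_from_filename` and `merge_referrer_runs` are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Assumed convention: the run date appears in the file name as YYYY-MM-DD.
DATE_IN_NAME = re.compile(r"(\d{4}-\d{2}-\d{2})")


def run_date_from_filename(filename):
    """Return the first YYYY-MM-DD date found in a file name."""
    match = DATE_IN_NAME.search(filename)
    if match is None:
        raise ValueError(f"no date found in {filename!r}")
    return match.group(1)


def merge_referrer_runs(runs):
    """Merge referrer rows from several runs and report conflicting duplicates.

    `runs` maps a file name to a list of
    (repository_name, site, views, unique_visitors) tuples.
    Returns (merged, conflicts). For a repeated key, the later entry
    overwrites the earlier one, and the overwritten counts are recorded
    in `conflicts` only when they differ.
    """
    merged = {}
    conflicts = defaultdict(list)
    for filename, rows in runs.items():
        date = run_date_from_filename(filename)
        for repo, site, views, uniques in rows:
            key = (date, repo, site)
            counts = (views, uniques)
            if key in merged and merged[key] != counts:
                conflicts[key].append(merged[key])
            merged[key] = counts
    return merged, dict(conflicts)


# Hypothetical file names; the counts mirror the conflicting record above.
runs = {
    "2021-03-17_run1_referrers.csv": [("Evolution-of-Galaxies", "github.com", 4, 1)],
    "2021-03-17_run2_referrers.csv": [("Evolution-of-Galaxies", "github.com", 6, 1)],
}
merged, conflicts = merge_referrer_runs(runs)
```

Identical duplicates collapse silently, while records such as the `4,1` versus `6,1` case end up in `conflicts` for manual review.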
One limitation of this feature right now is that each run overlaps with records already in the SQLite db. We need to address this by ensuring that the new numbers are used. There was an ask for this in
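One possible way to make the newest numbers win is to give the table a composite primary key and let SQLite replace the old row on conflict. The sketch below uses the standard-library `sqlite3` module; the table name `referrers`, its columns, and the function name `upsert_referrers` are assumptions for illustration, not the project's actual schema.

```python
import sqlite3


def upsert_referrers(conn, rows):
    """Insert referrer rows, replacing any existing row with the same key.

    `rows` is an iterable of
    (date, repository_name, site, views, unique_visitors) tuples.
    """
    # Hypothetical schema: one row per (date, repository_name, site).
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS referrers (
            date TEXT NOT NULL,
            repository_name TEXT NOT NULL,
            site TEXT NOT NULL,
            views INTEGER NOT NULL,
            unique_visitors INTEGER NOT NULL,
            PRIMARY KEY (date, repository_name, site)
        )
        """
    )
    # INSERT OR REPLACE deletes the conflicting row and inserts the new one,
    # so the counts from the most recent run are the ones kept.
    conn.executemany(
        """
        INSERT OR REPLACE INTO referrers
            (date, repository_name, site, views, unique_visitors)
        VALUES (?, ?, ?, ?, ?)
        """,
        rows,
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
upsert_referrers(conn, [("2021-03-17", "Evolution-of-Galaxies", "github.com", 4, 1)])
upsert_referrers(conn, [("2021-03-17", "Evolution-of-Galaxies", "github.com", 6, 1)])
stored = conn.execute("SELECT views, unique_visitors FROM referrers").fetchall()
```

With this approach a second run over the same date does not add rows; it only overwrites the counts for keys that already exist.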
We currently generate individual tables. This could be improved by using a SQL library. We could perhaps use `SQLModel` to support `pydantic` and `pandas`.

- `SQLModel` models for `clone`, `traffic` and `referring`
- `pydantic` models for each data view/stats
- `db` module
- `configure` function for setting up `SQLAlchemy` `engine`
- `query` function to be able to query SQLite by `repository_name` and `date`
- `migrate_csv` that will work with `merged` and individual run CSV files (difference is with `skip_rows` settings).
- `sqlite3.db` `sqlite3` database at some point when the `gts_run_all_repos` script is executed
- `pandas` `DataFrame` to construct figures/charts
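As a rough illustration of the `configure` and `query` functions listed above, here is a minimal sketch. Note that it deliberately swaps in the standard-library `sqlite3` module instead of `SQLModel`/`SQLAlchemy` so that it runs without extra dependencies; the `traffic` table layout and all column names are assumptions.

```python
import sqlite3
from pathlib import Path


def configure(db_path=":memory:"):
    """Open a SQLite connection, creating the data directory and table if needed."""
    if db_path != ":memory:":
        Path(db_path).parent.mkdir(parents=True, exist_ok=True)
    conn = sqlite3.connect(db_path)
    conn.row_factory = sqlite3.Row  # lets rows be converted to dicts
    # Hypothetical table layout for the traffic view.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS traffic (
            date TEXT NOT NULL,
            repository_name TEXT NOT NULL,
            views INTEGER NOT NULL,
            unique_visitors INTEGER NOT NULL,
            PRIMARY KEY (date, repository_name)
        )
        """
    )
    return conn


def query(conn, repository_name=None, date=None):
    """Return traffic rows as dicts, optionally filtered by repository and date."""
    clauses, params = [], []
    if repository_name is not None:
        clauses.append("repository_name = ?")
        params.append(repository_name)
    if date is not None:
        clauses.append("date = ?")
        params.append(date)
    sql = "SELECT date, repository_name, views, unique_visitors FROM traffic"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    sql += " ORDER BY date, repository_name"
    return [dict(row) for row in conn.execute(sql, params)]
```

The list of dicts returned by `query` could be passed directly to `pandas.DataFrame(...)` to build figures. A `SQLModel` version would presumably replace the hand-written SQL with model classes and a session, but that is not shown here.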