This project converts JSON files to CSV for database data imports.
-
prepare env
$ pyenv local 3.11.6
-
raw data
download json files to ./tag_data
-
exec
$ cd scripts; sh run.sh
-
extract
extract n item from each ./tag_data/*.json
-
clean
- standardize all the json to one structure
- remove
\n - use eamil as orcid if orcid is null
-
convert tag to id
read index to map, then convert the text to id
-
export CSV for PostgreSQL insertion
refer to table strcture of academic_products and tags, convert the taged json to csv.