TeslaSCIToolkit (abbrev. tesci) is a scientific mapping tool that comes with the following features:
- Merging data from multiple citation databases
- Restricting access to sensitive columns in data sources with aggregations
- Exporting transformed data into other repositories
- CI/CD integration, currently GitHub Actions
For examples and use-cases, see examples directory.
To create an aggregation of simple.csv based on average salary and age.
tesci start -d simple.csv -o exported.csv
tesci aggregate avg -c salary -a avg_salary
tesci aggregate avg -c age -a avg_age
tesci applyaggregate:
- alias: avg_salary
column: salary
function: avg
- alias: avg_age
column: age
function: avg
data:
dest: exported.csv
src: simple.csvThe result is a transformation from simple.csv to exported.csv:
|
→ |
|
After retrieving data sources from citation databases of your choice, place the databases in a directory of your choice. Then, specify the configuration used for merging. An example of a configuration is here.
After specifying your configuration choices, merge can then by run with:
tesci similarity merge --src PATH --src PATH --dest DIR
where PATH and DIR refer to relative or absolute filesystem paths and directories.
If you find tesci useful in your research, please support our work by citing our paper.
Nikolić, D., Ivanović, D., & Ivanović, L. (2024).
An open-source tool for merging data from multiple citation databases.
Scientometrics, 1-23.
https://doi.org/10.1007/s11192-024-05076-2
Licensed under either of Apache License, Version 2.0 or MIT license.