Light weight data transfer containers with Command-line interface
DataMule requires Python 3 to run, navigate to the same directory as setup.py and do
$ pip install .
To run the app, make sure you are in the same directory as before. And then run
$ datamule run nyu_staff postgres
To see all currently running data processes, run
$ datamule ps
To remove a dataset, run
$ datamule rm nyu_staff postgres
The yaml file is made by the community for the users. It specifies any protocol that the dataset could be retrieved from, which is what we used in our application.
get_data:
protocol: 'rest'
connectors:
- connector:
connection_string: 'https://sandbox.api.it.nyu.edu/staff-exp/users'
table_name: "nyu_staff"
delta:
type: 'page'
value: 1
schema: 'auto'
auth: True
format_type: 'json'