ingestr is a CLI tool that seamlessly copies data between databases with a single command.

bruin-data/ingestr

Copy data from any source to any destination without any code


ingestr is a command-line app that allows you to ingest data from any source into any destination using simple command-line flags, no code necessary.

  • ✨ copy data from your database into any destination
  • ➕ incremental loading: append, merge or delete+insert
  • 🐍 single-command installation

ingestr takes away the complexity of managing a backend or writing code to ingest data: simply run the command and watch the data land in its destination.

Installation

We recommend using uv to run ingestr.

pip install uv
uvx ingestr

Alternatively, if you'd like to install it globally:

uv pip install --system ingestr

While installation with vanilla pip is possible, it's an order of magnitude slower.

Quickstart

ingestr ingest \
    --source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
    --source-table 'public.some_data' \
    --dest-uri 'bigquery://<your-project-name>?credentials_path=/path/to/service/account.json' \
    --dest-table 'ingestr.some_data'

That's it.

This command:

  • gets the table public.some_data from the Postgres instance.
  • uploads this data to your BigQuery warehouse under the schema ingestr and table some_data.
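
The quickstart copies the whole table on each run. As a rough sketch of the incremental loading mentioned in the feature list, the same command could be extended as below. It assumes the `--incremental-strategy`, `--incremental-key` and `--primary-key` flags described in the ingestr documentation; the column names `updated_at` and `id` are hypothetical, and the `RUN_INGESTR` guard is only an illustration so that the snippet does not touch a real database by accident.

```shell
# Hypothetical column names: adjust them to your own table.
INCREMENTAL_STRATEGY='merge'   # one of: append, merge, delete+insert
INCREMENTAL_KEY='updated_at'   # assumed timestamp column that marks changed rows
PRIMARY_KEY='id'               # assumed unique key, used to match rows on merge

run_incremental_copy() {
    # Same source and destination as the quickstart, plus the incremental flags.
    ingestr ingest \
        --source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
        --source-table 'public.some_data' \
        --dest-uri 'bigquery://<your-project-name>?credentials_path=/path/to/service/account.json' \
        --dest-table 'ingestr.some_data' \
        --incremental-strategy "$INCREMENTAL_STRATEGY" \
        --incremental-key "$INCREMENTAL_KEY" \
        --primary-key "$PRIMARY_KEY"
}

# Only run against real databases when explicitly asked to.
if [ "${RUN_INGESTR:-0}" = "1" ]; then
    run_incremental_copy
else
    echo "dry run: would load public.some_data with strategy '$INCREMENTAL_STRATEGY'"
fi
```

Set `RUN_INGESTR=1` in the environment to execute the copy. Check the documentation for which strategies require a primary key before relying on this.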

Documentation

You can see the full documentation here.

Community

Join our Slack community here.

Contributing

Pull requests are welcome. However, please open an issue first to discuss what you would like to change. We may be able to offer you help and feedback regarding any changes you would like to make.

Note

After cloning ingestr, make sure to run make setup to install the Git hooks.

Supported sources & destinations

                        Source  Destination
Databases
BigQuery                ✅      ✅
ClickHouse              ✅      ✅
Databricks              ✅      ✅
DuckDB                  ✅      ✅
DynamoDB                ✅      -
Local CSV file          ✅      ✅
Microsoft SQL Server    ✅      ✅
MongoDB                 ✅      ❌
MySQL                   ✅      ❌
Oracle                  ✅      ❌
Postgres                ✅      ✅
Redshift                ✅      ✅
SAP Hana                ✅      ❌
Snowflake               ✅      ✅
SQLite                  ✅      ❌
Platforms
Adjust                  ✅      -
Airtable                ✅      -
Amazon Kinesis          ✅      -
Apache Kafka            ✅      -
AppsFlyer               ✅      -
App Store               ✅      -
Asana                   ✅      -
Chess.com               ✅      -
Facebook Ads            ✅      -
Github                  ✅      -
Gorgias                 ✅      -
Google Sheets           ✅      -
Google Ads              ✅      -
Google Analytics        ✅      -
Intercom                ✅      -
Klaviyo                 ✅      -
LinkedIn Ads            ✅      -
Notion                  ✅      -
Personio                ✅      -
S3                      ✅      -
Salesforce              ✅      -
Shopify                 ✅      -
Slack                   ✅      -
Stripe                  ✅      -
TikTok Ads              ✅      -
Zendesk                 ✅      -

Feel free to create an issue if you'd like to see support for another source or destination.
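
Because every pairing in the table goes through the same ingest command, switching destinations should only mean changing the destination URI and table. The sketch below points the quickstart source at a local DuckDB file instead of BigQuery. The `duckdb:///` URI form, the file path and the table name `main.some_data` are assumptions for illustration, so confirm the exact URI format for your destination in the documentation.

```shell
# Assumed URI form and hypothetical local path for a DuckDB destination.
DEST_URI='duckdb:///tmp/ingestr_demo.db'
DEST_TABLE='main.some_data'    # hypothetical schema.table in the DuckDB file

copy_to_duckdb() {
    # Same source as the quickstart; only the destination flags differ.
    ingestr ingest \
        --source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
        --source-table 'public.some_data' \
        --dest-uri "$DEST_URI" \
        --dest-table "$DEST_TABLE"
}

# Only run against a real source when explicitly asked to.
if [ "${RUN_INGESTR:-0}" = "1" ]; then
    copy_to_duckdb
else
    echo "dry run: would copy public.some_data to $DEST_TABLE at $DEST_URI"
fi
```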

Acknowledgements

This project would not have been possible without the amazing work done by the SQLAlchemy and dlt teams. We relied on their work to connect to various sources and destinations, and built ingestr as a simple, opinionated wrapper around their work.