Skip to content
#

datacleaning

Here are 74 public repositories matching this topic...

amora-data-build-tool

Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.

  • Updated Jun 18, 2024
  • Python
data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

  • Updated Jun 18, 2024
  • Python

Data analysis and visualization of New York Yellow Taxi Trip data, The core objective of this is to find the most pickups, drop-offs of public based on their location, time of most traffic and how to overcome the needs of the public, by using BigData Technologies and Tableau.

  • Updated Feb 4, 2021
  • Python

Improve this page

Add a description, image, and links to the datacleaning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datacleaning topic, visit your repo's landing page and select "manage topics."

Learn more