UC Davis Distributed Computing with Spark SQL (with Databricks) and Databricks Apache Spark SQL for Data Analysts
Updated Jul 10, 2021 - HTML
Apply Data Engineering to Personal Finance
Collect and stream data, and notify the user on Telegram
Taking the role of a data engineer at a fictional e-commerce startup, this project analyzes web server logs to count the number of product pages visited and the number of items added to the cart.
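The counting task above can be sketched in plain Python. This is a minimal stand-in, not the project's actual code: the combined-log-format lines, the `/product/...` and `/cart/add` URL conventions, and the helper names are all assumptions for illustration.

```python
import re
from collections import Counter

# Extract the request path from a combined-log-format line (assumed format).
LOG_PATTERN = re.compile(r'"(?:GET|POST) (?P<path>\S+) HTTP/[\d.]+"')

def count_page_hits(log_lines):
    """Count requests per URL path across web server log lines."""
    hits = Counter()
    for line in log_lines:
        match = LOG_PATTERN.search(line)
        if match:
            hits[match.group("path")] += 1
    return hits

def summarize(log_lines):
    """Return (product page visits, cart additions), assuming the
    hypothetical /product/... and /cart/add URL conventions."""
    hits = count_page_hits(log_lines)
    product_visits = sum(n for path, n in hits.items() if path.startswith("/product/"))
    cart_adds = sum(n for path, n in hits.items() if path.startswith("/cart/add"))
    return product_visits, cart_adds

# Invented sample log lines for demonstration.
logs = [
    '10.0.0.1 - - [10/Jul/2021:10:00:00] "GET /product/42 HTTP/1.1" 200 512',
    '10.0.0.2 - - [10/Jul/2021:10:00:05] "GET /product/7 HTTP/1.1" 200 834',
    '10.0.0.1 - - [10/Jul/2021:10:00:09] "POST /cart/add HTTP/1.1" 302 0',
    '10.0.0.3 - - [10/Jul/2021:10:00:12] "GET /index.html HTTP/1.1" 200 1024',
]
print(summarize(logs))  # → (2, 1)
```

At scale the same grouping would typically be expressed as a Spark SQL `GROUP BY` over the parsed paths rather than an in-memory `Counter`.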
A comprehensive repository of blog posts, in-depth documentation, and resources exploring various facets of data engineering, covering ETL processes and database management as well as orchestration tools, data quality, monitoring, and deployment strategies.
A deep learning model built with TensorFlow to automate the classification of financial documents. A bidirectional LSTM RNN categorizes the documents accurately, and a user-friendly Streamlit application supports efficient document management, all deployed on the Hugging Face platform for seamless integration.
In this Human Resources Analytics project, the goal is to answer key questions about talent management and employee turnover at a fictional company.
Bringing you the posts that matter.
Speed comparisons for dataframe libraries
A data-pipeline for high-resolution power meter data
Sentiment analysis of the #VisitRwanda hashtag on Twitter, an eco-tourism campaign promoting tourist attractions in Rwanda.
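A toy lexicon-based scorer illustrates the idea behind tweet sentiment analysis; the real project likely uses a trained model or a sentiment library, and the word lists and sample tweets here are invented.

```python
# Tiny hand-built sentiment lexicons (assumptions, not the project's data).
POSITIVE = {"beautiful", "amazing", "love", "stunning", "great"}
NEGATIVE = {"bad", "boring", "disappointing", "crowded"}

def sentiment(tweet: str) -> str:
    """Label a tweet by counting positive vs. negative lexicon hits."""
    words = {w.strip(".,!?#").lower() for w in tweet.split()}
    score = len(words & POSITIVE) - len(words & NEGATIVE)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

tweets = [
    "Stunning views at Volcanoes National Park! #VisitRwanda",
    "The queue was disappointing, way too crowded #VisitRwanda",
]
print([sentiment(t) for t in tweets])  # → ['positive', 'negative']
```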
Data science and Spark applied to seven hypotheses about the DJIA stock index and daily news.
Automated Tool for Optimized Modelling
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.