bigdata
Here are 334 public repositories matching this topic...
ThereForYou: Your mental health ally. Kai, our AI assistant, offers compassionate support. Track your mood trends, find solace in a secure community, and access crisis resources swiftly. We're here to empower your journey towards improved well-being, leveraging technology for a brighter tomorrow.
-
Updated
Jul 11, 2024 - Python
This project implements an end-to-end techstack for a data platform, for local development.
-
Updated
Jul 10, 2024 - Python
Possibly the fastest DataFrame-agnostic quality check library in town.
-
Updated
Jul 9, 2024 - Python
General purpose framework to run CMS experiment workflows on HDFS/Spark platform
-
Updated
Jul 8, 2024 - Python
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
-
Updated
Jul 8, 2024 - Python
Web scraper that extracts all daily tennis matches, and analyse them to predict the probability in the "First Set Player To Break Serve" market.
-
Updated
Jul 7, 2024 - Python
This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data engineering, data analysis and data science parts.
-
Updated
Jul 5, 2024 - Python
A cross-platform Echarts dashboard application,Powerpoint-like, designed based on Excel data, with the capability to update data remotely.supports line, spline, area, areaspline, column, bar, pie, scatter, angular gauges, arearange, areasplinerange, columnrange, bubble, box plot, error bars, funnel, waterfall. 支持柱状图、条形图、折线图、曲线图、折线填充图、曲线填充图、气泡图、扇形图。
-
Updated
Jul 3, 2024 - Python
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
-
Updated
Jul 1, 2024 - Python
⏱ Real-Time Sentiment Analysis using PySpark and simulation of Twitter/X API using FastAPI
-
Updated
Jun 13, 2024 - Python
An AWS based solution using AWS CloudWatch and AWS Lambda based on Python to automatically terminate AWS EMR clusters that have been idle for a specified period of time.
-
Updated
Jun 5, 2024 - Python
Interview coding questions and experiences for several companies merged into one repository
-
Updated
Jun 5, 2024 - Python
This project aims to propose and evaluate the performance of the Entity Component System (ECS) architecture for Big Data and AI pipelines.
-
Updated
Jun 3, 2024 - Python
Django app for managing long-running data operations on large and/or schemaless databases
-
Updated
Jun 3, 2024 - Python
A python library with scripts and helpers classes for quantms workflow
-
Updated
May 31, 2024 - Python
Apache Airavata Django Portal Framework
-
Updated
May 31, 2024 - Python
Improve this page
Add a description, image, and links to the bigdata topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the bigdata topic, visit your repo's landing page and select "manage topics."