big-data-analytics

Here are 55 public repositories matching this topic...

ingef / conquery

Visual, interactive queries against big databases

Updated Jul 16, 2024
Java

Big Data Analysis of NYC Fire Incident data to analyze casual relationship between fires, govt. inspections, socio-ecnomic factors and enviroment. Used Hadoop MapReduce for data pre-processing, Trino for complex queries and Tableau for visualizations and interactive dashboards

nyc big-data hadoop etl mapreduce big-data-analytics nyc-opendata

Updated Jun 24, 2024
Java

Dare-marvel / Big-Data-Analytics--BDA--

Star

💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍

big-data hadoop tableau case-study big-data-analytics mapreduce-java tableau-dashboards walmart-case-study

Updated May 31, 2024
Java

GMAP / DSPBench

Star

a suite of benchmark applications for distributed data stream processing systems

big-data apache-spark storm data-stream bigdata evaluation stream-processing spark-streaming apache-storm apache-flink experiments big-data-analytics

Updated May 27, 2024
Java

yaoguangluo / ChromosomeDNA

Star

《DNA元基催化与肽计算》在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.

search-engine data-science database prediction dnn plsql dna vision sorting-algorithms shell-script metabolism catalyst word-segmentation big-data-analytics nerotechnology etl-pipeline vpcs-rest dataswap

Updated Apr 25, 2024
Java

marcocolangelo / Big-Data-processing-and-Analytics

Star

The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark

java spark spark-streaming data-analysis hadoop-mapreduce spark-sql spark-mllib big-data-analytics hadoop-hdfs

Updated Jan 15, 2024
Java

Ayoub-etoullali / Practical-Activities-Parallel-Processing-BigData

Star

Implementing parallel processing techniques for efficient handling of Big Data through practical activities.

Updated Dec 25, 2023
Java

ICT-BDA / EasyML

Star

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

machine-learning learning-platform big-data-analytics machine-learning-studio machine-learning-platform

Updated Dec 18, 2023
Java

amitkedia007 / Analysis-of-AirBnB-data-Hadoop-Mapreduce

Star

This repo explains the implementation of Map-Reduce Algorithm on the AirBnb data to understand the consumer satisfaction region and country wise. This is the effective use of parallel distributed computing to resolve the big data problems

java hadoop parallel-computing map-reduce hadoop-mapreduce big-data-analytics

Updated Oct 2, 2023
Java

eskimo-sh / eskimo

Star

Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.

Updated Sep 14, 2023
Java

vvittis / FlinkSampling

Star

Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.

java topic stratum apache-flink sampling reservoir-sampling streaming-data big-data-analytics group-by big-data-processing streaming-tuples

Updated Aug 12, 2023
Java

BhagiaSheri / apache-spark-SQL

Star

Big Data Pipeline | Querying Data from Hive Table Phase

spark hive java-8 spark-sql big-data-analytics hive-metastore

Updated Jun 17, 2023
Java

jowilf / big-data-showcase

Star

This repository contains a project showcasing the use of Big Data technologies in processing and visualizing real-time data from an eCommerce electronics store using tools such as Apache Kafka, Spark Streaming, Spark SQL, HBase, and Plotly

kafka hbase spark-streaming spark-sql big-data-analytics plotly-dash

Updated Apr 30, 2023
Java

asilkan-ai / click_event_elasticsearch

Star

Real-time click event project with Elasticsearch

ecommerce big-data bigdata data-visualization big-data-analytics

Updated Apr 7, 2023
Java

grahman20 / ADF

Star

Adaptive Decision Forest(ADF) is an incremental machine learning framework called to produce a decision forest to classify new records. ADF is capable to classify new records even if they are associated with previously unseen classes. ADF also is capable of identifying and handling concept drift; it, however, does not forget previously gained kn…

Updated Mar 24, 2023
Java

ronellsalunke / Titanic-BigData

Star

Java Hadoop MapReduce code for my Big Data Analytics Project using the Titanic dataset

java big-data hadoop titanic-kaggle hadoop-mapreduce big-data-analytics

Updated Feb 25, 2023
Java

mohamedsaleh1984 / twitter-spark

Star

Fetch data from Twitter and push it through Kafka to Spark then HDFS

python kafka hive spark-streaming big-data-analytics big-data-processing

Updated Sep 30, 2022
Java

jamestiotio / dbsys

Sponsor

Star

SUTD 2021 50.043 Database and Big Data Systems Code Dump

Updated May 17, 2022
Java

klugem / watchdog

Star

Workflow management system for the automated and distributed analysis of large-scale experimental data.

bioinformatics bioinformatics-pipeline rna-seq-analysis workflow-management-system cluster-computing big-data-analytics

Updated Apr 25, 2022
Java

yashwanth-eshwarappa / BigData-Covid19-Vaccination-Impact

Star

The objective of the project is to gather, process and analyze publicly available COVID related data acquired from various reliable sources such as CDC and JHU. The application finds strong correlation between the features such as number of cases across different cohorts and how it affects COVID case count in that particular geography, over time…

data-visualization spark-sql big-data-analytics

Updated Apr 13, 2022
Java

Improve this page

Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

big-data-analytics

Here are 55 public repositories matching this topic...

ingef / conquery

Anoushka21 / IgniTech

Dare-marvel / Big-Data-Analytics--BDA--

GMAP / DSPBench

yaoguangluo / ChromosomeDNA

marcocolangelo / Big-Data-processing-and-Analytics

Ayoub-etoullali / Practical-Activities-Parallel-Processing-BigData

ICT-BDA / EasyML

amitkedia007 / Analysis-of-AirBnB-data-Hadoop-Mapreduce

eskimo-sh / eskimo

vvittis / FlinkSampling

BhagiaSheri / apache-spark-SQL

jowilf / big-data-showcase

asilkan-ai / click_event_elasticsearch

grahman20 / ADF

ronellsalunke / Titanic-BigData

mohamedsaleh1984 / twitter-spark

jamestiotio / dbsys

klugem / watchdog

yashwanth-eshwarappa / BigData-Covid19-Vaccination-Impact

Improve this page

Add this topic to your repo