This repository is going to update based on my challenges in installing and using the Hadoop's tools Spark
-
Updated
Mar 29, 2020
This repository is going to update based on my challenges in installing and using the Hadoop's tools Spark
[Work in progress] Client library for simplified access to Apache Accumulo
This project focuses on analyzing movie data using Pyspark tailored for efficient data processing on Hadoop Distributed File System (HDFS)
Processing and transforming data via Hadoop Ecosystem
Getting tweets using Flume service and analyzing tweets
資料平行批次與串流處理以及搭建機器學習環境會用到的container
[BigData] one year weblog analysis using PIG
Helm chart for Apache Knox
Ambiente com o objetivo de praticar o uso das ferramentas Ansible e Hadoop usando uma única instância
Mapreduce program developed in Java for analyzing movie dataset
Learn and implement the Hadoop Ecosystem to drive Big Data Analytics.
Apache Hadoop Components Installation Guide on Windows
Some basic procedures for parallel computing in the Hadoop environment
Learning Spark 2 on Cloudera, programming with scala 2.10.
Add a description, image, and links to the hadoop-ecosystem topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-ecosystem topic, visit your repo's landing page and select "manage topics."