sqoop
Here are 23 public repositories matching this topic...
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
-
Updated
Feb 27, 2023 - Shell
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
-
Updated
Sep 10, 2019 - Shell
Predictive Analysis using Big Data platforms and Machine Learning Libraries
-
Updated
Aug 1, 2016 - Shell
This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.
-
Updated
Jun 26, 2019 - Shell
Created a data pipeline using sqoop to ingest data from sql server into the hive table and used hive for feature engineering and analysis.
-
Updated
Jun 5, 2020 - Shell
Created a utility to import data from traditional databases to hdfs using sqoop and implemented using bash
-
Updated
Jun 11, 2019 - Shell
Improve this page
Add a description, image, and links to the sqoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sqoop topic, visit your repo's landing page and select "manage topics."