Big Data Handling

Image source: the Datashader organization.

Objective & Purpose

This project is a tutorial on big data handling. Its purpose is to show how to work with big data efficiently and elegantly using common data science techniques.

Techniques

This project uses Bash and Python in the Jupyter Notebook interface to complete multiple tasks, including data acquisition, data cleaning, data analysis, and data visualization.
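As a rough illustration of the kind of technique involved (a generic sketch, not code taken from index.ipynb), a file too large to load at once can be streamed row by row so that memory use stays bounded. The function name mean_by_key, the column names, and the sample rows below are all hypothetical.

```python
import csv
import io
from collections import defaultdict


def mean_by_key(rows, key, value):
    """Stream rows once and return the mean of `value` for each `key`.

    Only running sums and counts are kept, so memory use depends on the
    number of distinct keys, not on the number of rows.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        raw = (row.get(value) or "").strip()
        if not raw:
            continue  # cleaning step: skip rows with a missing value
        try:
            number = float(raw)
        except ValueError:
            continue  # cleaning step: skip rows that are not numeric
        sums[row[key]] += number
        counts[row[key]] += 1
    return {k: sums[k] / counts[k] for k in sums}


# Hypothetical sample standing in for a large file. With real data, pass
# csv.DictReader(open(path, newline="")) instead of the in-memory sample.
sample = io.StringIO(
    "city,fare\n"
    "A,10.0\n"
    "B,4.0\n"
    "A,\n"
    "A,20.0\n"
    "B,n/a\n"
)
result = mean_by_key(csv.DictReader(sample), key="city", value="fare")
print(result)
```

Because csv.DictReader yields one row at a time, the same function works unchanged on a file of any size; only the per-key totals are held in memory.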

How to run it locally

Clone this repo to your local machine.
Build the environment from the environment.yml file (for example, by running conda env create -f environment.yml).
Open index.ipynb and run the code.

How to run it remotely

Click the badge below. It takes you to a virtual environment launched through BinderHub, where you can run the code in index.ipynb.
