Anime_ELT_dashboard

General description

An end-to-end data science project is implemented in this repository. Using data available on the MyAnimeList public website, a web scraping data extraction process is implemented, a data warehouse and a datalake are created in AWS and metrics of interest are displayed in a streamlit dashboard deployed in an EC2 AWS instance.

Project structure

Anime_ETL_Project
├── project_description
│   ├── pipeline_design.py # Python script that draws the project architecture using graphviz.
│   ├── project_stages.txt # Txt file that displays the steps to implement the project.
│   └── verbal_description.txt # Txt file containing a general description of every step in the architecture. 
├── README.md
└── requirements.txt

Project replicability

conda activate env

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
data		data
deployment		deployment
notebooks		notebooks
project_description		project_description
src		src
streamlit_dashboard		streamlit_dashboard
web_scrapping		web_scrapping
.gitignore		.gitignore
01_web_scrapping.py		01_web_scrapping.py
02_transform.py		02_transform.py
AnimeEnv.yaml		AnimeEnv.yaml
README.md		README.md
config.yaml		config.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anime_ELT_dashboard

General description

Project structure

Project replicability

To complete

About

Releases

Packages

Contributors 3

Languages

JuanPalms/Anime_ELT_dashboard

Folders and files

Latest commit

History

Repository files navigation

Anime_ELT_dashboard

General description

Project structure

Project replicability

To complete

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages