Database Normalization

I take a dataset contained in a CSV file (called movies.csv), of movie information, clean it and turn it into a nice, normalized set of tables.

Tables

Initial movie table

Normalized set of tables

Steps for normalization

- [X] Remove special characters from columns **movies** & **year**
- [X] Set null values in empty rows
- [X] Trim spaces and remove newlines from columns
- [X] Remove multivalues
- [X] Remove duplicate values
- [X] Find Functional Dependencies
- [X] Decompose Tables
- [X] Set surrogate keys 
- [X] Check for lossless joins

Functions & Techniques

aggregate functions
window functions
views
joins
unions
unnest()
replace()
substring()
trim()
nullif()
regexp_replace()
left()
right()
string_to_array()
cast()

Getting started

Clone repository

	$ git clone https://github.com/AposLaz/POSTGRESQL_NORMALIZATION.git
		
	$ cd POSTGRESQL_NORMALIZATION

	# Remove current origin repo
	$ git remote remove origin

Docker

	$ docker-compose up

	#then you have to configure pg_admin
	$ localhost:5050
	$ username: admin@admin.com
	$ password: root

	#server
	$ host: pg_container
	$ username: root
	$ password: root

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
images		images
movies_data		movies_data
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Database Normalization

Tables

Steps for normalization

Functions & Techniques

Getting started

About

Releases

Packages

Languages

AposLaz/POSTGRESQL_NORMALIZATION

Folders and files

Latest commit

History

Repository files navigation

Database Normalization

Tables

Steps for normalization

Functions & Techniques

Getting started

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages