CSFelix/faker-dataset

📋 Faker Dataset 📋

An amazing library for generating fake data


πŸ“ Problem Description


Fig. 1 - Pandas Library Cartoon. ©Real Python



You have been hired for a Data Science job in which you must explore the Faker Python library to generate a dataset of fake data.

Your dataset file must be named faker-dataset.csv and stored in the dataset folder inside this project's directory. It must contain at least 5 features and 3,000 rows, and every feature's values must be generated using the Faker library.

After generating the data and saving it to a CSV file, you must read the file using the UTF-8 charset and show its first 5 rows to check whether the dataset is ready for other Data Scientists to use.
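The read-back step could look like the sketch below, assuming pandas is used for loading. The helper name `preview_dataset` is hypothetical; `pandas.read_csv` with `encoding="utf-8"` and `DataFrame.head` are the only library calls involved.

```python
from pathlib import Path

import pandas as pd


def preview_dataset(path, n_rows=5):
    """Read a CSV file as UTF-8 and return its first n_rows rows."""
    df = pd.read_csv(path, encoding="utf-8")
    return df.head(n_rows)


if __name__ == "__main__":
    csv_path = Path("dataset") / "faker-dataset.csv"
    if csv_path.exists():
        print(preview_dataset(csv_path))
    else:
        print(f"{csv_path} not found; generate the dataset first")
```

In a Jupyter Notebook, leaving `preview_dataset(csv_path)` as the last expression of a cell renders the rows as a table instead of plain text.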

Good Luck!! 🍀 ☘️


πŸ“ Files Description

faker-dataset.csv - contains at least 3,000 rows of fake data generated by the Faker Python library.
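A quick way to confirm that a generated file satisfies the size requirements is sketched below using only the standard library. The function name `check_dataset` and its return shape are assumptions for illustration; the thresholds come from the problem description (at least 3,000 rows and 5 features).

```python
import csv
from pathlib import Path


def check_dataset(path, min_rows=3000, min_features=5):
    """Return (n_rows, n_features, ok) for a CSV file that has a header row."""
    with Path(path).open(newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, [])          # first record holds the column names
        n_rows = sum(1 for _ in reader)    # remaining records are data rows
    n_features = len(header)
    ok = n_rows >= min_rows and n_features >= min_features
    return n_rows, n_features, ok
```

Counting records through `csv.reader` rather than counting raw lines keeps the row count correct even if a quoted field contains a line break.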


❓ Library Features

region/locale - the person's locale;

name - the person's name;

email - the person's e-mail;

address - the person's address;

license_plate - the license plate of the person's vehicle;

company - the company where the generated person works at;

job - the job occupied by the person;

color_name - the person's favorite color.


You can check out all the possible features here: Faker Library - Standard Providers


βš™οΈ Setup

Tools

Python 3.9 or later;

Jupyter Notebook.


Libraries

Faker;

NumPy, pandas.


🎉 Acknowledgments

Fzaninotto and Faker Library team!


📫 Reach Me

Email: csfelix08@gmail.com

LinkedIn: linkedin.com/in/csfelix/

Portfolio: CSFelix.io

Kaggle: DSFelix
