Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fake Data Generator #3

Merged
merged 5 commits into from Nov 18, 2021
Merged

Fake Data Generator #3

merged 5 commits into from Nov 18, 2021

Conversation

Jen-Hall
Copy link
Contributor

@Jen-Hall Jen-Hall commented Oct 20, 2021

Generating fake data

Overview and Purpose

The purpose of generate_fake_data.py is to create a .csv file with fake data with the following intended applications:

  • An example of how data needs to be formatted to be passed into the model
  • To ensure the files are being generated correctly to test GUI setup.
  • To test the setup and running of the repo

The purpose of the fake data is NOT too:

  • DO NOT use the model artifacts (.pickle and .state files) generated from the fake data to make predictions which will be used in any real world application.
  • DO NOT use the fake data generated to inform any insights to be applied to a real world setting.
  • DO NOT use to test the performance of the model

The data is generated completely randomly, with each field having random values generated independently of other fields. This generator was created having never been exposed to the real data.

@amadeuspzs amadeuspzs marked this pull request as draft November 4, 2021 13:12
@Jen-Hall Jen-Hall changed the title Data Generator Fake Data Generator Nov 5, 2021
@Jen-Hall Jen-Hall marked this pull request as ready for review November 8, 2021 12:34
Copy link

@amadeuspzs amadeuspzs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work @Jen-Hall; I can confirm running the scripts and deploying works.

README.md Show resolved Hide resolved
fake_data_generation/README.md Show resolved Hide resolved
fake_data_generation/README.md Outdated Show resolved Hide resolved
fake_data_generation/README.md Outdated Show resolved Hide resolved
fake_data_generation/README.md Outdated Show resolved Hide resolved
fake_data_generation/README.md Outdated Show resolved Hide resolved
fake_data_generation/README.md Outdated Show resolved Hide resolved
training/README.md Outdated Show resolved Hide resolved
training/README.md Outdated Show resolved Hide resolved
Copy link

@amadeuspzs amadeuspzs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Closes #5

@Jen-Hall Jen-Hall merged commit 90c524a into main Nov 18, 2021
@Jen-Hall Jen-Hall deleted the dummy_data_generator branch November 18, 2021 09:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants