Time-series Forecast ZenFile #35

lukyrasocha · 2022-06-02T13:52:32Z

Predict electricity power generation from wind forecast using ZenML:

Load data directly from Google Cloud BigQuery
Train a model remotely in Vertex AI

Also contains a thorough description of how to set up and configure a Google Cloud project together with a ZenML stack.
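
For readers skimming this PR, here is a minimal sketch of what the BigQuery loading step could look like. It is not the code from this PR: the function names, the `reader` parameter, and the project/dataset/table arguments are all hypothetical. It assumes `pandas-gbq` (which the commit list mentions as a dependency) and its `read_gbq(query, project_id=...)` call.

```python
from typing import Any, Callable, Optional


def build_query(project: str, dataset: str, table: str) -> str:
    """Build a standard-SQL query that selects every row of one table."""
    return f"SELECT * FROM `{project}.{dataset}.{table}`"


def load_wind_data(
    project: str,
    dataset: str,
    table: str,
    reader: Optional[Callable[..., Any]] = None,
) -> Any:
    """Load the table through `reader`, defaulting to pandas_gbq.read_gbq.

    The reader is injectable so the function can be exercised without
    GCP credentials; this is an illustrative design choice, not the PR's.
    """
    if reader is None:
        # Imported lazily so the module can be used without pandas-gbq installed.
        import pandas_gbq

        reader = pandas_gbq.read_gbq
    return reader(build_query(project, dataset, table), project_id=project)
```

Passing a stub as `reader` lets the step be checked locally before pointing it at a real GCP project.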


lukyrasocha · 2022-06-02T14:11:56Z

The spell check is failing on 'iam', but it is an abbreviation of Identity and Access Management.

htahir1 · 2022-06-02T14:40:37Z

@strickvl any possibility to add iam to the spell check ignore?

strickvl · 2022-06-02T14:46:24Z

> @strickvl any possibility to add iam to the spell check ignore?

I would probably just ignore it for now. We're going to replace codespell spellchecks with pyspelling in the ZenFiles pretty soon.

ayush714

Hi,

First of all, it's an amazing ZenFile that showcases the power of ZenML. I do have some requested changes, but overall I think:

It would be good to add a clear explanation of every step you list in README.md.
I don't think you've added docstrings to your code, so please add comments to every step and pipeline.
I think it would be nice to standardise this a bit with the other ZenFiles, for example in the structure of the project. I think you can organise this project in a better way.
I also feel README.md is a bit empty, so useful explanations would be nice.
To set up this project I need Poetry, but I think you should also offer a traditional setup option, as we did in other ZenFiles, because that is more common. You could also link to instructions on how to install Poetry and use it for installation.

If I think of any other changes, I will add further reviews. Once you are done with the current feedback, you can re-request my review.

Amazing work :)


ayush714 · 2022-06-02T19:53:27Z

time-series-forecast/src/steps/transformer.py

+    X_train=np.ndarray, X_test=np.ndarray, y_train=np.ndarray, y_test=np.ndarray
+):
+    df = data.copy()
+    cardinal_directions = {'N': 0.0,


This looks a bit long. What about moving it into a config file?

I changed the code a bit so it looks better and added a description of how the feature engineering works. In this case I find it clearer if the code stays together instead of using a config file.
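
One way to keep the mapping short without a config file is to generate it. The sketch below is an illustration only, not the code from this PR: it assumes a 16-point compass rose with evenly spaced points (the diff above only shows `'N': 0.0`), and the names `COMPASS_POINTS`, `CARDINAL_DIRECTIONS`, and `encode_directions` are hypothetical.

```python
from typing import Iterable, List

# Assumed 16-point compass rose, clockwise from north.
COMPASS_POINTS = [
    "N", "NNE", "NE", "ENE", "E", "ESE", "SE", "SSE",
    "S", "SSW", "SW", "WSW", "W", "WNW", "NW", "NNW",
]

# Adjacent points are 360 / 16 = 22.5 degrees apart, so 'N' maps to 0.0.
CARDINAL_DIRECTIONS = {
    name: index * 22.5 for index, name in enumerate(COMPASS_POINTS)
}


def encode_directions(labels: Iterable[str]) -> List[float]:
    """Convert compass labels such as 'NE' to degrees.

    Raises KeyError for a label that is not one of the 16 assumed points.
    """
    return [CARDINAL_DIRECTIONS[label] for label in labels]
```

Inside a pandas-based step, the same dictionary could be applied with `df[column].map(CARDINAL_DIRECTIONS)`, where `column` is whichever column holds the wind direction.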


lukyrasocha · 2022-06-08T12:24:48Z

Hey @ayush714, thank you so much for the feedback. I tried to change and adjust all the things you mentioned, but please let me know if you have any additional feedback. Thanks!

ayush714

It looks good from my side. I will wait for others to approve it, maybe @AlexejPenner and @strickvl.

ayush714 · 2022-06-10T12:56:50Z

Hi, I will run your whole code on my system and check that everything works as per your README instructions. I will most probably be done by 14th June.

ayush714

I did another small review and have a comment on the README. I see that you start from scratch, which is good, but what if I already have those things set up and just need to create my stack? In that case it might be confusing for people who already have things set up, so it would be nice to have a general note marking which steps start from scratch.

I will run your code and give more feedback.


lukyrasocha · 2022-06-16T10:31:56Z

In the README I have now separated the steps into two sections: steps on how to set up a GCP project, and steps on how to set up the components for a ZenML stack. That should clear things up. I also added a description of how to upload the data set into BigQuery from the CLI.

ayush714

I ran it on my own system and it seems to be working; I didn't find any issues. It would be helpful if you could add a high-level pipeline diagram so it is easier for people to understand what's going on.

However, I don't feel a strong requirement for that. If you can add it, good; otherwise I am approving it.

htahir1

I love this! Very well explained and a valuable showcase. Thank you so much for the amazing contribution @lukyrasocha !

htahir1 · 2022-06-21T09:09:16Z

OK, I'm merging it now. Amazing contribution, and we'll be happy to release this to the world tomorrow!

lukyrasocha and others added 18 commits June 2, 2022 12:36

Add gitignore to ignore my credentials for gcp

9fa416a

Gitignore my credentials for gcp

b3a0367

Delete old gitignore

8a27269

Add template for README

db36bc2

Need to update dependencies in .toml

9413291

Add working version of the pipeline

f849009

Add a Dockerfile to build a custom image

71f76ad

Used in the step operator

Abstract away the steps of the pipeline

1555ab5

Delete not used packages from main

67203b6

Update README.md

82cbae9

Update README.md

32a1d8a

Update README.md

848042d

Update README.md

107ca35

Data that was uploaded to BQ

be04f20

You can use this data and upload it under your project in GCP

Zenml dependency and pandas gbq

b838f45

Merge branch 'main' of https://github.com/lukyrasocha/zenfiles

7762b61

Update README.md

9480d66

Update README.md

5968658

lukyrasocha changed the title ~~zenfiles/time-series-forecast~~ Time-series-forecast Zenfile Jun 2, 2022

lukyrasocha mentioned this pull request Jun 2, 2022

Task8 - Create a ZenML pipeline downloading data from BigQuery in one step in local stack halvgaard/DataScience#23

Merged

strickvl added the enhancement label Jun 2, 2022

htahir1 requested a review from ayush714 June 2, 2022 14:40

lukyrasocha added 3 commits June 2, 2022 16:56

Merge branch 'zenml-io:main' into main

500c120

Remove comments from transformer.py

140015f

Remove comments from preparator.py

f4980b5

ayush714 suggested changes Jun 2, 2022

Update README from auysh's suggestions

f32584a

Co-authored-by: Ayush Singh <81796368+ayush714@users.noreply.github.com>

lukyrasocha and others added 10 commits June 8, 2022 13:41

Add requirements.txt

4d372d2

Merge branch 'main' of https://github.com/lukyrasocha/zenfiles

36edf0c

Update README.md

e3f6acf

Update README.md

1f9d55e

Add explanatory figures

ff2d5e9

Merge branch 'main' of https://github.com/lukyrasocha/zenfiles

826a959

Add description of the steps

7437f64

Update README.md

87cd492

Update README.md

af451e7

Update README.md

06d2bed

lukyrasocha requested a review from ayush714 June 8, 2022 12:25

ayush714 suggested changes Jun 9, 2022

strickvl changed the title ~~Time-series-forecast Zenfile~~ Time-series Forecast ZenFile Jun 9, 2022

AdamVPro requested a review from ayush714 June 10, 2022 09:13

ayush714 suggested changes Jun 16, 2022

time-series-forecast/requirements.txt (2 outdated comments)

lukyrasocha and others added 3 commits June 16, 2022 12:02

Update time-series-forecast/requirements.txt

58a2e95

Co-authored-by: Ayush Singh <81796368+ayush714@users.noreply.github.com>

Update time-series-forecast/requirements.txt

34e0fc6

Co-authored-by: Ayush Singh <81796368+ayush714@users.noreply.github.com>

Update README

3bafd50

Add a description of how to upload the data set to bigquery from CLI and also separated GCP steps and Zenml steps into two separate sections based on the latest comments

lukyrasocha added 2 commits June 16, 2022 12:47

Update requirements.txt

250795e

Update README.md

1a9154d

ayush714 self-requested a review June 20, 2022 10:55

ayush714 approved these changes Jun 20, 2022

strickvl requested review from strickvl and htahir1 June 21, 2022 08:53

htahir1 approved these changes Jun 21, 2022

htahir1 merged commit 7dc7680 into zenml-io:main Jun 21, 2022
