Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added a few eda plotting functions #373

Closed
wants to merge 4 commits into from
Closed

added a few eda plotting functions #373

wants to merge 4 commits into from

Conversation

Ariel77
Copy link
Contributor

@Ariel77 Ariel77 commented Feb 27, 2021

eda 5 plotting functions

Description

Please include a summary of the change and which issue is fixed.

  1. time series heat map
  2. correlcation heatmap
  3. Year over year outcome vs event
  4. Dual axis time series ploot
  5. a wrap grid chart for quick glance of selected features

Fixes # (issue)

Type of change

Please delete options that are not relevant.

  • New feature

How Has This Been Tested?

Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests.
manual tests

eda plotting functions
@Ariel77 Ariel77 added the new idea / feature request New feature or request label Feb 27, 2021
@Ariel77 Ariel77 self-assigned this Feb 27, 2021
@CLAassistant
Copy link

CLAassistant commented Feb 27, 2021

CLA assistant check
All committers have signed the CLA.

@Ariel77 Ariel77 added the WIP Someone is aware of this and working in progress label Feb 27, 2021
@edwinnglabs edwinnglabs added this to In Progress in Version 1.0.13 Feb 27, 2021
@wangzhishi
Copy link
Contributor

thanks @Ariel77 for this. could you please add an example notebook so that it is easier to review and understand what each plotting utility is doing? in orbit, we have many example data sets. if you want, you could use iclaims data in your example.

Add weekly_trend_decomposition function to plot weekly trend, seasonality and residual.
'''
df_dt = deepcopy(df)
df_dt.index = pd.to_datetime(df_dt[date_col])
res_weely = seasonal_decompose(df_dt[[var]], period=7, model='multiplicative')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am trying to generate some EDA plots so looking at this code to borrow some ideas. A quick question, where do we get this seasonal_decompose(). Thanks!

@wangzhishi
Copy link
Contributor

@Ariel77 @xiaoyangzhou thanks for the new commits. still, could we have one notebook to demo these functions? it can help review the code.

@xiaoyangzhou
Copy link
Collaborator

@Ariel77 @xiaoyangzhou thanks for the new commits. still, could we have one notebook to demo these functions? it can help review the code.

Sure. @Ariel77 I can work on the demo notebook. Let me know if you already create one.

@Ariel77
Copy link
Contributor Author

Ariel77 commented Mar 11, 2021 via email

1. eda_plot.py: using seasonal_decompose function from statsmodels.tsa.seasonal.
2. eda_function_demo.ipynb: used iclaims_example.csv to generate eda plots.
@wangzhishi
Copy link
Contributor

addressed in a separate PR

@wangzhishi wangzhishi closed this Mar 31, 2021
@wangzhishi wangzhishi deleted the ariel_branch branch March 31, 2021 05:06
@edwinnglabs edwinnglabs removed this from In Progress in Version 1.0.13 Apr 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new idea / feature request New feature or request WIP Someone is aware of this and working in progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants