Skip to content

Commit

Permalink
Anomalous sequence update (#65)
Browse files Browse the repository at this point in the history
* Include modellable_params argument in Model class.

This can be used if you do not want rough heuristics to be used to decide which params have modellable values

* rerun black formatting

* - seperate functions for laplace smoothing
- store intermediate non laplace smoothed counts in model class

* use deepcopy instead of copy to prevent original variable from being edited

* try change the way laplace smoothing is done for param counts

* undo experimental change to laplace smoothing for params

* change how the param probabilities are computed. (changed from categorical mistake to bernoulli)

* update tests based on changes on how to calculate param probabilities

* fix one docstring issue in laplace_smooth module

* Add module for sessionizing data

* handle nans when sessionizing and aggregating

* fix bug in create_session_col function when we have 0 rows in df

* write tests for the new sessionize module

* include notebook showing how to use anomalous_sequence sub package

* add kql magic temp files to gitignore

* add AnomalousSequence rst file to DataAnalysis section

* Some formatting fixes

Co-authored-by: Ian Hellen <ianhelle@microsoft.com>
  • Loading branch information
karishma-dixit and ianhelle committed Jun 6, 2020
1 parent 3cd7293 commit e8cff74
Show file tree
Hide file tree
Showing 21 changed files with 42,248 additions and 167 deletions.
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -115,3 +115,6 @@ morphchart_package/

# Merge conflict files
**.orig

#kql magic temp files
/docs/notebooks/kqlmagic/*
4,563 changes: 4,563 additions & 0 deletions docs/notebooks/AnomalousSequence.ipynb

Large diffs are not rendered by default.

0 comments on commit e8cff74

Please sign in to comment.