New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Example with financial data #195

Merged

gcattan merged 70 commits into pyRiemann:main from gcattan:main

Nov 14, 2023

Collaborator

gcattan commented Oct 18, 2023 •

edited

Loading

This is an example based on a patent application exploiting RG+quantum for detecting fraudulent behavior.


          Slim vector (#32)

09ee61d

* add dependence to imbalanced_learn
add example with financial data

* print score
remove dead code

* add patent application number

* [pre-commit.ci] auto fixes from pre-commit.com hooks

* Update financial_data.py

* Update financial_data.py

Co-authored-by: fbarroso24 <fbarroso24@gmail.com>

---------

Co-authored-by: Gregoire Cattan <gregoire.cattan@ibm.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: fbarroso24 <fbarroso24@gmail.com>

gcattan changed the title ~~Slim vector (#32)~~ Example with financial data

gcattan added 3 commits

October 18, 2023 15:41


          Update Dockerfile

16fbb5a


          Update financial_data.py

0b5d02a


          Update Dockerfile

249f6c8

gcattan requested a review from qbarthelemy

October 18, 2023 13:53

gcattan marked this pull request as ready for review

October 18, 2023 13:55

qbarthelemy requested changes

View reviewed changes

Member

qbarthelemy left a comment

It's really great to have a real example on another type of data than biosignals!

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated

+              )
+              ##############################################################################
+              # Run evaluation

Member

qbarthelemy Oct 18, 2023

What are the non-quantum state-of-the-art methods for detecting financial fraud?
It would be good to add them in the comparison.

Collaborator Author

gcattan Oct 19, 2023

Good idea! May be some decision tree/random forst. @fbarroso24 what do you think?

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/financial_data.py Outdated

+                  pipe,
+                  param_grid={
+                      "toepochs__n": [10, 20],
+                      "xdawncovariances__nfilter": [1, 2],

Member

qbarthelemy Oct 18, 2023

You should test higher values for nfilter.
What is the number of features?

Collaborator Author

gcattan Oct 19, 2023

Only three in this example. We could add more features, but the simulation time is quite long.

Member

qbarthelemy Oct 19, 2023

With more features, classical pipeline would perform much better.
It seems unfair that time issues linked to quantum pipeline hamper the performance of classical one.

Collaborator Author

gcattan Oct 19, 2023

Makes sense. I can try, and find a compromise for the pipeline afterwhat.

examples/other_datasets/financial_data.py Outdated Show resolved Hide resolved

gcattan and others added 11 commits

October 19, 2023 12:39


          Update examples/other_datasets/financial_data.py

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/financial_data.py

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          - rename file to run on Ci

d485d8d

- improve description
- add train/test split


          [pre-commit.ci] auto fixes from pre-commit.com hooks

d1faa49


          - print gridsearch results

8cf752b

- missing propagation of X_train/X_test changes


          change location of a comment

bfad074


          plot a sample of the epochs

8b314cc


          let's try to plot waveforms

f99eff8


          [pre-commit.ci] auto fixes from pre-commit.com hooks

f72d31b


          - transpose missing

e51e7c8

- display best csv_results without pandas


          correct warning in doc building

5072a13

qbarthelemy reviewed

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

qbarthelemy reviewed

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

qbarthelemy reviewed

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated

+              score_qsvm = gs.best_estimator_.fit(X_train, y_train).score(X_test, y_test)
+              # Print the results
+              print(f"Classical: {score_svm} \nQuantum  : {score_qsvm}")

Member

qbarthelemy Oct 19, 2023

Quantum pipeline gives a binary classification score of 0,5.
Flipping a coin would do the same thing... unless I missed something.

Collaborator Author

gcattan Oct 19, 2023

No, this is weird.
In the first version, it was 100%... but with the same data.
I need to investigate this.

qbarthelemy reviewed

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

gcattan and others added 4 commits

October 19, 2023 21:30


          standardscaler

0d1fd19


          [pre-commit.ci] auto fixes from pre-commit.com hooks

011f5ec


          small modification


          fix doc building

8e59ed1

qbarthelemy reviewed

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

gcattan added 2 commits

October 21, 2023 20:39


          test standardscaler fix on CI

f97e437


          add randomforest

6042b80

gcattan added 10 commits

October 28, 2023 22:23


          Plot ERPs, change some variables.

1b6dda0


          move ERP plotting to another location

448b720


          improve display

9400bea


          improve graphics

9e83112


          typo

a7a26c6


          - Try Tomeklinks

29160ce

- switch to halvinggridsearchcv


          Select SALDO_ANTES_PRESTAMO

14243c9


          use balanced accuracy

f5394d8


          change balance ratio

496f3db


          small clean-up

50a0c90

Collaborator Author

gcattan commented Nov 12, 2023

@qbarthelemy I made another pass on the example. There are mainly two changes:

I keep the NearMiss for computational reasons but increased the number of non-fraud example (that was also one of your remarks at the beginning if I remember correctly)
I changed the pb to "predict the type of fraud" rather than predict if the transaction is a fraud.

I also gave a tried to the halving grid search. It is quicker, but may be less accurate than the standard grid search.

gcattan and others added 6 commits

November 12, 2023 15:24


          Merge branch 'main' into main

77af437


          [pre-commit.ci] auto fixes from pre-commit.com hooks

e449957


          lint

8b78f8a


          [pre-commit.ci] auto fixes from pre-commit.com hooks

62dae07


          fix seaborn version

65ea77d


          fix Dockerfile

ad94160

qbarthelemy approved these changes

View reviewed changes

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

examples/other_datasets/plot_financial_data.py Outdated Show resolved Hide resolved

gcattan and others added 10 commits

November 14, 2023 10:48


          Update examples/other_datasets/plot_financial_data.py

4d866a2

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

d4bd2fa

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

e052e92

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

73f8004

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

517d6b1

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

216dcbd

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

ed9ec59

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

585d026

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          Update examples/other_datasets/plot_financial_data.py

e1810bf

Co-authored-by: Quentin Barthélemy <q.barthelemy@gmail.com>


          [pre-commit.ci] auto fixes from pre-commit.com hooks

eb048b4

gcattan merged commit 2da3069 into pyRiemann:main

11 checks passed

qbarthelemy mentioned this pull request

Remove warnings in financial example #208

Merged

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet