
[ENH] implement efficient _evaluate_by_index for forecast performance metrics (towards #4304): implementation for GeometricMeanAbsoluteError #6244

Open · wants to merge 21 commits into main
Conversation

@KaustubhUp025 (Contributor) commented Apr 1, 2024

Reference Issues/PRs

Towards #4304

What does this implement/fix? Explain your changes.

Implements an efficient _evaluate_by_index for this performance metric, taking #4302 as reference.

Does your contribution introduce a new dependency? If yes, which one?

No

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

No

Any other comments?

PR checklist

For all contributions
  • I've added myself to the list of contributors with any new badges I've earned :-)
    How to: add yourself to the all-contributors file in the sktime root directory (not CONTRIBUTORS.md). Common badges: code - fixing a bug, or adding code logic. doc - writing or improving documentation or docstrings. bug - reporting or diagnosing a bug (get this plus code if you also fixed the bug in the PR). maintenance - CI, test framework, release.
    See here for full badge reference
  • Optionally, for added estimators: I've added myself, and possibly others, to the maintainers tag - do this if you want to become the owner or maintainer of an estimator you added.
    See here for further details on the algorithm maintainer role.
  • [ 👍] The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.
For new estimators
  • I've added the estimator to the API reference - in docs/source/api_reference/taskname.rst, follow the pattern.
  • I've added one or more illustrative usage examples to the docstring, in a pydocstyle compliant Examples section.
  • If the estimator relies on a soft dependency, I've set the python_dependencies tag and ensured
    dependency isolation, see the estimator dependencies guide.

@fkiraly (Collaborator) left a comment

Thanks!

I think there has been a mistake, you copied the mean absolute error (MAE)?
This metric is supposed to be GMAE, geometric mean absolute error.

Btw, it would also be nice if you provided the formula in the docstring, like in MeanAbsoluteError; that helps avoid math errors and makes the review easier.

@fkiraly added labels on Apr 1, 2024: enhancement (Adding new functionality), module:metrics&benchmarking (metrics and benchmarking modules)
@KaustubhUp025 (Contributor, Author)

Thank you @fkiraly, I will check it.

…MeanAbsoluteError with documentation as well
@KaustubhUp025 (Contributor, Author)

@fkiraly I have added the changes that I missed earlier.
Really sorry, I didn't check my code before making the PR the first time.
Will definitely work on this.

@fkiraly (Collaborator) left a comment

Thanks, this looks correct now!

Before we merge, can we discuss numerical stability?
If we implement the formula directly, we use np.prod which can lead to a numerical explosion.

Would it be better to use the exp/log representation instead?
That is, using that geometric averages are arithmetic averages of logarithms, then exp?

For discussion - we could test with a vector of length 100, with value 1000. My conjecture is that the current formula produces nans, while exp/log will be fine.

@KaustubhUp025 (Contributor, Author)

@fkiraly Thank you, I will give it a check and will let you know.

@KaustubhUp025 (Contributor, Author)

@fkiraly I checked your conjecture, and yes, you are right, for a vector of length 100 with value 1000.

I implemented the following functions to check it:

import numpy as np

# Function to compute GMAE using the direct product approach
def gmae_direct(y_true, y_pred):
    errors = np.abs(y_true - y_pred)
    product_of_errors = np.prod(errors)
    gmae = np.power(product_of_errors, 1 / len(errors))
    return gmae

And:

# Function to compute GMAE using the log/exp representation
def gmae_log_exp(y_true, y_pred, epsilon=1e-10):
    errors = np.abs(y_true - y_pred)
    errors = np.maximum(errors, epsilon)  # ensure errors are strictly positive
    log_errors = np.log(errors)
    log_mean = np.mean(log_errors)
    gmae = np.exp(log_mean)
    return gmae

And this was the result I got:

GMAE using direct product approach: 0.0
GMAE using log/exp representation approach: 9.999999999999996e-11

@fkiraly (Collaborator) commented Apr 1, 2024

hm, something is not right here - the result should be 1000

@KaustubhUp025 (Contributor, Author) commented Apr 1, 2024

Okay, I will check it.
Is there any problem with the given function definition?

@fkiraly (Collaborator) commented Apr 1, 2024

I see - I meant when the errors are a vector of 1000s. To create that situation, you could make y_true a vector of 1s, and y_pred a vector of 1001s.

@KaustubhUp025 (Contributor, Author)

@fkiraly I gave the input as:

test_length = 100
y_true = np.ones(test_length)
y_pred = np.ones(test_length) * 1001

and I got this as the result:

GMAE using direct product approach: 1000.0000000000001

GMAE using log/exp representation approach: 999.9999999999989

@fkiraly (Collaborator) commented Apr 1, 2024

hm, is direct more accurate? Surprising.

How about you make the errors smaller, e.g., 1e-10, and see what happens?

@KaustubhUp025 (Contributor, Author)

@fkiraly So I used the given input values, and here is the output:

GMAE using direct product approach: 0.0
GMAE using log/exp representation approach: 1.0000000827403598e-10
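
The intermediate product explains the difference: (1e-10)^100 = 1e-1000 underflows float64 to exactly 0.0, while the log/exp route never forms the product. A minimal check (illustrative sketch, not part of the PR code):

```python
import numpy as np

errors = np.full(100, 1e-10)

# Direct route: the running product underflows float64 (smallest subnormal
# ~4.9e-324) to exactly 0.0, and the 100th root of 0.0 is 0.0,
# so the true GMAE of 1e-10 is lost entirely.
direct = np.power(np.prod(errors), 1 / len(errors))

# log/exp route: the sum of logs stays in a comfortable range
# (here 100 * log(1e-10) ~ -2302), so nothing underflows.
stable = np.exp(np.mean(np.log(errors)))

print(direct)  # 0.0
print(stable)  # ~1e-10
```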

@fkiraly (Collaborator) commented Apr 1, 2024

yes, seems more stable for extreme values. Shall we go with the log/exp approach then?

@KaustubhUp025 (Contributor, Author)

Yes I have written the code for that as well. I will just add it.

…MAE.

This modification ensures that the errors are strictly positive before taking their logarithm, improving the numerical stability of the calculation.
@fkiraly (Collaborator) left a comment

Thanks! This is great!

We're not quite done yet; I think we should clarify what evaluate and evaluate_by_index are doing. evaluate computes the overall metric (the geometric mean), while evaluate_by_index produces the contribution of each time index to the final error metric. If the overall metric is not a mean, it is supposed to produce jackknife pseudo-values.

So, your implementation of _evaluate_by_index seems like it should be in _evaluate, and you still need to work out a fast algorithm for the pseudo-values.

To help you, I've done the same for MSE/RMSE, have a look at the RMSE part (if squared=True), you can take this PR as a template: #6248
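
For reference, the jackknife pseudo-value of observation i is p_i = n * theta - (n - 1) * theta_{-i}, where theta is the metric on the full sample and theta_{-i} is the metric with index i removed. A naive O(n^2) sketch of that definition (function names are illustrative, not from the PR):

```python
import numpy as np

def gmae(errors):
    # stable geometric mean via the log/exp representation
    return np.exp(np.mean(np.log(errors)))

def gmae_pseudo_values_naive(y_true, y_pred):
    # p_i = n * theta - (n - 1) * theta_{-i}: a leave-one-out loop,
    # O(n^2), but useful as a reference to test a fast version against.
    errors = np.abs(y_true - y_pred)
    n = len(errors)
    theta = gmae(errors)
    return np.array(
        [n * theta - (n - 1) * gmae(np.delete(errors, i)) for i in range(n)]
    )
```

Note that averaging the pseudo-values reproduces theta exactly only for linear statistics such as the arithmetic mean; for GMAE they are an estimate of each point's contribution.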

@KaustubhUp025 (Contributor, Author) commented Apr 1, 2024

Thank you @fkiraly, I will work on the same.
Could you give a reference for where I could find a fast algorithm for the pseudo-values?

@fkiraly (Collaborator) commented Apr 1, 2024

Could you give a reference for where I could find a fast algorithm for the pseudo-values?

You have to work it out, but it's almost the same as in #6248:

  • compute the sum of logarithms first
  • subtract individual logarithms, divide by n-1
  • take the exp to get GMAE for all samples minus one point
  • substitute in the pseudovalue formula
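
Putting those steps into code, one possible vectorized sketch (my reading of the steps above, with an epsilon clamp as in the earlier snippet; not the PR's actual implementation):

```python
import numpy as np

def gmae_pseudo_values_fast(y_true, y_pred, epsilon=1e-10):
    # O(n) jackknife pseudo-values for GMAE via the log-sum trick.
    errors = np.maximum(np.abs(y_true - y_pred), epsilon)
    n = len(errors)
    log_errors = np.log(errors)
    log_sum = np.sum(log_errors)                    # 1. sum of logarithms
    theta = np.exp(log_sum / n)                     #    full-sample GMAE
    loo = np.exp((log_sum - log_errors) / (n - 1))  # 2.+3. leave-one-out GMAEs
    return n * theta - (n - 1) * loo                # 4. pseudo-value formula
```

This is O(n) rather than O(n^2), since every leave-one-out geometric mean reuses the single log_sum instead of recomputing a product per index.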

@KaustubhUp025 (Contributor, Author) commented Apr 1, 2024

@fkiraly yes, I used a similar method in my code; I will share it as soon as possible.

… efficient _evaluate_by_index for GMAE.

Additional methods added:
_compute_pseudo_values: Computes the jackknife pseudo-values for the Geometric Mean Absolute Error (GMAE) metric, estimating the influence of each observation on the overall metric.

_evaluate: Evaluates the GMAE metric on given inputs, providing the overall metric value. This method is the core logic called from the evaluate method and computes the arithmetic mean over time points by default.
@KaustubhUp025 (Contributor, Author)

@fkiraly, does the code need any modification now?

@fkiraly (Collaborator) commented Apr 4, 2024

Can you kindly make sure it passes the code formatting test?
Guide: https://www.sktime.net/en/stable/developer_guide/coding_standards.html

@fkiraly (Collaborator) left a comment

Thanks for your contribution!

This does not look correct though:

  • the pseudo-values should be returned in _evaluate_by_index, not _evaluate
  • _evaluate_by_index should return a pandas object with the same index as y_true

@KaustubhUp025 (Contributor, Author)

@fkiraly thanks for this. I will check it and then provide the corrected code.
