feature request: save Report.comparisons as JSON #4

PaulLerner · 2022-01-10T12:57:58Z

Hi,

It’d be nice to be able to save a Report comparisons as a JSON file.
However, since it uses frozenset as keys, it is not JSON serializable.

Maybe you could add a method in https://github.com/AmenRa/ranx/blob/master/ranx/frozenset_dict.py to convert the _map to a JSON serializable dict, i.e. with str keys?
The str keys could be converted from the frozenset like: ', '.join(frozenset({'foo', 'bar'}))

The text was updated successfully, but these errors were encountered:

AmenRa · 2022-01-10T13:11:36Z

Hi, an export option for the Report class is already on my to-do list! :)

I will come back with a proposal so that we can discuss it before I implement the functionality.

AmenRa · 2022-01-14T14:49:32Z

Hey, sorry for the delay.

This is my proposal for the Report.to_dict function (I can add a Report.save_as_json function for convenience too):

{
    # metrics and model_names allows to read the report without
    # inspecting the json to discover the used metrics and
    # the compared models
    "metrics": ["metric_1", "metric_2", ...],
    "model_names": ["model_1", "model_2", ...],
    #
    "model_1": {
        "scores": {
            "metric_1": ...,
            "metric_2": ...,
            ...
        },
        "comparisons": {
            "model_2": {
                "metric_1": ...,  # p-value
                "metric_2": ...,  # p-value
                ...
            },
            ...
        },
        "win_tie_loss": {
            "model_2": {
                "W": ...,
                "T": ...,
                "L": ...,
            },
            ...
        },
    },
    ...
}

Let me know what you think. :)

PaulLerner · 2022-01-14T15:56:50Z

Looks great (and there was not so much delay 😅)!

AmenRa · 2022-01-14T16:55:27Z

I added Report.to_dict and Report.save.
I updated ranx on PyPi with these new features.

Closing.

PaulLerner · 2022-01-25T16:16:57Z

I’m getting a "TypeError: Object of type int64 is not JSON serializable" which is probably coming from numba or numpy

AmenRa · 2022-01-25T16:21:16Z

Yeah, I know about that issue. I will look into it soon.

As a workaround, you can call report.to_dict() and save the dictionary as a JSON by yourself with the exact same code I wrote for the report.save function.

That issue it's kinda weird.

PaulLerner · 2022-01-25T16:36:23Z

don’t you need to convert int64 to int in to_dict?

PaulLerner · 2022-01-25T16:40:02Z

for example in transformers they use:

def denumpify_detensorize(metrics):
    """
    Recursively calls `.item()` on the element of the dictionary passed
    """
    if isinstance(metrics, (list, tuple)):
        return type(metrics)(denumpify_detensorize(m) for m in metrics)
    elif isinstance(metrics, dict):
        return type(metrics)({k: denumpify_detensorize(v) for k, v in metrics.items()})
    elif isinstance(metrics, np.generic):
        return metrics.item()
    elif is_torch_available() and isinstance(metrics, torch.Tensor) and metrics.numel() == 1:
        return metrics.item()
    return metrics

PaulLerner · 2022-01-26T14:47:15Z

this fixes it but you probably want to deal with it some other way? If not I can open a PR PaulLerner@7e2218d

AmenRa · 2022-02-01T17:12:07Z

I will look into it soon.

AmenRa · 2022-02-02T12:18:49Z

Fixed in 0.1.10. Sorry for the inconvenience.

PaulLerner · 2022-02-07T14:38:12Z

I’m still getting TypeError: Object of type int64 is not JSON serializable

PaulLerner · 2022-02-07T14:40:51Z

oops, looks like I was on the wrong branch, sorry about that

AmenRa added the enhancement New feature or request label Jan 13, 2022

AmenRa closed this as completed Jan 14, 2022

PaulLerner added a commit to PaulLerner/ranx that referenced this issue Jan 26, 2022

fix: AmenRa#4 make report JSON-serializable

7e2218d

AmenRa reopened this Feb 1, 2022

AmenRa closed this as completed Feb 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature request: save Report.comparisons as JSON #4

feature request: save Report.comparisons as JSON #4

PaulLerner commented Jan 10, 2022

AmenRa commented Jan 10, 2022

AmenRa commented Jan 14, 2022

PaulLerner commented Jan 14, 2022

AmenRa commented Jan 14, 2022

PaulLerner commented Jan 25, 2022

AmenRa commented Jan 25, 2022

PaulLerner commented Jan 25, 2022

PaulLerner commented Jan 25, 2022

PaulLerner commented Jan 26, 2022

AmenRa commented Feb 1, 2022

AmenRa commented Feb 2, 2022

PaulLerner commented Feb 7, 2022

PaulLerner commented Feb 7, 2022

feature request: save Report.comparisons as JSON #4

feature request: save Report.comparisons as JSON #4

Comments

PaulLerner commented Jan 10, 2022

AmenRa commented Jan 10, 2022

AmenRa commented Jan 14, 2022

PaulLerner commented Jan 14, 2022

AmenRa commented Jan 14, 2022

PaulLerner commented Jan 25, 2022

AmenRa commented Jan 25, 2022

PaulLerner commented Jan 25, 2022

PaulLerner commented Jan 25, 2022

PaulLerner commented Jan 26, 2022

AmenRa commented Feb 1, 2022

AmenRa commented Feb 2, 2022

PaulLerner commented Feb 7, 2022

PaulLerner commented Feb 7, 2022