
feat: implement functions to retrieve metrics and examples #687

Merged: 6 commits merged into main from feat-support-metrics-and-examples on Mar 14, 2023

Conversation

guenthermi (Member) commented Mar 7, 2023:

Implement functions to retrieve metrics and examples

  • This PR references an open issue
  • I have added a line about this change to CHANGELOG

@github-actions github-actions bot added size/m and removed size/s labels Mar 7, 2023
@guenthermi guenthermi self-assigned this Mar 7, 2023
@guenthermi guenthermi marked this pull request as ready for review March 7, 2023 13:55
@guenthermi guenthermi requested review from gmastrapas, bwanglzu, LMMilliken and matousek-martin and removed request for gmastrapas March 9, 2023 07:43
bwanglzu (Member) left a comment:


minor comment

finetuner/run.py (Outdated), comment on lines 112 to 121:

```python
def example_results(self) -> Dict:
    """Get the results of example queries from the evaluation data of the
    :class:`Run`.

    :return: dictionary with results before and after fine-tuning.
    """
    self._check_run_status_finished()
    return self._client.get_run_examples(
        experiment_name=self._experiment_name, run_name=self._name
    )
```
Member:

If we could make this function more user-friendly, that would be even better. For example, a two-column comparison; if the result is an image or mesh, try to display it.

guenthermi (Member Author) replied:

Good idea! However, I think sometimes it is good to get them as dictionaries if you want to process them further. For the case where you want to display them directly in the console, I implemented two additional functions, `display_metrics` and `display_examples`. It looks like this:

[two screenshots showing the console tables produced by display_metrics and display_examples]
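For reference, a minimal usage sketch of the new functions (the `finetuner.get_run` retrieval helper and the experiment/run names are assumptions; `metrics()`, `example_results()`, `display_metrics()`, and `display_examples()` are the functions added in this PR):

```python
import finetuner

# Fetch a finished run; the retrieval helper here is an assumption.
run = finetuner.get_run(experiment_name='my-experiment', run_name='my-run')

# Raw results as dictionaries, for further processing.
metrics = run.metrics()           # Dict[str, Dict[str, float]], keyed by stage
examples = run.example_results()  # top-k query results before/after fine-tuning

# Or print formatted tables directly to the console.
run.display_metrics()
run.display_examples()
```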

LMMilliken (Contributor) left a comment:

Two small things

Comment on lines 53 to 55:

```python
for i, match in enumerate(results[query]):
    if i >= k:
        break
```
Contributor:

Suggested change:

```diff
-for i, match in enumerate(results[query]):
-    if i >= k:
-        break
+for i, match in enumerate(results[query][:k]):
```
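An `enumerate` object is not itself sliceable, so the slice must apply to the underlying sequence, or use `itertools.islice` for arbitrary iterators. A small self-contained illustration with made-up data:

```python
from itertools import islice

matches = ['doc_a', 'doc_b', 'doc_c', 'doc_d']
k = 2

# Slice the list first, then enumerate over the shortened sequence.
for i, match in enumerate(matches[:k]):
    print(i, match)   # prints: 0 doc_a, then 1 doc_b

# Equivalent, and also works when the source is a plain iterator.
for i, match in islice(enumerate(matches), k):
    print(i, match)
```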

Comment on lines 74 to 75:

> You can retrieve them with the {func}`~Run.example_results()` function.
> Alternatively, you can use the {func}`~Run.display_metrics()` function to display a table of the Top-K results before and after fine-tuning to the console.
Contributor:

Maybe explain what K is in this context and how to set it

guenthermi (Member Author) replied:

OK, I now mention what Top-K means; how to set k is explained in the developer documentation, which you get to by clicking the link.

> ### Show evaluation metrics
>
> During the fine-tuning process, the evaluation metrics are displayed in the logs, which you can retrieve via the {func}`~Run.logs()` function.
> After running the fine-tuning job has finished, the evaluation metrics can be retrieved from the cloud by calling the {func}`~Run.metrics()` function.
Member:

Suggested change:

```diff
-After running the fine-tuning job has finished, the evaluation metrics can be retrieved from the cloud by calling the {func}`~Run.metrics()` function.
+After the fine-tuning job has finished, the evaluation metrics can be retrieved from the cloud by calling the {func}`~Run.metrics()` function.
```

> If you want to compare the top-k retrieval results before and after fine-tuning, you can set the `gather_examples` parameter of the evaluation callback to `True`.
> In this case, the evaluation callback will store the top-k results for each query document before and after fine-tuning.
> You can retrieve them with the {func}`~Run.example_results()` function.
> Alternatively, you can use the {func}`~Run.display_metrics()` function to display a table of the Top-K results before and after fine-tuning to the console.
Member:

Suggested change:

```diff
-Alternatively, you can use the {func}`~Run.display_metrics()` function to display a table of the Top-K results before and after fine-tuning to the console.
+Alternatively, you can use the {func}`~Run.display_examples()` function to display a table of the Top-K results before and after fine-tuning to the console.
```
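To make the `gather_examples` flow concrete, a minimal sketch of wiring it up (the `EvaluationCallback` import path and its `query_data`/`index_data` parameters follow the public finetuner docs, but treat the exact signature as an assumption):

```python
from finetuner.callback import EvaluationCallback

# Evaluation callback that additionally stores the top-k results per
# query, before and after fine-tuning, so Run.example_results() can
# return them later. Dataset names below are placeholders.
callback = EvaluationCallback(
    query_data='my-query-data',
    index_data='my-index-data',
    gather_examples=True,
)
```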

finetuner/run.py (Outdated):

```diff
@@ -99,6 +99,45 @@ def stream_logs(self, interval: int = 5) -> Iterator[str]:
             experiment_name=self._experiment_name, run_name=self._name
         )
 
+    def metrics(self) -> Dict:
```
Member:

return type hint? `Dict[str, float]`?

guenthermi (Member Author) replied:

No, it's `Dict[str, Dict[str, float]]`, but yes, I can add it.
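A sketch of the annotated method following that reply (the client method name `get_run_metrics` is an assumption by analogy with `get_run_examples`; the docstring wording is likewise illustrative):

```python
from typing import Dict

def metrics(self) -> Dict[str, Dict[str, float]]:
    """Get the evaluation metrics of the :class:`Run`.

    :return: for each stage (e.g. before and after fine-tuning), a mapping
        from metric name to metric value.
    """
    self._check_run_status_finished()
    # Client method name assumed by analogy with `get_run_examples`.
    return self._client.get_run_metrics(
        experiment_name=self._experiment_name, run_name=self._name
    )
```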

finetuner/run.py (Outdated):

```python
        for stage in metrics:
            print_metrics(stage, metrics[stage])

    def example_results(self) -> Dict:
```
Member:

Suggested change:

```diff
-def example_results(self) -> Dict:
+def example_results(self) -> Dict[str, str]:
```

?

github-actions (bot) commented:

📝 Docs are deployed on https://ft-feat-support-metrics-and-examples--jina-docs.netlify.app 🎉

LMMilliken (Contributor) left a comment:

LGTM

bwanglzu (Member) left a comment:

LGTM!

@guenthermi guenthermi merged commit e649785 into main Mar 14, 2023
@guenthermi guenthermi deleted the feat-support-metrics-and-examples branch March 14, 2023 10:22