
Adding evaluation of TURNA-encoder #74

Merged: 3 commits merged into main on May 5, 2024
Conversation

zeynepyirmibes (Contributor)
This PR implements evaluation of fine-tuned TURNA-encoder models. Changes include:

  • Fixed the test_params issue in the evaluation script.
  • Fixed erroneous label post-processing for the classification and STS datasets.
  • Added evaluation of fine-tuned TURNA-encoder models.

gokceuludogan (Contributor) left a comment:

Thanks a lot for your efforts on fixing the TURNA encoder and its evaluation!

I've left a few minor comments, but other than that, I'm happy to approve this PR!

Comment on lines +24 to +29
    try:
        self.encoder = T5EncoderModel.from_pretrained(pretrained_model_name)
    except Exception as e:
        pretrained_model_name = config._name_or_path
        self.encoder = T5EncoderModel.from_pretrained(pretrained_model_name)

gokceuludogan (Contributor):

Do we really need exception handling here? When does the encoder initialization fail with pretrained_model_name?

zeynepyirmibes (Contributor, Author):

This is really bad code, I realize :D but there was a problem with the from_pretrained method: when it calls the __init__ of T5ForSequenceClassification, that argument gets overwritten with the config. I couldn't solve it, so I added this. If I can find a solution, I will fix it! 👍

gokceuludogan (Contributor):

Can we predict if it's going to fail beforehand and update the pretrained_model_name accordingly? In what scenarios do we use the first from_pretrained method compared to the other one?
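A sketch of what such a beforehand check could look like (the helper name and the isinstance check are illustrative assumptions, not code from this PR):

```python
def resolve_model_name(pretrained_model_name, config):
    """Pick a loadable identifier before calling from_pretrained.

    Hypothetical helper: if the from_pretrained machinery overwrote the
    argument with a config object instead of a model-name string, fall
    back to the path recorded on the config.
    """
    if isinstance(pretrained_model_name, str):
        return pretrained_model_name
    return config._name_or_path
```

With a check like this, the encoder could be loaded once, without the try/except fallback.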

Comment on lines 156 to +163
    try:
        return(float(label.strip()))
    except:
        return 0
    try:
        return(float(label))
    except:
        return 0

gokceuludogan (Contributor):

Doesn't float(label.strip()) already cover float(label)?

Did we check the outputs to see whether they contain similarity scores that cannot be immediately cast to float?

zeynepyirmibes (Contributor, Author):

When we use conditional generation, the output is a string, so float(label.strip()) works. When we use classification, the output is already a number, so calling strip() on it naturally raises an error (numbers have no strip method); in that case I try converting it to a float directly.

gokceuludogan (Contributor):

I see. Rather than handling it with exceptions, I suggest checking its type to choose the conversion. It would be much clearer.
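A minimal sketch of that type-based version (the function name and the 0.0 fallback are illustrative, not from the PR):

```python
def label_to_score(label):
    # Branch on type instead of relying on strip() raising for numbers:
    # generation outputs arrive as strings, classification outputs as numbers.
    if isinstance(label, str):
        label = label.strip()
    try:
        return float(label)
    except (TypeError, ValueError):
        return 0.0
```

This handles both cases in a single path and only falls back to 0.0 when the value genuinely cannot be interpreted as a score.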

gokceuludogan added the "bug" label (Something isn't working) on May 1, 2024
zeynepyirmibes merged commit a0e954b into main on May 5, 2024