Add support for number_to_word test #352

alytarik · 2023-04-27T09:40:06Z

Integrate the following module in nlptest for NER and text classification: https://github.com/GEM-benchmark/NL-Augmenter/tree/main/nlaugmenter/transformations/number-to-word

It will fall under the Robustness category

Make sure to watch out for changes in Span start and end indexes when swapping words
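
A minimal sketch of the span bookkeeping this implies for NER, assuming spans are `(start, end, label)` tuples with an exclusive end and that a number never straddles a span boundary. The helper name `swap_numbers` and the `to_words` callback are hypothetical and not part of nlptest:

```python
import re
from typing import Callable, List, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, label); assumed layout


def swap_numbers(
    text: str, spans: List[Span], to_words: Callable[[str], str]
) -> Tuple[str, List[Span]]:
    """Replace digit runs in `text` and shift entity spans to match."""
    edits = []   # (match_start, match_end, length_delta) per replaced number
    pieces = []  # chunks of the rewritten text
    cursor = 0
    for m in re.finditer(r"\d+", text):
        replacement = to_words(m.group())
        pieces.append(text[cursor:m.start()])
        pieces.append(replacement)
        cursor = m.end()
        edits.append((m.start(), m.end(), len(replacement) - len(m.group())))
    pieces.append(text[cursor:])
    new_text = "".join(pieces)

    new_spans = []
    for start, end, label in spans:
        # Numbers that end at or before the span start move the whole span.
        new_start = start + sum(d for _, me, d in edits if me <= start)
        # Numbers that begin before the span end (before or inside it) move the end.
        new_end = end + sum(d for ms, _, d in edits if ms < end)
        new_spans.append((new_start, new_end, label))
    return new_text, new_spans


# Stand-in converter for the example only; a real test would call a number-to-words library.
demo_words = {"150": "one hundred and fifty", "12": "twelve"}.get

text = "Virat Kohli hits 150 in Mumbai"
spans = [(0, 11, "PER"), (24, 30, "LOC")]
new_text, new_spans = swap_numbers(text, spans, demo_words)
for start, end, label in new_spans:
    print(label, repr(new_text[start:end]))
```

Recomputing each span from the list of length deltas, rather than mutating offsets in place, keeps the result independent of the order in which numbers are replaced.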

RakshitKhajuria · 2023-05-02T11:07:39Z

The module referenced in the issue (https://github.com/GEM-benchmark/NL-Augmenter/tree/main/nlaugmenter/transformations/number-to-word) uses the inflect library, which generates English words and phrases from numerical input. However, it does not handle every test case.

For text classification, for example, it works with inputs like these:
Input : Virat Kohli hits 150 in world cup.
Expected Output : Virat Kohli hits one hundred and fifty in world cup.
Output : Virat Kohli hits one hundred and fifty in world cup.

Input : My brother is 12 years old
Expected Output : My brother is twelve years old
Output : My brother is twelve years old

It fails on inputs like these:
Input : "The price of the product is $10"
Expected Output : "The price of the product is ten dollars"
Output : The price of the product is $10
(It gives the expected output when there is a space between $ and 10.)

Input : "The price of the product is $10.99"
Expected Output : "The price of the product is ten dollars and ninety nine cents"
Output : The price of the product is $10.99

So the inflect-based approach can fail in some cases, for example when the input contains decimal numbers or when there is no space between the number and the adjacent text.
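
One possible way to cover the two failing inputs above is a regex pre-pass that expands dollar amounts before the general number swap runs. This is only a sketch under assumptions: `expand_dollars` and `small_number_to_words` are hypothetical names, the converter is a stdlib-only stand-in for inflect that covers 0 to 999, and only `$` amounts with zero or two decimal places are handled:

```python
import re

ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def small_number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..999 (illustrative stand-in)."""
    if not 0 <= n <= 999:
        raise ValueError("only 0..999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] if ones == 0 else f"{TENS[tens]} {ONES[ones]}"
    hundreds, rest = divmod(n, 100)
    head = f"{ONES[hundreds]} hundred"
    return head if rest == 0 else f"{head} and {small_number_to_words(rest)}"


# "$" glued to 1-3 digits, optionally followed by exactly two decimal digits.
# The trailing lookahead rejects longer numbers such as $1000 or $10.999.
DOLLARS = re.compile(r"\$(\d{1,3})(?:\.(\d{2}))?(?!\.?\d)")


def expand_dollars(text: str) -> str:
    """Rewrite amounts like $10 or $10.99 as words; leave everything else alone."""
    def repl(m: re.Match) -> str:
        dollars = int(m.group(1))
        out = f"{small_number_to_words(dollars)} {'dollar' if dollars == 1 else 'dollars'}"
        if m.group(2) is not None:
            cents = int(m.group(2))
            out += f" and {small_number_to_words(cents)} {'cent' if cents == 1 else 'cents'}"
        return out
    return DOLLARS.sub(repl, text)


print(expand_dollars("The price of the product is $10"))
print(expand_dollars("The price of the product is $10.99"))
```

Amounts outside the supported pattern (for example `$1000` or `$10.5`) are left unchanged rather than partially converted, so the pre-pass never produces a half-rewritten number.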

luca-martial · 2023-05-02T11:16:43Z

Thanks for the detailed comment @Ryzxxl - @alytarik can you check to see if a custom implementation of inflect is worth it (effort vs impact)? If we can get rid of that dependency and transform those edge cases (numbers with special characters) then it will be worth implementing it ourselves.

For now, @Ryzxxl you can ignore this issue since it does not really affect the context in which we run tests

alytarik added the 🔔 Good First Issue and ⭐ Feature labels Apr 27, 2023

luca-martial linked a pull request May 4, 2023 that will close this issue

Feature: Add support for number to words robustness test #377

Merged

4 tasks

luca-martial assigned RakshitKhajuria May 12, 2023

luca-martial closed this as completed May 12, 2023

luca-martial linked a pull request May 12, 2023 that will close this issue

added number_to_words test to robustness nb #410

Merged

4 tasks
