Integrate Basic QE feature #144

abhi-agg · 2022-03-07T12:35:39Z

- Modified worker to construct TranslationModel with the appropriate model config for Quality Estimate feature when it is enabled by user - Supported QE feature only for in-page translation and not for Outbound translation

- For in-page translation, the flag is passed at it is - For outbound translation, we don't need QE. So passing it as false and restoring it after translate call is done.

- An api change has made it necessary to specify the size

abhi-agg · 2022-03-11T17:35:34Z

Integrate Basic Quality Estimation #132

Modify extension to encode plain text and send it as HTML to the engine whenever QE feature is on

This is required as engine always expects html text to return QE scores (as per this)

The translation doesn't need to be decoded back as the engine always returns translation as html whenever QE is on and this output can be used directly to show colors

Switch Quality Estimation specific model config off when not in use in order to have better performance #133

Implement a mechanism to show colors for in-page translation from the quality scores returned by the engine

Quality scores are embedded in translation result in form of font attributes

Fixes first 3 parts of #26. The last part is left (i.e. using CSS to show colors for bad scores) which will be taken up the next week.

jelmervdl · 2022-03-11T23:23:23Z

extension/controller/translation/translationWorker.js

@@ -246,7 +288,7 @@ class TranslationHelper {
        // instantiate the Translation Service
        constructTranslationService() {
            if (!this.translationService) {
-                let translationServiceConfig = {};
+                let translationServiceConfig = { cacheSize: 10 };


Just a note: a cache size of 10 won't do you much good. The cache is non-probing, it is basically just cache[hash(sentence) % cacheSize]. The description of the parameter is a bit indirect about that. If you set it too low, you'll end up having too many different sentences hitting the same cache entry, constantly overwriting each other, and no cache benefit at all.

In my experience you'd get about 20% occupancy, so if you set it to 50 you'd be caching about 10 sentences. But from testing in TranslateLocally and my extension fork, I'd suggest starting with something around 1000 or higher, and see whether you can notice it in the memory usage.

Thanks for the tip Jelmer, please keep them coming. I think caching will be particularly helpful after I move the engine to the background script last week. I heard from the security folks that's fine, so I think that will be helpful.

abhi-agg marked this pull request as draft March 7, 2022 12:35

abhi-agg added 9 commits March 11, 2022 18:01

Loading translation engine with appropriate QE config

bb45bb9

- Modified worker to construct TranslationModel with the appropriate model config for Quality Estimate feature when it is enabled by user - Supported QE feature only for in-page translation and not for Outbound translation

Pas quality estimation flag for each text item of translate api

4901ca5

- For in-page translation, the flag is passed at it is - For outbound translation, we don't need QE. So passing it as false and restoring it after translate call is done.

Added cacheSize field in config to construct TranslationService

e5c40cd

- An api change has made it necessary to specify the size

Ran linting

11217d8

Updated bergamot engine to 0.4.2

c51083c

Encode plain text to HTML before sending it to engine when QE is on

f989473

Ran linter

549b4cb

(Discard later) Debugging

4e47bd5

Fixed merge code when QE is on

6436fc7

abhi-agg force-pushed the qe-integration branch from 0a7aefa to 6436fc7 Compare March 11, 2022 17:24

abhi-agg added 2 commits March 11, 2022 18:24

Removed a debug log

dc333bb

Ran linter

5ea4451

abhi-agg marked this pull request as ready for review March 11, 2022 17:30

abhi-agg requested review from andrenatal and eu9ene March 11, 2022 17:35

andrenatal approved these changes Mar 11, 2022

View reviewed changes

andrenatal merged commit c1b62c6 into mozilla:main Mar 11, 2022

This was referenced Mar 11, 2022

Switch Quality Estimation specific model config off when not in use in order to have better performance #133

Closed

Integrate Basic Quality Estimation #132

Closed

Use translation cache #96

Closed

jelmervdl reviewed Mar 11, 2022

View reviewed changes

abhi-agg deleted the qe-integration branch March 12, 2022 20:36

This was linked to issues Mar 12, 2022

Switch Quality Estimation specific model config off when not in use in order to have better performance #133

Closed

Integrate Basic Quality Estimation #132

Closed

Use translation cache #96

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate Basic QE feature #144

Integrate Basic QE feature #144

abhi-agg commented Mar 7, 2022 •

edited

Loading

abhi-agg commented Mar 11, 2022 •

edited by andrenatal

Loading

jelmervdl Mar 11, 2022

andrenatal Mar 12, 2022

Integrate Basic QE feature #144

Integrate Basic QE feature #144

Conversation

abhi-agg commented Mar 7, 2022 • edited Loading

abhi-agg commented Mar 11, 2022 • edited by andrenatal Loading

jelmervdl Mar 11, 2022

Choose a reason for hiding this comment

andrenatal Mar 12, 2022

Choose a reason for hiding this comment

abhi-agg commented Mar 7, 2022 •

edited

Loading

abhi-agg commented Mar 11, 2022 •

edited by andrenatal

Loading