[RFC][11.0] Translation of Imported Data (ICD-10, etc.) #136

lasley · 2016-12-05T17:22:04Z

The current plan in #134 is to outright remove the giant data files that we have in oe_medical_emr_data in favor of import systems for dynamic update of data. These XML files are more than 10 MiB each & are updated yearly, so this is essentially the only way to go about handling these processes in a forward-efficient manor.

This brings up an interesting problem that I will not be solving in that PR - translations.

Most of the code data files are available in other languages, so obtaining translations for the data itself won't be an issue. What will though is when two languages come into play for the same datasets. For example:

English ICD-10-CM is imported
French ICD-10-CM is needed

In order to import the French ICD-10 data, the English data would either need to be replaced or duplicated if using standard record creates/writes. This is because records are naive of languages.

My only idea is to convert all translatable text in imports into a unique identifier string for the field (uuid-4 or something). The import would then create/update a record of the non-translatable data + all of the identifier strings. It would then add the actual text into ir.translation so that the identifiers are translated by the system.

The disadvantage here is that translations essentially become impossible to maintain manually, although I'm not sure if this was an option anyways given the size of the data. There will also likely be data duplications due to the identifier system not taking into account word lemmas.

I think this disadvantage is negligible though and outweighed the advantage of not storing and maintaining these giant XML files in source control.

I'm wondering if anyone has some ideas or strategies for the translations that I'm not thinking of?

The text was updated successfully, but these errors were encountered:

github-actions · 2022-11-06T12:19:02Z

There hasn't been any activity on this issue in the past 6 months, so it has been marked as stale and it will be closed automatically if no further activity occurs in the next 30 days.
If you want this issue to never become stale, please ask a PSC member to apply the "no stale" label.

lasley added enhancement question labels Dec 5, 2016

lasley added this to the 11.0 milestone Dec 5, 2016

lasley changed the title ~~[RFC] Translation of Imported Data (ICD-10, etc.)~~ [RFC][11.0] Translation of Imported Data (ICD-10, etc.) Dec 6, 2016

github-actions bot added the stale PR/Issue without recent activity, it'll be soon closed automatically. label Nov 6, 2022

github-actions bot closed this as completed Dec 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC][11.0] Translation of Imported Data (ICD-10, etc.) #136

[RFC][11.0] Translation of Imported Data (ICD-10, etc.) #136

lasley commented Dec 5, 2016 •

edited

github-actions bot commented Nov 6, 2022

[RFC][11.0] Translation of Imported Data (ICD-10, etc.) #136

[RFC][11.0] Translation of Imported Data (ICD-10, etc.) #136

Comments

lasley commented Dec 5, 2016 • edited

github-actions bot commented Nov 6, 2022

lasley commented Dec 5, 2016 •

edited