Skip to content

v1.0.35 - Search multilingual PT-BR/EN-US/ES com normalizacao de acentos

Choose a tag to compare

@mrhenrike mrhenrike released this 10 Jun 05:01
· 7 commits to master since this release

Multilingual Search Engine

industrialxpl/core/search/engine.py

New SearchEngine with:

  • Unicode normalization: accepts accented and non-accented forms
    mineracao = mineracao = mineracao (all map to mining modules)
    petroleo = petroleo = oilgas modules
    agua = agua = water modules
    clp = plc modules (CLP is PLC in PT-BR)
    ihm = hmi modules (IHM is HMI in PT-BR)
    varredura = scanner modules
  • 200+ multilingual aliases covering:
    PT-BR: mineracao, petroleo, agua, energia, fabrica, predio, maritimo...
    ES: mineria, manufactura, quimico, maritimo, ferrocarril...
    Synonyms: garimpo, lavra, dutovia, subestacao, barramento...
    Abbreviations: CLP (PLC), IHM (HMI), SIS, OT, ICS, RTU...
  • Sector filter: search sector=mineracao (PT-BR sector names work)
  • Type filter: search type=varredura (PT-BR type names work)
  • Fuzzy suggestions: Did you mean X? (Levenshtein edit distance)
  • Expansion notes: shows when multilingual alias was used

Examples that now work:

search mineracao -> 4 mining modules
search minério -> same (accent normalized)
search mineração -> same
search petroleo -> modbus, dnp3, night_dragon (oil & gas related)
search agua -> water/scada modules
search clp -> plc modules (CLP = PLC in PT-BR)
search varredura -> 114 scanner modules
search xxxxxxinvalid -> Did you mean? suggestions