Skip to content

"Collapse consecutive whitespace" operation does not collapse all possible unicode whitespace #4883

@wetneb

Description

@wetneb

The "Collapse consecutive whitespace" operation does not work when applied to certain whitespace unicode characters.

To Reproduce

Steps to reproduce the behavior:

  1. First, import this openrefine project: Clipboard.openrefine.tar.gz
  2. Then, run the "Collapse consecutive whitespace" operation on the only column

Current Results

The cell is not edited

Expected Behavior

The cell should be edited to "hello world"

Versions

  • JRE or JDK Version: 11
  • OpenRefine: 3.6-SNAPSHOT

Datasets

Real world dataset where this appears: https://opendata.paris.fr/explore/dataset/lieux-de-tournage-a-paris/information/?disjunctive.type_tournage&disjunctive.nom_tournage&disjunctive.nom_realisateur&disjunctive.nom_producteur&disjunctive.ardt_lieu

Additional context

Discovered while doing a demo at Dataharvest 2022

Metadata

Metadata

Labels

Type: BugIssues related to software defects or unexpected behavior, which require resolution.localizationanything to do with i18n Internationalization and I10n localization

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions