You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
isaomatsunami opened this issue
Mar 15, 2021
· 2 comments
· Fixed by #3736
Assignees
Labels
encodingSelection of encoding at import time, or encoding issues in data cleaningexportExporting a project to some format. Use the format-specific sub-label if availableType: BugIssues related to software defects or unexpected behavior, which require resolution.
East Asian Characters in Project name are corrupted in exported files.
To Reproduce
First, import the attached xls file, which has a filename of German-Japanese-French-Chinese characters.(and contains such characters inside cells too). Project names are not corrupted. äöüßいろはにàçéùè§繁體字.xls.zip
Then, Export data as tsv/csv/excel
Finally, the output filename is "äöüß????àçéùè§???-xls.csv". German/French are preserved but Japanese /Chinese are not.
Versions
Operating System: Mac OS
Browser Version: Firefox 61, Safari
JRE or JDK Version: java version "11.0.1" 2018-10-16 LTS/Java(TM) SE Runtime Environment 18.9 (build 11.0.1+13-LTS)/Java HotSpot(TM) 64-Bit Server VM 18.9 (build 11.0.1+13-LTS, mixed mode
OpenRefine: 3.5-SNAPSHOT
Additional context
The problem of the same kind on importing files with exotic characters is perfectly resolved. Huge thanks to dev team!
The text was updated successfully, but these errors were encountered:
isaomatsunami
added
Type: Bug
Issues related to software defects or unexpected behavior, which require resolution.
Status: Pending Review
Indicates that the issue or pull request is awaiting review by project maintainers or collaborators
labels
Mar 15, 2021
wetneb
added
encoding
Selection of encoding at import time, or encoding issues in data cleaning
export
Exporting a project to some format. Use the format-specific sub-label if available
labels
Mar 16, 2021
Actually, it looks like the filename is set here and passed to the browser in the Content-Disposition header.
This doc seems to suggest the unicode needs to be url encoded with a byte-encoding specified. I think OpenRefine already packages PercentEscaper from guava.
encodingSelection of encoding at import time, or encoding issues in data cleaningexportExporting a project to some format. Use the format-specific sub-label if availableType: BugIssues related to software defects or unexpected behavior, which require resolution.
East Asian Characters in Project name are corrupted in exported files.
To Reproduce
äöüßいろはにàçéùè§繁體字.xls.zip
Versions
Additional context
The problem of the same kind on importing files with exotic characters is perfectly resolved. Huge thanks to dev team!
The text was updated successfully, but these errors were encountered: