Request #69086 - enhancement for mb_convert_encoding #1098

masakielastic · 2015-02-20T06:04:49Z

This pull request improves the handling of the substitute character when the third argument of mb_convert_encoding (the source encoding) differs from the value of mb_internal_encoding.

…current_filter_illegal_substchar)

Fix bug #69086 enhancement for mb_convert_encoding

php-pulls · 2016-08-10T05:47:39Z

Comment on behalf of yohgaki at php.net:

Merged. Thank you for the PR.

nikic · 2017-08-03T19:39:36Z

Reviewing more code related to #1094 (comment)... there are some problems here as well. Again the check is being made against the source encoding of the string, not the target encoding, which is where the substitution character has to be mapped. For example:

<?php
mb_internal_encoding("UTF-8");
mb_substitute_character(0xfffd);
var_dump(bin2hex(mb_convert_encoding("\x80", "UTF-8", "EUC-JP-2004")));

This will result in U+003F (?), even though UTF-8 clearly supports U+FFFD.

However, even if the target encoding is checked instead of the source encoding, the check would still be too strict in the case where the target encoding is a "non-Unicode" encoding and does not match the internal encoding. There are many encodings that support large ranges of non-ASCII Unicode codepoints, but with the current logic they would always fall back to using U+3F.

nikic · 2017-08-03T19:59:43Z

This is now fixed by fb9bf5b. No upfront check is performed anymore; instead, mbfl_convert will simply try to use the character and, if that fails, fall back to ?. This way all characters supported by the target encoding should be usable.
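
The "try the character, fall back to ? on failure" idea can be sketched in userland PHP. This is only an illustration, not the code in fb9bf5b: pick_substitute() is a hypothetical helper name, and the round-trip comparison is an assumed stand-in for whatever mbfl_convert does internally. It needs PHP 7.2 or later for mb_chr().

```php
<?php
// Hypothetical userland helper (not part of mbstring): returns $codepoint if
// the target encoding can represent it, otherwise U+003F ("?").
function pick_substitute(int $codepoint, string $targetEncoding): int
{
    $char = mb_chr($codepoint, 'UTF-8');
    if ($char === false) {
        return 0x3F; // not a valid code point at all
    }

    // "Try to use the character": convert it to the target encoding and back.
    $encoded   = mb_convert_encoding($char, $targetEncoding, 'UTF-8');
    $roundTrip = mb_convert_encoding($encoded, 'UTF-8', $targetEncoding);

    // If the character did not survive the round trip, the target encoding
    // cannot represent it, so fall back to "?".
    return $roundTrip === $char ? $codepoint : 0x3F;
}

// Example calls: a Unicode target, a legacy Japanese target, and Latin-1.
var_dump(dechex(pick_substitute(0xFFFD, 'UTF-8')));
var_dump(dechex(pick_substitute(0x3013, 'SJIS')));
var_dump(dechex(pick_substitute(0xFFFD, 'ISO-8859-1')));
```

The SJIS call illustrates the earlier point about "non-Unicode" encodings: U+3013 is outside ASCII but is still representable there, so a check that only accepts Unicode targets would reject it unnecessarily.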

added check for the combination of encodings and the value of MBSTRG(current_filter_illegal_substchar)

ac8b7b0

jpauli added the Feature label Feb 20, 2015

masakielastic added 2 commits February 22, 2015 13:59

add extra test case and fix typo.

1ce5f91

update the functions for checking the names of encodings

e756c9f

php-pulls pushed a commit that referenced this pull request Aug 10, 2016

Merge pull request #1098

850a0b5

Fix bug #69086 enhancement for mb_convert_encoding

php-pulls closed this Aug 10, 2016
