Added multibyte string functions #161

adriansuter · 2016-10-31T10:55:25Z

What?

Adds the possibilty to use multibyte string functions in the rules "length" and "lengthBetween".

Checklist

Added unit test for added/fixed code
Updated the documentation
Scrutinizer code coverage is 100%
Scrutinizer code quality is as high as possible

Linked issue

#160

Notes

I have not yet updated the documentation.
I modified the current unit tests for LengthTest only. Probably it would be better to make a new class LengthTestMultibyte which actually tests the multibyte cases. Because right now, I have change the LengthTest such that it always uses the multibyte functions. But in my opinion this unit test should only contain the original rule without encoding.

Finesse · 2017-12-12T02:18:30Z

src/Rule/Length.php

+        if (is_null($this->encoding) || !function_exists('mb_strlen')) {
+            $actualLength = strlen($value);
+        } else {
+            $actualLength = mb_strlen($value, $this->encoding);


Why not to use just $actualLength = function_exists('mb_strlen') ? mb_strlen($value) : strlen($value)?

I want to just use ->length(10) with default encoding, I don't want to pass 'utf8' everywhere.

You are right. I just tried to be backwards compatible. Maybe someone uses the rule and knows about that non-multibyte behaviour. Then this person maybe decided to count the number of special chars like äöüéàè and to remove that number from the measured wrong length (because in utf8, these would get treated as two chars respecively).

But of course, it would be better to handle this as default.

@adriansuter Now it is clear. I agree that your implementation is technically backward compatible and my is not.

This is an ambiguous question. I think that counting bytes (not characters) is a bug, but other may think vice versa because the documentation is not precise about it. Hope that this question will be clarified in the next major release.

Me too. I think PHP should change the default behaviour of strlen(). The expected result for most people I suppose, is the number of characters. In PHP (in case PHP is used in web development) one rarely has to count the byte length of a string. And if so, there is always unpack().

We will see, if PHP would implement that (I doubt it :-)).

adunsulag · 2023-08-02T11:20:30Z

@rick-nu Just wondering if this PR was not merged due to the documentation updates missing? If OpenEMR picked this PR up and added the documentation, could we get this merged in? Hope all is well!

Added multibyte string functions

1620009

adriansuter mentioned this pull request Oct 31, 2016

Multibyte String Functions #160

Closed

rick-nu assigned berry-langerak and rick-nu May 17, 2017

rick-nu mentioned this pull request Sep 28, 2017

Fixed issue #97 - support multi-byte string #106

Closed

Finesse reviewed Dec 12, 2017

View reviewed changes

vova07 mentioned this pull request Nov 6, 2018

Use mb_strlen instead of strlen in Length rules #189

Closed

adriansuter closed this Jun 11, 2019

adriansuter deleted the feature/multibyte-strings branch June 11, 2019 13:22

adunsulag mentioned this pull request Aug 2, 2023

bug: Issues with multibyte characters on textbox fields openemr/openemr#6706

Open

sergio-ropero mentioned this pull request Aug 8, 2023

Length validation does not adhere multibyte chars #97

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added multibyte string functions #161

Added multibyte string functions #161

adriansuter commented Oct 31, 2016 •

edited

Loading

Finesse Dec 12, 2017

adriansuter Dec 12, 2017 •

edited

Loading

Finesse Dec 12, 2017 •

edited

Loading

adriansuter Dec 12, 2017

adunsulag commented Aug 2, 2023

Added multibyte string functions #161

Added multibyte string functions #161

Conversation

adriansuter commented Oct 31, 2016 • edited Loading

What?

Checklist

Linked issue

Notes

Finesse Dec 12, 2017

Choose a reason for hiding this comment

adriansuter Dec 12, 2017 • edited Loading

Choose a reason for hiding this comment

Finesse Dec 12, 2017 • edited Loading

Choose a reason for hiding this comment

adriansuter Dec 12, 2017

Choose a reason for hiding this comment

adunsulag commented Aug 2, 2023

adriansuter commented Oct 31, 2016 •

edited

Loading

adriansuter Dec 12, 2017 •

edited

Loading

Finesse Dec 12, 2017 •

edited

Loading