[feature] Add utf8ndup #41

warmwaffles · 2017-11-21T22:20:57Z

This is my current implementation. I am using it to replace all of my strndup calls.

#include <utf8.h>

void*
utf8ndup(const void* src, size_t n)
{
    const char* s = (const char*)src;
    char* c       = 0;

    // figure out how many bytes (including the terminator) we need to copy first
    size_t bytes = utf8size(src);

    c = (char*)malloc(n + 1); // n bytes plus room for the null terminator

    if (0 == c) {
        // out of memory so we bail
        return 0;
    }

    bytes = 0;
    size_t i = 0;

    // copy src byte-by-byte into our new utf8 string
    while ('\0' != s[bytes] && i < n) {
        c[bytes] = s[bytes];
        bytes++;
        i++;
    }

    // append null terminating byte
    c[bytes] = '\0';
    return c;
}

I don't know if this is desirable. I am half tempted to just calloc and memcpy the results.
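For illustration, the calloc-and-memcpy variant mentioned above might look roughly like the sketch below. The name utf8ndup_memcpy is hypothetical and not part of utf8.h, and n is assumed to count bytes rather than codepoints, as in the loop version.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, not part of utf8.h:
 * duplicate at most n bytes of src into a new null-terminated buffer. */
void* utf8ndup_memcpy(const void* src, size_t n)
{
    const char* s = (const char*)src;
    size_t len = 0;

    /* Bounded length scan: stop at the terminator or after n bytes. */
    while (len < n && '\0' != s[len]) {
        len++;
    }

    /* calloc zero-fills the buffer, so c[len] is already the terminator. */
    char* c = (char*)calloc(len + 1, 1);
    if (!c) {
        /* out of memory so we bail */
        return 0;
    }

    memcpy(c, s, len);
    return c;
}
```

Because n is a byte count here, a cut that lands inside a multi-byte sequence would leave a partial codepoint at the end of the copy. Whether that should be handled is a design choice this sketch does not make.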


f2404 · 2017-11-21T22:23:05Z

What's the point in

size_t bytes = utf8size(src);
bytes = 0;

?

warmwaffles · 2017-11-21T22:24:34Z

I think originally I intended to check to see if the new string will be smaller than the requested size.

But this is literally the utf8dup code with a tacked-on size_t n.

f2404 · 2017-11-21T22:26:44Z

Also, you don't need two iterators (bytes and i). One would be enough.

warmwaffles · 2017-11-21T22:32:05Z

void*
utf8ndup(const void* src, size_t n)
{
    const char* s = (const char*)src;
    char* c       = 0;

    // figure out how many bytes (including the terminator) we need to copy first
    size_t bytes = utf8size(src);

    if (n < bytes) {
        c = (char*)malloc(n + 1);
    } else {
        c = (char*)malloc(bytes);
        n = bytes;
    }

    if (!c) {
        // out of memory so we bail
        return 0;
    }

    bytes = 0;

    // copy src byte-by-byte into our new utf8 string
    while ('\0' != s[bytes] && bytes < n) {
        c[bytes] = s[bytes];
        bytes++;
    }

    // append null terminating byte
    c[bytes] = '\0';
    return c;
}

warmwaffles · 2017-11-21T22:33:36Z

Anyway, this could probably be better, and it could probably share the code used in utf8dup when the string is shorter than the requested n.
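One way the sharing could work, as a rough sketch only: both functions compute a byte length and hand it to a common copy helper. The names copy_bytes, sketch_dup and sketch_ndup are hypothetical stand-ins, not the utf8.h functions, and strlen is used here in place of utf8size so the snippet stands alone.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical shared helper: allocate len + 1 bytes, copy len bytes,
 * then append the null terminator. */
static void* copy_bytes(const char* s, size_t len)
{
    char* c = (char*)malloc(len + 1);
    if (!c) {
        /* out of memory so we bail */
        return 0;
    }
    memcpy(c, s, len);
    c[len] = '\0';
    return c;
}

/* Stand-in for utf8dup: copies the whole string. */
void* sketch_dup(const void* src)
{
    const char* s = (const char*)src;
    return copy_bytes(s, strlen(s));
}

/* Stand-in for utf8ndup: copies at most n bytes. */
void* sketch_ndup(const void* src, size_t n)
{
    const char* s = (const char*)src;
    size_t len = 0;

    /* Stop at the terminator or after n bytes, whichever comes first. */
    while (len < n && '\0' != s[len]) {
        len++;
    }
    return copy_bytes(s, len);
}
```

With this layout, a string shorter than n takes the same copy path as the plain duplicate, so the two functions differ only in how the length is computed.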

sheredom · 2017-11-22T11:03:45Z

Thanks for looking at this!

Two options:

Are you willing to do a PR to add this?
Otherwise, would you rather I incorporated this?

I'm happy to do the work, but some people would rather their name was on the commit if they did the work!

warmwaffles · 2017-11-22T20:33:19Z

@sheredom I would be more than happy to submit a PR for this. Just wanted to test the waters here first.

warmwaffles changed the title ~~Add utf8ndup~~ [feature] Add utf8ndup Nov 21, 2017

warmwaffles mentioned this issue Nov 22, 2017

Add utf8ndup #42

Merged

sheredom closed this as completed in #42 Nov 27, 2017
