
Quit using ForeignPtr in favor of ByteArray# #193

Closed
andrewthad opened this issue Nov 26, 2019 · 25 comments

Comments

@andrewthad
Contributor

Three years ago, @dcoutts wrote:

Long term we'd like to eliminate the use of ForeignPtr and use only pinned or unpinned ByteArray#s

Is this still planned? Is there still interest in doing this?

@hvr
Member

hvr commented Dec 19, 2019

@andrewthad Well, if we can figure out a good way to do that, it's still on the table; but in the meantime there's also the direction explored in #175 to slim down the type.

Eliminating the use of ForeignPtr would, however, come at a cost in terms of regressions, no? ByteStrings are used a lot for IO & FFI purposes, and you'd lose the ability to seamlessly refer to foreign memory (including mmap(2)ed memory and other shared buffers) with a uniform ByteString type. Would we need to introduce a separate type to cover those lost use-cases?

@andrewthad
Contributor Author

Here is what we would lose, along with my gut feeling about how much of a problem these are:

  • Functionality: ByteString could not be used with mmap. I'm skeptical that it is ever a good idea to use ByteString with mmap. You'll either end up needing to close() a file descriptor in a finalizer, which is a great way to leak file descriptors, or you just need to open the mmap at the beginning with no plans to ever call close(). I think of this as an uncommon use case although someone may be doing it.
  • Performance: If you are dealing with a foreign function that allocates memory, you would have to copyAddrToByteArray# this into a MutableByteArray#, freeze the byte array in place, and then free() the original memory. Most C libraries, rather than mallocing memory and returning it, have interfaces that allow the user to pass in a buffer. In these cases, performance would not suffer.
  • Functionality: If the C library that handed you the memory expects you to pass it back sometime later at the same address (meaning that you could not free() it), the ByteArray#-backed variant would not work. I don't know if this ever happens. I've not integrated with a C library that requires this, but maybe there's one out there.

Here are the benefits:

  • Performance: I suspect that the overwhelming majority of ByteStrings out there use PlainPtr and do not need to be pinned. These should perform better. I have no idea what kind of improvement should be expected.
  • Space: Even with Remove the offset parameter completely #175, a PlainPtr-backed ByteString has an overhead of 8 machine words when it is on the heap (ByteString data constructor header + length field + addr field + ptr to ForeignPtrContents + PlainPtr data constructor header + ptr to ByteArray# + ByteArray# header + length). The ByteArray# variant would only have an overhead of 6 machine words, and it would result in only 2 objects on the heap instead of 3 (a rough sketch of both layouts follows this list).
  • Simplicity: bytestring uses IO in a number of confusing ways. Pervasive use of ByteArray# would make it possible to use ST and pure functions (like indexWord8Array#) to model effectfulness with types. Also, all of the touch-related code would go away.
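
For illustration only (the type and field names below are hypothetical, not the real definitions verbatim), here is roughly what the two layouts compared above look like:

{-# LANGUAGE MagicHash #-}
import Data.Word (Word8)
import Foreign.ForeignPtr (ForeignPtr)
import GHC.Exts (ByteArray#)

-- Today's layout (bytestring-0.10 era): the string points at a ForeignPtr,
-- whose PlainPtr contents in turn points at the pinned byte array.
data PtrBacked = PS {-# UNPACK #-} !(ForeignPtr Word8)  -- Addr# + pointer to ForeignPtrContents
                    {-# UNPACK #-} !Int                 -- offset (the field #175 proposes to drop)
                    {-# UNPACK #-} !Int                 -- length

-- The hypothetical ByteArray#-backed layout: the string points straight at
-- the (possibly unpinned) byte array, so one whole indirection disappears.
data ArrayBacked = BA ByteArray#             -- header + length + payload
                      {-# UNPACK #-} !Int    -- offset
                      {-# UNPACK #-} !Int    -- length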

In my mind, the obstacle is that it's hard to measure the performance impact. In an ideal world, I'd like to try out this change and then build something huge like GHC (or anything else with good macrobenchmarks) and see if it gets noticeably faster. However, this isn't possible because bytestring forces users to use the internal API to do a ton of common stuff. For example, it's not currently possible to implement fromByteArray# :: ByteArray# -> ByteString without importing an internal module.
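
For instance, a conversion along these lines can only be written today by reaching into Data.ByteString.Internal and GHC.ForeignPtr. A rough sketch, only sound for a pinned ByteArray# that is never mutated afterwards; the function name is hypothetical:

{-# LANGUAGE MagicHash #-}
import Data.ByteString.Internal (ByteString, fromForeignPtr)
import GHC.Exts (ByteArray#, Int (I#), byteArrayContents#, sizeofByteArray#, unsafeCoerce#)
import GHC.ForeignPtr (ForeignPtr (ForeignPtr), ForeignPtrContents (PlainPtr))

-- Wrap a pinned ByteArray# as a ByteString without copying. The PlainPtr
-- constructor keeps the array alive; byteArrayContents# gives its address.
fromPinnedByteArray# :: ByteArray# -> ByteString
fromPinnedByteArray# ba# =
  fromForeignPtr
    (ForeignPtr (byteArrayContents# ba#) (PlainPtr (unsafeCoerce# ba#)))
    0                           -- offset
    (I# (sizeofByteArray# ba#)) -- length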

What I would like to do is add functions to bytestring that capture the common patterns that show up when people import the internal modules. After this lands in a release, I could chip away at the uses of bytestring's internal API in several of the standard libraries. (The nice thing about this is that it's probably a good idea to do this anyway.) Then it would actually be possible to swap out the implementation in a branch and measure the impact.

@hvr
Member

hvr commented Jan 12, 2020

If the C library that handed you the memory expects you to pass it back sometime later at the same address (meaning that you could not free() it), the ByteArray#-backed variant would not work. I don't know if this ever happens. I've not integrated with a C library that requires this, but maybe there's one out there.

In general, you have to assume that if a C library allocates and gives you a memory chunk while retaining ownership, you ought not to shuffle it around in memory unless the C API tells you explicitly that you're allowed to. I can't name a concrete C library that does this off the top of my head, but I recall having encountered a couple of such libraries in my past C++ life (both OSS and proprietary) where the library would silently screw up if you handed it back a cloned memory buffer rather than the original, because it actually used the pointer address as a key into a hashmap, or did its own memory allocation management and would get utterly confused if it got passed a memory buffer it didn't recognize.

Interfacing with C APIs is definitely one of bytestring's primary use-cases/goals. But maybe one size doesn't fit all, and maybe the solution is really to split ByteString into two different types and un-conflate its two current goals: efficient representation of strict and lazy Word8 strings (plus the associated operations) versus FFI-optimized, memory-pinned ByteStrings with support for ForeignPtr-backed payloads.

Data.ByteString.Short already moves in this direction (and I basically wrap ShortByteString in text-short), so maybe we should just expand on it: e.g. add more operations (some of which I can donate from text-short) as well as a "lazy" ShortByteString?

What I would like to do is add functions to bytestring that capture the common patterns that show up when people import the internal modules.

Could you identify/enumerate those patterns? Reducing the need to reach for .Internals (assuming those patterns don't require breaking invariants) sounds like a sensible goal in its own right.

@vdukhovni
Contributor

  • Functionality: ByteString could not be used with mmap. I'm skeptical that it is ever a good idea to use ByteString with mmap. You'll either end up needing to close() a file descriptor in a finalizer, which is a great way to leak file descriptors, or you just need to open the mmap at the beginning with no plans to ever call close(). I think of this as an uncommon use case although someone may be doing it.

Just a comment that mmap()ed files are not automatically unmapped on close, so there's no need to close in a finalizer; the file can be closed right away. To actually remove the mapping one needs to call munmap(). This is not a comment in support of or against the proposal to use ByteArray#, just a clarification re mmap().

@lyokha

lyokha commented Aug 24, 2020

If the C library that handed you the memory expects you to pass it back sometime later at the same address (meaning that you could not free() it), the ByteArray#-backed variant would not work. I don't know if this ever happens. I've not integrated with a C library that requires this, but maybe there's one out there.

In general, you have to assume that if a C library allocates and gives you a memory chunk while retaining ownership, you ought not to shuffle it around in memory unless the C API tells you explicitly that you're allowed to. I can't name a concrete C library that does this off the top of my head, but I recall having encountered a couple of such libraries in my past C++ life (both OSS and proprietary) where the library would silently screw up if you handed it back a cloned memory buffer rather than the original, because it actually used the pointer address as a key into a hashmap, or did its own memory allocation management and would get utterly confused if it got passed a memory buffer it didn't recognize.

Pinning memory in ByteString is crucial in nginx-haskell-module for interoperability between the core Nginx C code and Haskell handlers which produce ByteStrings to be consumed directly in the core. When a Haskell handler finishes its task, it notifies the core via an event channel (which can be an eventfd or a pipe). The core takes ownership of the ByteString's internals via a StablePtr to the ByteString that was returned to it. The StablePtr guarantees that the ByteString can be reconstructed when accessed in the C core. But, if I understand this correctly, with only a StablePtr as a container there is no guarantee that the ByteString buffers will stay at the same addresses while the StablePtr is alive, so pinning the buffers matters: the C core reads the value as C string(s) straight from the address of the ByteString's buffer(s). So I am afraid that removing ByteString memory pinning would make this simple sharing mechanism unworkable.
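
A minimal sketch (not the actual nginx-haskell-module code) of the pattern described above; the exportToC helper is hypothetical, and the real module wires the result through an eventfd/pipe:

import qualified Data.ByteString as B
import Data.ByteString.Unsafe (unsafeUseAsCStringLen)
import Foreign.C.String (CString)
import Foreign.C.Types (CSize)
import Foreign.StablePtr (StablePtr, newStablePtr)

-- Hand the C core a StablePtr that keeps the ByteString alive, plus the
-- address and length of its buffer. The returned pointer is only meaningful
-- because the buffer is pinned; with movable (unpinned) storage the address
-- could be stale by the time the C core reads it.
exportToC :: B.ByteString -> IO (StablePtr B.ByteString, CString, CSize)
exportToC bs = do
  sp <- newStablePtr bs    -- the C core frees this when it is done with the value
  unsafeUseAsCStringLen bs $ \(ptr, len) ->
    pure (sp, ptr, fromIntegral len)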

@chessai
Member

chessai commented Aug 24, 2020 via email

@Bodigrim
Contributor

Bodigrim commented Oct 8, 2020

What I would like to do is add functions to bytestring that capture the common patterns that show up when people import the internal modules.

@andrewthad This would be highly appreciated. Cf. #253.

@vdukhovni
Contributor

I'd like to ask what the objective is here. Is it making the storage visible to the GC, so that the total memory in use is not underestimated and the GC is better able to infer when collection is needed? Or is it to try to reduce fragmentation by using unpinned storage?

ByteStrings could certainly be backed by pinned MutableByteArray#, addressing the first concern, but if the goal is indeed to "unpin" the storage, then that rather changes the compatibility story, since they are used in a broad range of FFI use-cases, where it is crucial to know that GC will not move the bytestring content.

@vdukhovni
Contributor

Here is what we would lose, along with my gut feeling about how much of a problem these are:

  • Functionality: ByteString could not be used with mmap. I'm skeptical that it is ever a good idea to use ByteString with mmap. You'll either end up needing to close() a file descriptor in a finalizer, which is a great way to leak file descriptors, or you just need to open the mmap at the beginning with no plans to ever call close(). I think of this as an uncommon use case although someone may be doing it.

This is a misunderstanding. The mmap(2) system call DOES NOT require the file to stay open. After you mmap a region of memory, you can close the file descriptor immediately; the mapping remains valid. To release the mapping you use munmap(2). Of course one still needs a finaliser to do the munmap.

  • Performance: If you are dealing with a foreign function that allocates memory, you would have to copyAddrToByteArray# this into a MutableByteArray#, freeze the byte array in place, and then free() the original memory. Most C libraries, rather than mallocing memory and returning it, have interfaces that allow the user to pass in a buffer. In these cases, performance would not suffer.

Of course only if the underlying MutableByteArray# is not pinned in the first place. If the idea is to just expose the memory usage to the GC, but not unpin the memory, it should be easy to just change the ForeignPtr associated with ByteStrings to capture and keep alive a pinned MutableByteArray# RealWorld, and set the associated Ptr to the address returned by byteArrayContents#, and pretty much have the same behaviour as before.
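
For what it's worth, this is essentially how heap-allocated ByteStrings already work: Data.ByteString.Internal allocates via GHC.ForeignPtr.mallocPlainForeignPtrBytes, which wraps a pinned MutableByteArray# in a PlainPtr and exposes its byteArrayContents# address. A minimal usage sketch (the zeroed helper is hypothetical):

import Data.ByteString (ByteString)
import Data.ByteString.Internal (create)
import Foreign.Marshal.Utils (fillBytes)

-- Allocate an n-byte ByteString, filled with zeros, entirely on the GHC heap;
-- the backing storage is a pinned MutableByteArray# kept alive by a ForeignPtr.
zeroed :: Int -> IO ByteString
zeroed n = create n (\p -> fillBytes p 0 n)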

  • Functionality: If the C library that handed you the memory expects you to pass it back sometime later at the same address (meaning that you could not free() it), the ByteArray#-backed variant would not work. I don't know if this ever happens. I've not integrated with a C library that requires this, but maybe there's one out there.

I don't think that's an obstacle; ByteString creation primitives could be provided that wrap around an Addr# with either no finalisation, or whatever finalisation is appropriate for the returned buffer.
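
For illustration, primitives of roughly this shape already exist in Data.ByteString.Unsafe; the wrapper names below are hypothetical, but unsafePackCStringLen and unsafePackCStringFinalizer are real:

import qualified Data.ByteString as B
import Data.ByteString.Unsafe (unsafePackCStringLen, unsafePackCStringFinalizer)
import Data.Word (Word8)
import Foreign.C.String (CString)
import Foreign.Marshal.Alloc (free)
import Foreign.Ptr (Ptr)

-- Wrap a buffer the C library still owns: no finalisation at all.
wrapBorrowed :: CString -> Int -> IO B.ByteString
wrapBorrowed p len = unsafePackCStringLen (p, len)

-- Wrap a malloc'd buffer we now own: free() it once the ByteString is collected.
wrapOwned :: Ptr Word8 -> Int -> IO B.ByteString
wrapOwned p len = unsafePackCStringFinalizer p len (free p)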

Here are the benefits:

  • Performance: I suspect that the overwhelming majority of ByteStrings out there use PlainPtr and do not need to be pinned. These should perform better. I have no idea what kind of improvement should be expected.

If the bytestring only lives a short time, and does not contribute to "fragmentation" I would not expect a performance advantage from unpinned memory.

  • Simplicity: bytestring uses IO in a number of confusing ways. Pervasive use of ByteArray# would make it possible to use ST and pure functions (like indexWord8Array#) to model effectfulness with types. Also, all of the touch-related code would go away.

The touch code needs to remain whenever the content is passed to FFI functions, ensuring that the storage is kept alive for the duration of the FFI call.

What I would like to do is add functions to bytestring that capture the common patterns that show up when people import the internal modules. After this lands in a release, I could chip away at the uses of bytestring's internal API in several of the standard libraries. (The nice thing about this is that it's probably a good idea to do this anyway.) Then it would actually be possible to swap out the implementation in a branch and measure the impact.

I've been tinkering with streaming-bytestring lately; it uses the internal representation in multiple places, in that PS fp off len is expected to yield access to a foreign pointer and length that can be used to directly access the storage via peek, peekElemOff, ... Basically, bytestring needs to continue to provide a Foreign.Storable + ForeignPtr interface.

@chessai
Member

chessai commented Dec 14, 2020

What I would like to do is add functions to bytestring that capture the common patterns that show up when people import the internal modules. After this lands in a release, I could chip away at the uses of bytestring's internal API in several of the standard libraries. (The nice thing about this is that it's probably a good idea to do this anyway.) Then it would actually be possible to swap out the implementation in a branch and measure the impact.

I've been tinkering with streaming-bytestring lately; it uses the internal representation in multiple places, in that PS fp off len is expected to yield access to a foreign pointer and length that can be used to directly access the storage via peek, peekElemOff, ... Basically, bytestring needs to continue to provide a Foreign.Storable + ForeignPtr interface.

Note that this can be achieved with:

data ByteString = ByteString ByteArray Int Int

@mpickering

In my opinion, the primary advantage of performing this change is avoiding fragmentation. It is very easy to end up with a severely fragmented heap by accident if you use ByteStrings, or use libraries which do (e.g. http-client).

The cost falls on people who actually rely on the pinned nature of ByteStrings; this can be mitigated by providing a Data.ByteString.Pinned module which uses pinned memory.

@vdukhovni
Contributor

The Builder code fundamentally relies on FFI calls to generate the various serialisations it supports, and much of the networking code, Data.Binary, ... rely on ByteStrings not getting moved around during FFI calls. I'd say most uses of ByteString are not just because users want 8-bit strings, but rather because they do I/O with ByteStrings. Is memory fragmentation really that important? For ByteStrings one intends to keep around for a long time (as Map keys, ...), ShortByteString already provides a suitable API. The ByteStrings used as I/O buffers don't tend to be long-lived (at least when used correctly), so I am not sure that memory fragmentation is a substantial issue.

What happens if pinning is removed? What can we still do in terms of FFI?

@mpickering

Fragmentation is a serious issue. One small ByteString can easily retain a whole megablock (1 MB); if you are allocating a lot of small ByteStrings you get into this situation quite quickly.

Perhaps a better approach is to use a mesh allocation strategy for pinned objects in GHC. That would solve the fragmentation issue - https://raw.githubusercontent.com/plasma-umass/Mesh/master/mesh-pldi19-powers.pdf

@andrewthad
Contributor Author

@vdukhovni Only safe FFI calls need pinned ByteStrings. GHC explicitly guarantees that no objects of any type can be moved during an unsafe FFI call. So we could still do a lot in terms of the FFI. On UNIX and UNIX-like systems, network does not need the objects to be pinned since it only makes nonblocking calls. I have no idea what it does on Windows, though.
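
For illustration, a minimal sketch of the unsafe-call case; this style of import already appears in bytestring's ShortByteString code, though the exact declaration below is just an example:

{-# LANGUAGE MagicHash, UnliftedFFITypes #-}
-- With UnliftedFFITypes, GHC lets a ByteArray# be passed straight to an
-- unsafe foreign import: the garbage collector cannot run during the call,
-- so even an unpinned array cannot move while C reads it.
import Foreign.C.Types (CInt (..))
import GHC.Exts (ByteArray#)

foreign import ccall unsafe "string.h memcmp"
  c_memcmp :: ByteArray# -> ByteArray# -> Int -> IO CInt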

@mpickering MESH is extremely cool. The paper blew my mind. Realistically, I suspect it might be the only option for GHC at this point. One wrinkle is that the implementation that the paper describes wants objects of the same size class to be located together in the same block. So a block would only have objects of size 32 bytes, 64 bytes, etc. GHC doesn't do this. So I'm not sure if MESH could be incorporated into the runtime.

@Bodigrim
Contributor

Independently of its merits or their absence, this ticket is blocked by the existing ecosystem, which routinely, without a second thought, reaches for Data.ByteString.Internal and the ForeignPtr guts. If someone is serious about moving this forward, I'd be happy to discuss in #253 what could be done to alleviate this situation.

@emilypi
Member

emilypi commented May 14, 2021

I have an extremely good appetite for this breakage.

@mpickering

An issue was reported on the GHC issue tracker which demonstrated very bad memory fragmentation due to the large number of small ByteStrings that were allocated.

https://gitlab.haskell.org/ghc/ghc/-/issues/20065#note_366130

@Bodigrim
Contributor

I'm tempted to close this issue. There is no migration path to unpin ByteString, so I don't see it happening. Users concerned about fragmentation have the option to use ShortByteString or ByteArray.
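
A minimal sketch of that suggestion; the retain/release names are hypothetical, while toShort and fromShort are the real conversion functions:

import qualified Data.ByteString as B
import qualified Data.ByteString.Short as SBS

-- Copy into unpinned memory before storing the value long-term.
retain :: B.ByteString -> SBS.ShortByteString
retain = SBS.toShort

-- Copy back into a pinned ByteString when handing it to IO/FFI code.
release :: SBS.ShortByteString -> B.ByteString
release = SBS.fromShort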

@Bodigrim
Contributor

@sjakobi how do you feel about closing this issue?

@sjakobi
Member

sjakobi commented Mar 20, 2022

There is no migration path to unpin ByteString, so I don't see it happening. Users concerned about fragmentation have the option to use ShortByteString or ByteArray.

I agree with this, although I suspect that migrating from ByteString to ByteArray#-based types in user code may not be viable in many cases.

I hope that GHC can quickly start using MESH (https://gitlab.haskell.org/ghc/ghc/-/issues/19175) to reduce memory fragmentation.

@ulidtko
Contributor

ulidtko commented May 17, 2023

Such an embarrassment for the Haskell ecosystem that this is "closed". 😕

That ByteString is backed by pinned memory and that naive usage may cause heap fragmentation is quite the gotcha: it would surprise any developer not intimately familiar with GHC RTS innards who is just looking for a type to hold some bytes. All too often, the bytes are just pre-encoded text, with no particular FFI or IO requirements, except perhaps for easier interaction with other library APIs.

It's good that the caveat is documented:

-- ** Heap fragmentation
-- | With GHC, the 'B.ByteString' representation uses /pinned/ memory,
-- meaning it cannot be moved by the GC. This is usually the right thing to
-- do for larger strings, but for small strings using pinned memory can
-- lead to heap fragmentation which wastes space. The 'ShortByteString'
-- type (and the @Text@ type from the @text@ package) use /unpinned/ memory
-- so they do not contribute to heap fragmentation. In addition, with GHC,
-- small unpinned strings are allocated in the same way as normal heap
-- allocations, rather than in a separate pinned area.

— but IMO, it's not enough to have this on ShortByteString only; ByteString should mention it too.

I'll submit a PR if such a doc-patch is welcome. It's the least we can do.

@Bodigrim
Contributor

Documentation improvements are welcome.

Such an embarrassment for the Haskell ecosystem that this is "closed". 😕

If you can offer a specific migration plan, I'm all ears. If not, please choose your words more carefully. It's unfair to transfer the responsibility for using the wrong data type from a developer to the bytestring authors/maintainers. Sure, by all means, do not abuse ByteString and go for ShortByteString or ByteArray instead.

@ulidtko
Contributor

ulidtko commented May 17, 2023

@Bodigrim I didn't try to do that... Apologies for the tone — it only reflects the deep sadness of the situation for those who find this ticket. It carries no intent to blame, as indeed, I see no viable migration plan either.

Observe that it's difficult to know upfront what constitutes abuse of ByteString — unless one has already found the fragmentation note on ShortByteString. That's why I'm offering a doc patch. There surely can't be any compatibility concerns about that, right?

@Bodigrim
Contributor

As I said, documentation improvements are welcome.

@hasufell
Member

ShortByteString has been on par with ByteString in terms of API since 0.11.3.0. So there's no reason not to use it extensively.
