Crazy idea: Unsafe C# extensions #2011

alexrp · 2015-08-31T00:39:49Z

This is a quite possibly insane idea that came to mind over the weekend and was fairly easy to hack into mcs. This patch adds a new unsafe language version (i.e. -langversion:unsafe) to mcs which implies experimental and enables the following extensions:

Pointers can be used in generic type arguments. (int i = 42; List<int*> list; list.Add (&i);)
Pointers to managed objects are allowed. (string s = "..."; string* p = &s;)
Managed types can be used in stackalloc declarations. (string* strings = stackalloc string [16];)
The size of managed references can be computed. (var sz = sizeof (string); /* 4 or 8 */)

All of these still require unsafe context. Also, there is no guarantee that the generated code will work with any other VM.

Pointers in generic type arguments

This one is interesting because it adds type safety that C# doesn't currently have when dealing with pointers in collections. Today, you have to maintain a List<IntPtr> which is not really any better than List<object> in terms of type safety.

Today, you do:

List<IntPtr> ptrs = new List<IntPtr> ();
int i = 42;
ptrs.Add ((IntPtr) &i);

If the type of i ever changes, you likely won't notice that you're now putting mistyped pointers into ptrs.

With the extension:

List<int*> ptrs = new List<int*> ();
int i = 42;
ptrs.Add (&i); // OK
short j = 21;
ptrs.Add (&j); // Not OK

Pointers to managed objects

This is a simple but surprisingly useful feature. It's as unsafe as any other kind of pointer, but when used with care can allow some patterns that are currently impossible in C# due to the restrictive nature of ref and out.

LLVM's instruction matching framework is one example of where this feature can be really useful. They have this code in lib/Transforms/InstCombine/InstCombineAddSub.cpp:

    Value *A = 0, *B = 0;
    if (match(RHS, m_Xor(m_Value(A), m_Value(B))) &&
        (match(LHS, m_And(m_Specific(A), m_Specific(B))) ||
         match(LHS, m_And(m_Specific(B), m_Specific(A)))))
      return BinaryOperator::CreateOr(A, B);

What happens here is that A and B are stored into temporary structures by reference via the m_... calls which set up match specifications. They are then set once the match call goes through the matching tree and calls match on each node. If the whole match passes, A and B have been set to the desired values from the instruction tree.

This can't really be done reasonably in C# with ref/out as these references can't be stored into fields. It can't be done with pointers currently, either, as pointers can only point to things that don't contain managed references, which is highly restrictive and impractical.

This change enables the above.

Managed types in `stackalloc` declarations

This quite simply allows the following:

struct ManagedData
{
    string SomeString;
}

object* objectArr = stackalloc object [16];
ManagedData* dataArr = stackalloc ManagedData [16];

That is, managed references can be stored in stackalloc'd memory, as can structures containing managed references. I suspect the latter case will be the most useful one in practice.

Note that this relies entirely on Mono doing stack scanning conservatively for managed stack frames. If that ever changes, this won't work, and for that reason I'm not really convinced this is as worthwhile an idea as the other features. On the other hand, even a stack-precise Mono could scan stackalloc'd memory conservatively to enable this feature. Who knows.

Size of managed references

This is a small feature that makes the above features really come together. Marshal.SizeOf has all sorts of special cases and only sometimes does what the user wants. sizeof will now simply do what you'd expect for any type: Return the size of storing that type in memory. For structures, this means the size of all its fields as the runtime sees it, while for references, this means the size of a managed reference. No special cases where certain types are not allowed.

This means that unsafe code that computes sizes of types will be clearer and more maintainable.

TL;DR

This off-by-default extension enables more practical and 'safe' unsafe programming in C# but generates code that won't necessarily work in other VMs. Useful enough to merge? I don't know. But I could see myself using these features in some tools that aren't intended to run anywhere but on Mono.

Therzok · 2015-08-31T06:13:05Z

These look awesome to me. 👍

txdv · 2015-08-31T11:55:14Z

Looks very interesting, what is the purpose though of getting a pointer to an object on the heap?

dori4n · 2015-08-31T12:14:16Z

Is this comparable to or the same as this?
https://msdn.microsoft.com/en-us/library/aa288474(v=vs.71).aspx

txdv · 2015-08-31T12:15:44Z

@dorianmuthig not really, the functionality you show is the default unsafe functionality. This is an addition on top, usually in C# you can get pointers only to blittable types.

redknightlois · 2015-09-01T02:53:43Z

Loved it. Specially "Pointers can be used in generic type arguments"

tritao · 2015-09-01T14:20:27Z

Me and @ddobrev have been discussing the usefulness of these for CppSharp. We could use these features to improve the generated bindings in some places but unfortunately we won't be able to because of our compatibility to .NET.

Still nice work on this, I'd like to see these getting in to allow for further experimentation.

alexrp · 2015-09-23T06:10:31Z

@txdv

Looks very interesting, what is the purpose though of getting a pointer to an object on the heap?

The LLVM example is one use case. What it's doing can't be expressed in C# today.

@tritao

We could use these features to improve the generated bindings in some places but unfortunately we won't be able to because of our compatibility to .NET.

The thing is, if this is added as an extension and it turns out people find this useful in practice, we could probably push for standardization with Microsoft. Ecma 335 even states that pointers not being allowed in generics is merely an arbitrary limitation.

ddobrev · 2015-09-23T11:43:28Z

If this is pushed to ECMA, I am all for. I am afraid though that up until then we cannot use it even just on Mono because there would be a difference in the API. We do need to provide different binding assemblies per OS anyway but our API can and must be the same.

tritao · 2015-10-11T11:59:51Z

Someone else (Alexandre Mutel) has been playing with some similar extensions for CoreCLR and Roslyn.

http://xoofx.com/blog/2015/09/27/struct-inheritance-in-csharp-with-roslyn-and-coreclr/
http://xoofx.com/blog/2015/10/08/stackalloc-for-class-with-roslyn-and-coreclr/

https://github.com/xoofx/StackAllocForClass

lewurm · 2016-03-28T20:02:30Z

is there still interest to merge this or should it be closed?

alexrp · 2016-03-29T11:01:39Z

I think that's a @marek-safar question. I believe he was looking into how an extension like this could be integrated neatly?

dnfclas · 2016-04-06T20:46:06Z

@alexrp, Thanks for signing the contribution license agreement so quickly! Actual humans will now validate the agreement and then evaluate the PR.

Thanks, DNFBOT;

dnfclas · 2016-04-11T03:56:05Z

@alexrp, Thanks for signing the contribution license agreement so quickly! Actual humans will now validate the agreement and then evaluate the PR.

Thanks, DNFBOT;

alexrp · 2016-06-06T14:03:59Z

Since we're probably switching to Roslyn in the near future, I'll close this.

Unsafe C#

d50bcfa

dnfclas added the cla-signed label Apr 6, 2016

dnfclas added the cla-signed label Apr 11, 2016

alexrp closed this Jun 6, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Crazy idea: Unsafe C# extensions #2011

Crazy idea: Unsafe C# extensions #2011

alexrp commented Aug 31, 2015

Therzok commented Aug 31, 2015

txdv commented Aug 31, 2015

dori4n commented Aug 31, 2015

txdv commented Aug 31, 2015

redknightlois commented Sep 1, 2015

tritao commented Sep 1, 2015

alexrp commented Sep 23, 2015

ddobrev commented Sep 23, 2015

tritao commented Oct 11, 2015

lewurm commented Mar 28, 2016

alexrp commented Mar 29, 2016

dnfclas commented Apr 6, 2016

dnfclas commented Apr 11, 2016

alexrp commented Jun 6, 2016

Navigation Menu

Crazy idea: Unsafe C# extensions #2011

Crazy idea: Unsafe C# extensions #2011

Conversation

alexrp commented Aug 31, 2015

Pointers in generic type arguments

Pointers to managed objects

Managed types in stackalloc declarations

Size of managed references

TL;DR

Therzok commented Aug 31, 2015

txdv commented Aug 31, 2015

dori4n commented Aug 31, 2015

txdv commented Aug 31, 2015

redknightlois commented Sep 1, 2015

tritao commented Sep 1, 2015

alexrp commented Sep 23, 2015

ddobrev commented Sep 23, 2015

tritao commented Oct 11, 2015

lewurm commented Mar 28, 2016

alexrp commented Mar 29, 2016

dnfclas commented Apr 6, 2016

dnfclas commented Apr 11, 2016

alexrp commented Jun 6, 2016

Managed types in `stackalloc` declarations