New rule: using mutable structs in readonly fields #2811

mareklinka · 2019-09-04T11:08:17Z

A split off issue from corefx/issues/40739 to track the suggested rule

readonly SpinLock fields. SpinLock is a mutable struct, meant only for advanced scenarios. Accidentally making a SpinLock field readonly can result in silent but significant problems, as any mutations to the instance (e.g. Enter, Exit) will be done on a compiler-generated copy and thus be ignored, making the lock an expensive nop. (It might make sense to extend this analyzer to additional mutable struct types where storing them in a readonly field is likely a bug, e.g. GCHandle.)

Detecting this should be easy - for each class/struct find all fields that are readonly and of a type that is designed to be mutable (for starters, the GCHandle and SpinLock).

Questions:

Can we think of more types that should be included in this list?
An automated fix should just remove the readonly modified from the field, right?
What would be the default severity for the rule? As there is a rather large chance this situation will lead to a bug, is a warning sufficient?
Are there any edge-cases where having a readonly mutable-by-design struct might actually be desired?

The text was updated successfully, but these errors were encountered:

stephentoub · 2019-09-04T13:33:32Z

An automated fix should just remove the readonly modified from the field, right?

Yes

As there is a rather large chance this situation will lead to a bug, is a warning sufficient?

For SpinLock, I can't think of any real situation where it's not definitively a bug. Calling Enter will report that the lock was successfully entered, but the SpinLock itself will have no record of that, which means everyone trying to enter the lock will do so successfully, and the lock will effectively become a silent nop... not what you want for something that's meant to ensure thread safety. If the SpinLock was constructed with its default ctor or with enableThreadOwnerTracking:true as its ctor argument, then at least Exit will end up throwing an exception (because the SpinLock will think the calling thread doesn't actually own the lock), but a) the damage may already be done, b) Exit won't throw if enableThreadOwnerTracking:false was passed to the ctor (which is the high-performance thing to do and thus what most folks using SpinLock do, at least in release builds), and c) regardless it's a bug.

For GCHandle, it may not be a real buge.g. if it's stored into an object that needs access to what's in the handle, but, for example, the handle will never be Free'd, then you could imagine readonly being used with it, though I still think a warning would be useful.

Are there any edge-cases where having a readonly mutable-by-design struct might actually be desired?

Yes. If the object storing the struct has no plans or need to mutate it and is just reading from it, you could imagine someone wanting to make it readonly. But I think that's still worth a suggestion/warning/whatever default level is chosen. However, this is also why IMO any such rule should focus only on a specific list of hand-picked types, rather than, say, warning on any struct that's not readonly or whose fields aren't all readonly... that would almost certainly result in a ton of false positives.

mareklinka · 2019-09-05T07:14:57Z

What package/project should this go in? I don't think this is an FxCop rule so maybe Microsoft.CodeQuality.Analyzers or Microsoft.NetCore.Analyzers?

mareklinka · 2019-09-06T11:36:21Z

I have started looking into the implementation here and came up with a few more questions:

Is there a rule/convention for creating RuleIds? What ID should we give to this analyzer?
The rule will need to check if a field is of a specific type. The easiest option I found to get the SpinLock or GCHandle types in the analyzer is to call context.Compilation.GetTypeByMetadataName("System.Threading.SpinLock"). However, in here there is a similar approach used, with the addition of a fallback strategy that tries to find the type not by its metadata name but by going through namespaces manually. I think this is a fallback for situations where the type in question is "replaced" by a local implementation. The question is, do we need to do something similar?

mavasani · 2019-09-06T16:55:49Z

Is there a rule/convention for creating RuleIds? What ID should we give to this analyzer?

We generally follow the below steps:

Decide the applicable "category" for the new rule. See https://github.com/dotnet/roslyn-analyzers/blob/master/src/Utilities/Compiler/DiagnosticCategoryAndIdRanges.txt for current categories, and the CAxxxx IDs reserved for each category. Note that DiagnosticCategoryAndIdRanges.txt is fed as an additional file into an analyzer that runs on our repo, that verifies there are no duplicate IDs in each analyzer package and the ID is in the registered diagnostic descriptor's range for the provided category.
Refer to the current documentation/reference section for all CA rules by the chosen category over here: https://docs.microsoft.com/en-us/visualstudio/code-quality/code-analysis-for-managed-code-warnings?view=vs-2019. For example, this rule will likely fall in the "Performance" category, so you can navigate to https://docs.microsoft.com/en-us/visualstudio/code-quality/performance-warnings?view=vs-2019. You can find that CA1824 is the last documented rule ID in this range, and we also have couple of doc issues on that page for very recently added rules CA1825 and CA1826 for which we documentation still needs to be added. You can also search for the CA ID in the repo to confirm it is not being already used.
Once the rule is merged, it is expected that you file an issue on the docs page for the rules category to ensure it gets documented. For example, Add documentation for new Performance rule CA1825 (AvoidZeroLengthArrayAllocations) in FxCop analyzers MicrosoftDocs/visualstudio-docs#3454.

The rule will need to check if a field is of a specific type. The easiest option I found to get the SpinLock or GCHandle types in the analyzer is to call context.Compilation.GetTypeByMetadataName("System.Threading.SpinLock"). However, in here there is a similar approach used, with the addition of a fallback strategy that tries to find the type not by its metadata name but by going through namespaces manually. I think this is a fallback for situations where the type in question is "replaced" by a local implementation. The question is, do we need to do something similar?

I would recommend just using GetTypeByMetadataName. The workaround/fallback strategy was primarily added for ImmutableArray due to the fact that it exists in couple of different assemblies and the test framework was including both of them. It will likely be removed in future.

mavasani · 2019-09-06T17:00:22Z

I don't think this is an FxCop rule so maybe Microsoft.CodeQuality.Analyzers or Microsoft.NetCore.Analyzers?

FxCopAnalyzers package does not have any rules by itself. It is just a master package that includes both Microsoft.CodeQuality.Analyzers and Microsoft.NetCore.Analyzers as dependencies for ease of install and migration for legacy FxCop users. I hope the description over https://github.com/dotnet/roslyn-analyzers#microsoftcodequalityanalyzers and https://github.com/dotnet/roslyn-analyzers#microsoftnetcoreanalyzers helps you decide the correct package to choose.

Bottom line is analyzers related to purely code quality improvements, but are not specific to any API will go into Microsoft.CodeQuality.Analyzers. Analyzers specific to usage of any API would go into Microsoft.NetCore.Analyzers. A good rule of thumb is that if your analyzer needs to invoke GetTypeByMetadataName, then most likely it is an API specific analyzer and belongs to Microsoft.NetCore.Analyzers.

Based on the questions from dotnet#2811

mavasani · 2019-09-06T17:57:58Z

FYI: I submitted a PR to document the above guidelines in the repo.

mavasani · 2019-09-06T19:25:16Z

Guidelines are now at https://github.com/dotnet/roslyn-analyzers/blob/master/GuidelinesForNewRules.md

mareklinka · 2019-09-07T08:00:18Z

Awesome, thanks for the guidance!

dotnet#2811

mavasani · 2019-09-08T18:30:59Z

Reading the comments above another time, it seems like this is not a performance rule, but indicates a very likely functional bug. Is that correct? If so, this should likely not be a Performance rule, but must have a different category and rule ID. Thoughts?

mareklinka · 2019-09-08T18:36:33Z

Yeah, I'm on the fence here. On one side, using the readonly modifier degrades performance because of the shadow copying by the compiler. On the other, using a readonly SpinLock is pretty much a definite bug. However, for GCHandle it might make sense to have it readonly in situations when the class just gets it from outside and doesn't plan to modify/free it.

So it's kind of Performance and kind of Usage or Maintainability. Not sure which would be best but I agree the performance aspect is rather marginal here.

stephentoub · 2019-09-08T18:39:02Z

I don't think it should be performance. Can you come up with a benchmark where adding readonly measurably improves performance with a correct implementation?

mareklinka · 2019-09-08T18:43:40Z

Also a good point. New idea: since such a usage likely (at least for the SpinLock) constitutes a bug, maybe the best category would be Reliability?

stephentoub · 2019-09-08T19:40:58Z

What's the definition of the correctness category vs the reliability category?

sharwell · 2019-09-08T20:37:23Z

Ideally we would have the ability to mark SpinLock as non-copyable (dotnet/csharplang#859), which covers a much broader set of misuse cases than an analyzer only focused on read-only.

I would argue that for the rule in question here, the importance of non-copyable is sufficient to make that the target behavior of the rule instead of limiting it to whether or not the storage is readonly.

mareklinka · 2019-09-09T05:34:12Z

What's the definition of the correctness category vs the reliability category?

As far as I can see in the docs there is no correctness category. Which is consistent with the DiagnosticCategory enum which only has MicrosoftCodeAnalysisCorrectness (which I assume is specific to diagnostics and code fixes?).

From the description of the Reliability category, I think it might be a good fit:

Warnings that support library and application reliability, such as correct memory and thread usage.

carlossanlop · 2020-11-30T22:20:50Z

@mavasani I noticed this proposal was marked as approved in this repo. I'll remove the label since we want it to go through API review via dotnet/runtime#33773

Edit: Maybe we can make an exception with this one since the discussion is large and there's a PR out already. I'll wait for confirmation on the runtime issue.

mavasani added a commit to mavasani/roslyn-analyzers that referenced this issue Sep 6, 2019

Add documentation for guidance on contributing a new CA rule to the repo

154fd49

Based on the questions from dotnet#2811

mavasani mentioned this issue Sep 6, 2019

Add documentation for guidance on contributing a new CA rule to the repo #2824

Merged

mareklinka added a commit to mareklinka/roslyn-analyzers that referenced this issue Sep 8, 2019

Add an analyzer to check for readonly SpinLock and GChandle fields

4fcf7db

dotnet#2811

mareklinka added a commit to mareklinka/roslyn-analyzers that referenced this issue Sep 8, 2019

Add an analyzer to check for readonly SpinLock and GChandle fields

b4bd2f2

dotnet#2811

mareklinka linked a pull request Sep 8, 2019 that will close this issue

[WIP] Add rule for detecting certain readonly mutable structs #2831

Open

mavasani added Feature Request Needs-Review Approved-Rule Indicates if the new proposed rule has been approved to move to implementation phase and removed Needs-Review labels Sep 9, 2019

mavasani added the help wanted The issue is up-for-grabs, and can be claimed by commenting label Sep 9, 2019

mavasani added this to the Unknown milestone Sep 9, 2019

mavasani mentioned this issue Mar 2, 2020

Readonly Structs vs Classes have dangerous inconsistency - failed spin lock dotnet/roslyn#17310

Closed

Evangelink mentioned this issue Mar 20, 2020

CA1070: Do not use mutable struct in readonly field #3405

Closed

stephentoub mentioned this issue Mar 20, 2020

Do not hold SpinLock in fields marked as readonly dotnet/runtime#33773

Open

stephentoub mentioned this issue May 1, 2020

Improve HTTP/2 scaling performance dotnet/runtime#35694

Merged

carlossanlop removed the Approved-Rule Indicates if the new proposed rule has been approved to move to implementation phase label Nov 30, 2020

stephentoub mentioned this issue Apr 12, 2024

Regression in R2R compilation time dotnet/runtime#100995

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New rule: using mutable structs in readonly fields #2811

New rule: using mutable structs in readonly fields #2811

mareklinka commented Sep 4, 2019

stephentoub commented Sep 4, 2019

mareklinka commented Sep 5, 2019

mareklinka commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mareklinka commented Sep 7, 2019

mavasani commented Sep 8, 2019

mareklinka commented Sep 8, 2019 •

edited

stephentoub commented Sep 8, 2019

mareklinka commented Sep 8, 2019

stephentoub commented Sep 8, 2019

sharwell commented Sep 8, 2019 •

edited

mareklinka commented Sep 9, 2019

carlossanlop commented Nov 30, 2020 •

edited

New rule: using mutable structs in readonly fields #2811

New rule: using mutable structs in readonly fields #2811

Comments

mareklinka commented Sep 4, 2019

stephentoub commented Sep 4, 2019

mareklinka commented Sep 5, 2019

mareklinka commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mavasani commented Sep 6, 2019

mareklinka commented Sep 7, 2019

mavasani commented Sep 8, 2019

mareklinka commented Sep 8, 2019 • edited

stephentoub commented Sep 8, 2019

mareklinka commented Sep 8, 2019

stephentoub commented Sep 8, 2019

sharwell commented Sep 8, 2019 • edited

mareklinka commented Sep 9, 2019

carlossanlop commented Nov 30, 2020 • edited

mareklinka commented Sep 8, 2019 •

edited

sharwell commented Sep 8, 2019 •

edited

carlossanlop commented Nov 30, 2020 •

edited