Use Count in Enumerable.Any if available #40377

stephentoub · 2019-08-16T17:14:38Z

We've been hesitant to make this change in the past, as it adds several interface checks which do show up in microbenchmarks (as is evidenced below).

However, wide-spread "wisdom" is that Any() is as fast or faster than Count() > 0, and there are even FxCop rules/analyzers that warn about using the latter instead of the former, but in its current form that can frequently be incorrect: if the source does implement ICollection<T>, generally its Count is O(1) and allocation-free, whereas Any() will almost always end up allocating an enumerator.

On balance, it seems better to just have Any() map closely to Count() so that their performance can be reasoned about in parallel. I'd like a second opinion, though. @cston? @ahsonkhan? @bartonjs?

using System.Collections.Generic;
using System.Linq;
using System.Runtime.CompilerServices;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;

[MemoryDiagnoser]
public class Program
{
    public static void Main(string[] args) => BenchmarkSwitcher.FromTypes(new[] { typeof(Program) }).Run(args);

    private static IEnumerable<int> Iterator() { yield return 1; }

    public IEnumerable<object[]> Sources()
    {
        yield return new object[] { "Empty", Enumerable.Empty<int>() };
        yield return new object[] { "Range", Enumerable.Range(0, 10) };
        yield return new object[] { "List", new List<int>() { 1, 2, 3 } };
        yield return new object[] { "int[]", new int[] { 1, 2, 3 } };
        yield return new object[] { "int[].Select", new int[] { 1, 2, 3 }.Select(i => i) };
        yield return new object[] { "int[].Select.Where", new int[] { 1, 2, 3 }.Select(i => i).Where(i => i % 2 == 0) };
        yield return new object[] { "Iterator", Iterator() };
        yield return new object[] { "Iterator.Select", Iterator().Select(i => i) };
        yield return new object[] { "Iterator.Select.Where", Iterator().Select(i => i).Where(i => i % 2 == 0) };
    }

    [Benchmark]
    [ArgumentsSource(nameof(Sources))]
    public void Any(string name, object source) => Unsafe.As<IEnumerable<int>>(source).Any();
}

produces:

Method	Toolchain	name	source	Mean	Allocated
Any	New	Empty	Syste(...)nt32] [42]	6.966 ns	-
Any	Old	Empty	Syste(...)nt32] [42]	5.421 ns	-

Any	New	Iterator	Progr(...)>d__1 [22]	20.192 ns	32 B
Any	Old	Iterator	Progr(...)>d__1 [22]	13.645 ns	32 B

Any	New	Iterator.Select	Syste(...)nt32] [76]	42.764 ns	88 B
Any	Old	Iterator.Select	Syste(...)nt32] [76]	35.661 ns	88 B

Any	New	Itera(...)Where [21]	Syste(...)nt32] [62]	74.852 ns	144 B
Any	Old	Itera(...)Where [21]	Syste(...)nt32] [62]	65.916 ns	144 B

Any	New	List	Syste(...)nt32] [47]	3.979 ns	-
Any	Old	List	Syste(...)nt32] [47]	12.500 ns	40 B

Any	New	Range	Syste(...)rator [36]	7.972 ns	-
Any	Old	Range	Syste(...)rator [36]	15.880 ns	40 B

Any	New	int[]	System.Int32[]	11.606 ns	-
Any	Old	int[]	System.Int32[]	9.594 ns	32 B

Any	New	int[].Select	Syste(...)nt32] [71]	8.505 ns	-
Any	Old	int[].Select	Syste(...)nt32] [71]	19.888 ns	48 B

Any	New	int[].Select.Where	Syste(...)nt32] [62]	59.662 ns	104 B
Any	Old	int[].Select.Where	Syste(...)nt32] [62]	48.749 ns	104 B

ps @adamsitnik, I could not figure out how to get the benchmark to take an IEnumerable<int>; everything I tried resulted in errors like error CS0266: Cannot implicitly convert type 'object' to 'System.Collections.Generic.IEnumerable<int>'. This is with benchmarkdotnet 11.5.

src/System.Linq/src/System/Linq/AnyAll.cs

bartonjs

The perf losses seem low, particularly when considered long-term (zero allocation means less GC noise, so the few ns spent in the if might still be better).

Perhaps it's also better to let the compiler further optimize things for people?

public static bool Any<T>(this ICollection<T> source);
public static int Count<T>(this ICollection<T> source);
...

?

stephentoub · 2019-08-16T18:40:21Z

Perhaps it's also better to let the compiler further optimize things for people?

https://github.com/dotnet/corefx/issues/7580

stephentoub · 2019-08-16T18:40:45Z

The perf losses seem low, particularly when considered long-term (zero allocation means less GC noise, so the few ns spent in the if might still be better).

Ok, thanks for weighing in.

ahsonkhan · 2019-08-16T23:48:24Z

The perf losses seem low, particularly when considered long-term (zero allocation means less GC noise, so the few ns spent in the if might still be better).

The usages that went down to zero alloc, generally also improved in runtime perf (within the microbenchmark results shared).
The usages that got slower don't benefit from the zero allocation (the allocations are the same), so there are no long-term savings to offset that.

However, I don't know how heavily Iterators usage is, so the small regression seems fine.

Edit: I just realized Iterator is just an IEnumerable<int> and not a type, so ignore that.

On balance, it seems better to just have Any() map closely to Count() so that their performance can be reasoned about in parallel.

In that case, why not go all in and implement Any() in terms of Count(), or are the savings from not using GetCount (for IIListProvider) when it isn't "cheap", worth the custom implementation?

src/System.Linq/src/System/Linq/AnyAll.cs

stephentoub · 2019-08-16T23:57:31Z

why not go all in and implement Any() in terms of Count()

Because that makes Any O(N) instead of O(1) when none of the interface are implemented, a common case.

ahsonkhan

LGTM

ahsonkhan · 2019-08-17T00:01:25Z

Because that makes Any O(N) instead of O(1) when none of the interface are implemented, a common case.

Ah, right, missed the iterator loop at the end (one iteration for any vs n for count).

We've been hesitant to make this change in the past, as it adds several interface checks. However, wide-spread "wisdom" is that `Any()` is as fast or faster than `Count() > 0`, and there are even FxCop rules/analyzers that warn about using the latter instead of the former, but in its current form that can frequently be incorrect: if the source does implement `ICollection<T>`, generally its `Count` is O(1) and allocation-free, whereas `Any()` will almost always end up allocating an enumerator. On balance, it seems better to just have `Any()` map closely to `Count()` so that their performance can be reasoned about in parallel.

adamsitnik · 2019-08-20T13:35:54Z

I could not figure out how to get the benchmark to take an IEnumerable<int>

I've fixed that in dotnet/BenchmarkDotNet#1228

stephentoub · 2019-08-20T13:37:09Z

I've fixed that

Thanks, @adamsitnik.

* Use Count in Enumerable.Any if available We've been hesitant to make this change in the past, as it adds several interface checks. However, wide-spread "wisdom" is that `Any()` is as fast or faster than `Count() > 0`, and there are even FxCop rules/analyzers that warn about using the latter instead of the former, but in its current form that can frequently be incorrect: if the source does implement `ICollection<T>`, generally its `Count` is O(1) and allocation-free, whereas `Any()` will almost always end up allocating an enumerator. On balance, it seems better to just have `Any()` map closely to `Count()` so that their performance can be reasoned about in parallel. * Add test coverage for Enumerable.Any Commit migrated from dotnet/corefx@9021bc1

Dotnet-GitSync-Bot added the area-System.Linq label Aug 16, 2019

cston approved these changes Aug 16, 2019

View reviewed changes

bartonjs reviewed Aug 16, 2019

View reviewed changes

src/System.Linq/src/System/Linq/AnyAll.cs Show resolved Hide resolved

bartonjs reviewed Aug 16, 2019

View reviewed changes

src/System.Linq/src/System/Linq/AnyAll.cs Show resolved Hide resolved

bartonjs approved these changes Aug 16, 2019

View reviewed changes

ahsonkhan reviewed Aug 16, 2019

View reviewed changes

src/System.Linq/src/System/Linq/AnyAll.cs Show resolved Hide resolved

ahsonkhan approved these changes Aug 16, 2019

View reviewed changes

stephentoub added 2 commits August 16, 2019 21:48

Add test coverage for Enumerable.Any

f47fce3

stephentoub force-pushed the anyinterfaces branch from 76097ee to f47fce3 Compare August 17, 2019 01:48

stephentoub merged commit 9021bc1 into dotnet:master Aug 17, 2019

stephentoub deleted the anyinterfaces branch August 17, 2019 23:49

adamsitnik mentioned this pull request Aug 20, 2019

Support IEnumerable as benchmark argument dotnet/BenchmarkDotNet#1228

Merged

stephentoub mentioned this pull request Aug 20, 2019

Fix CA1827 (DoNotUseCountWhenAnyCanBeUsed) violations in Roslyn.sln dotnet/roslyn#37959

Open

paulomorgado mentioned this pull request Aug 21, 2019

Exclude Enumerable.Count<TSource>(IEnumerable<TSource>) dotnet/roslyn-analyzers#2774

Closed

karelz added this to the 5.0 milestone Dec 19, 2019

This was referenced Feb 11, 2021

Detect unintended creation of an enumerator OctopusDeploy/RoslynAnalyzers#1

Merged

Added better Any() method and moved some extensions from Shared OctopusDeploy/CoreUtilities#5

Merged

MarcinZiabek mentioned this pull request Jul 30, 2022

Remove some casts on hot path by using .Count == 0 instead of .Any() QuestPDF/QuestPDF#221

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Count in Enumerable.Any if available #40377

Use Count in Enumerable.Any if available #40377

stephentoub commented Aug 16, 2019

bartonjs left a comment

stephentoub commented Aug 16, 2019

stephentoub commented Aug 16, 2019

ahsonkhan commented Aug 16, 2019 •

edited

Loading

stephentoub commented Aug 16, 2019

ahsonkhan left a comment

ahsonkhan commented Aug 17, 2019 •

edited

Loading

adamsitnik commented Aug 20, 2019

stephentoub commented Aug 20, 2019

Use Count in Enumerable.Any if available #40377

Use Count in Enumerable.Any if available #40377

Conversation

stephentoub commented Aug 16, 2019

bartonjs left a comment

Choose a reason for hiding this comment

stephentoub commented Aug 16, 2019

stephentoub commented Aug 16, 2019

ahsonkhan commented Aug 16, 2019 • edited Loading

stephentoub commented Aug 16, 2019

ahsonkhan left a comment

Choose a reason for hiding this comment

ahsonkhan commented Aug 17, 2019 • edited Loading

adamsitnik commented Aug 20, 2019

stephentoub commented Aug 20, 2019

ahsonkhan commented Aug 16, 2019 •

edited

Loading

ahsonkhan commented Aug 17, 2019 •

edited

Loading