ForEach-Object -Parallel disallows script blocks via $using:, unlike Start-ThreadJob #12378

mklement0 · 2020-04-18T16:41:27Z

Start-ThreadJob allows passing a script block by way of a $using: reference, which is a convenient way to make a function from the caller's scope available to the job:

PS> function foo { 'bar' }; Start-ThreadJob { & $using:function:foo } | Receive-Job -Wait -AutoRemoveJob
bar

That is, the foo function's script block ([scriptblock] instance containing the function body) reported by $function:foo (namespace variable notation) was referenced via $using:, allowing it to be called with &.
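As a small illustration of the namespace variable notation described above (a sketch only; the $body and $result variable names are introduced here purely for illustration), $function:foo evaluates to a [scriptblock] that can be inspected and invoked in the caller's own runspace:

```powershell
function foo { 'bar' }

# Namespace variable notation returns the function body as a [scriptblock].
$body = $function:foo
$body.GetType().Name    # ScriptBlock

# Invoking it with & in the same runspace runs the function body.
$result = & $body
$result                 # bar
```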

Unexpectedly, ForEach-Object -Parallel, whose behavior is generally very similar to Start-ThreadJob, explicitly disallows referencing [scriptblock]-typed values from the caller's scope via $using:

Note:

- The error message (see below) mentions "undefined behavior" - but wouldn't that apply to Start-ThreadJob as well? Is the answer to disallow script blocks in both?
  - If the concern is thread safety: that problem exists for all reference types passed via $using:, not just for script blocks that may form closures with outside values from the caller's scope.
- Provide option in ForEach-Object -parallel to transfer current runspace state #12240 proposes an opt-in mechanism for ForEach-Object -Parallel to allow copying the calling runspace's state to the parallel threads, which would allow for direct use of functions from the caller's scope; however, a lower-overhead way of being able to call functions selectively still seems beneficial.
- Loosely related: Consider deserializing serialized script blocks as such (remoting, background jobs) #11698

Steps to reproduce

function foo { 'bar' }; ForEach-Object -Parallel { & $using:function:foo } | Should -Be 'bar'

Expected behavior

The test should succeed.

Actual behavior

The test fails, because a statement-terminating error occurs:

A ForEach-Object -Parallel using variable cannot be a script block. 
Passed-in script block variables are not supported with ForEach-Object -Parallel, 
and can result in undefined behavior.

Environment data

PowerShell Core 7.1.0-preview.1

bpayette · 2020-04-19T18:48:55Z

@mklement0 ScriptBlocks are bound to the runspace where they were created so you can't reliably invoke them in a different runspace without regenerating the scriptblock in the new runspace. Fortunately regenerating the script block is pretty cheap using the existing (internal) APIs. I'm guessing @PaulHigin is doing this for thread jobs but not for foreach -parallel. Paul?

BTW - another thing to check out are classes. An instance of a PowerShell class is also affinitized to the runspace where it was created, so passing that instance to another runspace won't work reliably.

PaulHigin · 2020-04-20T16:05:13Z

As @bpayette states, the concern was script block affinity to the runspace in which it was created. As @mklement0 mentions, this should also be disallowed for Start-ThreadJob. And I agree with @mklement0 that with the new upcoming optional "replicate runspace state" feature, it may make sense to allow a scriptblock variable by re-creating it in the new runspace. I'll include that as part of the feature.

Thanks!

vexx32 · 2020-04-20T16:15:45Z

Users consistently seem to come to these kinds of features wanting to be able to transfer a scriptblock to the new runspace or process and have it run. This isn't possible to do directly due to those concerns, but perhaps there's something we can do around automatically serializing/deserializing the scriptblock as it crosses the runspace boundary when the session state isn't replicated.

Even if it's an opt-in via a new [SerializableScriptblock] sort of type that automatically re-hydrates the scriptblock (completely re-parsing if necessary, but if I recall correctly the AST can be passed around without these issues?) and binds itself to the new runspace it finds itself in, I think users will find that experience much more intuitive than simply preventing scriptblocks from being passed around at all.

PaulHigin · 2020-04-20T16:21:19Z

I don't think it makes sense to re-create a new scriptblock unless the runspace in which it is being created is replicated from the client script block, because it (the scriptblock) may be referencing variables, functions, modules that don't exist. But I see no reason not to do it into a replicated runspace, other than known variable multi-threading issues.

/cc @daxian-dbw

vexx32 · 2020-04-20T16:29:54Z

It might be, sure. But just as often as I see that, I'll see someone trying to throw the same self-contained script into a handful of different threads to parallelize a task.

For a case like that, I fail to see why permitting such a thing meets so much resistance. 😕 It just means you don't have to waste your time thinking about how you're going to get the script over to that runspace and recreate it.

mklement0 · 2020-04-20T16:36:59Z

I agree with @vexx32: simply transparently recreating the script block in the target runspace - whether it is a replicated one or not - is sensible default behavior that will cover a lot of use cases, notably the case where the script block is a function body or a similarly self-contained script block, as in the example in the OP.

All that is then needed is to document the behavior and limitations, in the context of the $using: specifier in the about_Remote_Variables topic.

I also think it is the sensible default behavior in remoting / background job scenarios: see #11698
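Until something like that is built in, a self-contained function can be recreated by hand. The following is a minimal sketch, assuming PowerShell 7+ and a function body that does not depend on other caller-scope state; the $fooBody and $results variable names are introduced here for illustration only. It passes the function's source text (a string, which has no runspace affinity) via $using: and redefines the function inside each parallel runspace:

```powershell
function foo { 'bar' }

# Pass the function body as a plain string instead of a [scriptblock].
$fooBody = $function:foo.ToString()

$results = 1..3 | ForEach-Object -Parallel {
    # Recreate the function in this thread's runspace from its source text.
    $function:foo = $using:fooBody
    foo
}

$results
```

Because only text crosses the runspace boundary, anything the function body references (other functions, variables, modules) must be recreated in the same way or be available in the parallel runspace already.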

bpayette · 2020-04-21T03:31:16Z

@PaulHigin IIRC you can quickly create a new script block by using the AST of the existing script block. There is a non-public API (or constructor?) on scriptblock to do this. I measured the performance and it was quite good. This will be especially important if you're implementing a Runspace.Clone(), as you'll have to do this for every function being copied to the new runspace.

vexx32 · 2020-04-21T04:21:15Z

Not sure about scriptblock itself, but ScriptBlockAst has a GetScriptBlock() method, at least. 🙂

Viajaz · 2021-10-14T05:17:31Z

Maybe a different approach but perhaps ForEach-Object -Parallel with -AsThreadJob similar to that described in #10841 ?

MicrosoftDocs/PowerShell-Docs#4977 implies that ForEach-Object -Parallel -AsJob is a ThreadJob but it's a PSTaskJob isn't it?

Docs imply something similar: https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_thread_jobs?view=powershell-7.1#thread-jobs-and-variables
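One way to check this locally is to compare the runtime types of the two kinds of job. This is only a sketch, assuming PowerShell 7+ with the bundled ThreadJob module; the $parallelJob and $threadJob variable names are illustrative, and the printed type names are deliberately not asserted here:

```powershell
# Start one job of each kind.
$parallelJob = 1..2 | ForEach-Object -Parallel { $_ } -AsJob
$threadJob = Start-ThreadJob { 'done' }

# Print the concrete .NET type of each job object for comparison.
$parallelJob.GetType().FullName
$threadJob.GetType().FullName

# Clean up both jobs, whether or not they have finished.
$parallelJob, $threadJob | Remove-Job -Force
```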

segraef · 2022-02-17T08:37:00Z

Hi there, #16564 fixed #16445 but is it part of v7.3.0-preview.1 or 7.2.1?

PaulHigin · 2022-02-17T17:01:46Z

The #16564 fix went into the v7.3.0-preview branch. However, it is marked for consideration for backporting to v7.2.x.

kasini3000 · 2022-08-05T04:39:26Z

Hi, who adds the backport label?
From my testing, PS 7.2 and PS 6.2 have this problem,
but PS 7.0 and PS 7.1 work normally.

ImportTaste · 2023-08-04T19:14:23Z

For a case like that, I fail to see why permitting such a thing meets so much resistance. 😕 It just means you don't have to waste your time thinking about how you're going to get the script over to that runspace and recreate it.

Yes, very much so.

Here's a way to still do it:

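function test { 'asdf' }
$foo = { 'bar' }

# Collect variables that have neither attributes nor a description,
# and functions that do not belong to a module.
$pVars = (Get-Variable).Where{ -not ($_.Attributes -or $_.Description) }
$pFuncs = (Get-Command -Type Function).Where{ -not $_.Module }

1..20 | ForEach-Object -Parallel {
    $pVars = $using:pVars
    $pFuncs = $using:pFuncs

    # $foo's script block is reached via its PSVariable wrapper rather than directly via $using:
    $foo = $pVars.Where{ $_.Name -eq 'foo' }[0].Value
    # Redefine the test function in this runspace from its source text.
    $function:test = $pFuncs.Where{ $_.Name -eq 'test' }[0].Definition

    & $foo
    test
}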

@PaulHigin IIRC you can quickly create a new script block by using the AST of the existing script block. There is a non-public API (or constructor?) on scriptblock to do this. I measured the performance and it was quite good. This will be especially important if you're implementing a Runspace.Clone(), as you'll have to do this for every function being copied to the new runspace.

Not sure about scriptblock itself, but ScriptBlockAst has a GetScriptBlock() method, at least. 🙂

Could either of you give me an example of how to use them in a PowerShell script with ForEach-Object -Parallel?

mklement0 added the Issue-Question label Apr 18, 2020

PaulHigin mentioned this issue Apr 20, 2020

Provide option in ForEach-Object -parallel to transfer current runspace state #12240

Open

mklement0 mentioned this issue May 27, 2020

ForEach-Object -Parallel situationally drops pipeline input #12801

Closed

This was referenced Dec 3, 2021

ForEach-Object -Parallel does not accept a scriptblock variable containing a $using #16445

Closed

Unable to use using keyword in a ScriptBlock instantiated via [ScriptBlock]::create in PowerShell 7.2.0 with ForEach-Object -Parallel #16551

Closed

daxian-dbw mentioned this issue Dec 10, 2021

Fix ForEach-Object -Parallel when passing in script block variable #16564

Merged

22 tasks

jborean93 mentioned this issue Aug 4, 2022

ForEach-Object Parallel Splat with ScriptBlock unable to access $using variables in pwsh 7.2.5 #17848

Closed

5 tasks

iSazonov added the Resolution-Fixed label Aug 5, 2022

iSazonov closed this as completed Aug 5, 2022

mklement0 mentioned this issue Mar 17, 2023

In background jobs, passing a [scriptblock]::Create()-created script block to a cmdlet fails #18456

Closed

5 tasks

mklement0 mentioned this issue Oct 2, 2023

In remoting calls, combining the $using: scope with function: namespace variable notation is quietly ignored or causes syntax errors, unlike in background jobs. #20422

Open

5 tasks
