Shell substitutions with parentheses fail #118

arowla · 2013-12-24T19:00:41Z

The example below fails due to the closing parenthesis in the command, as the parser will consider the first closing parenthesis it finds to be the end of the shell substitution statement.

DATE := $(python -c "import datetime; print datetime.datetime.now()")

The text was updated successfully, but these errors were encountered:

myronahn · 2014-01-19T15:16:07Z

@dirtyvagabond @aboytsov @egamble I was working on this bug a bit and came across a big problem in the parser.

The monads in the parser are backtracking monads so they are not supposed to have side effects, however the monad that handles any shell substitution (command-sub) actually shells out right in the monad. When I was testing a fix to this bug, I noticed that it was shelling out twice because of the backtracking.

Unfortunately, being able to do command-line substitution right in the parser is critical for defining variables and in-line shell commands.

So this bug revealed a pretty deep problem. This may be the impetus we need to separate the lexer from the parser.

dirtyvagabond · 2014-01-21T19:27:07Z

@myronahn oh man, this sounds like a nice find. so... are you volunteering to do some hard core lexer/parser development?

aboytsov · 2014-01-21T19:53:20Z

I agree that a proper parser/lexer separation is the right way to go about fixing it. But in the meantime, we can easily make sure every shell substitution is executed once by maintaining a global hashmap that stores the result of each run (and caches it for backtracking). Thoughts?

egamble · 2014-01-21T19:55:53Z

I use memoization with fnparse in the parser for my language Timeless. It
works pretty well:
https://github.com/egamble/timeless/blob/master/src/clj/timeless/bootstrap/parser.clj

On Tue, Jan 21, 2014 at 11:53 AM, Artem Boytsov notifications@github.comwrote:

I agree that a proper parser/lexer separation is the right way to go about
fixing it. But in the meantime, we can easily make sure every shell
substitution is executed once by maintaining a global hashmap that stores
the result of each run (and caches it for backtracking). Thoughts?

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/118#issuecomment-32956284
.

myronahn · 2014-01-22T07:11:33Z

Good idea - hash map/memoization sounds good except for a couple of things:

I'm not sure if there is a chance that backtracking will cause the parser to choose an entirely different path, that is, it might try the shell expansion and, after backtracking, decide that it is actually not a shell expansion after all. I wouldn't be surprised if this were a possibility.
Backtracking also might cause the shell expansion to parse differently, or might cause a different portion of the text to be parsed as the shell expansion.

However, these are probably very rare cases that can be ignored for now until the full parser/lexer separation. I might be able to detect when this happens and error out - but chances are, it would error out anyway.

We'll probably have to memoize the line number (or something) with the shell command as well since it could be that the user actually wants to run the same shell command multiple times in different parts of the workflow.

I'll play around with this to see if I can come up with a quick fix.

Also memoize the calls to shell (along with file line/column) Clear memoize map when new workflow is run

dirtyvagabond · 2014-05-21T22:40:43Z

@myronahn is this close-able?

myronahn · 2014-05-26T17:03:01Z

@dirtyvagabond Well the immediate issue is hack-fixed but I wouldn't call it a long-term fix. The deeper issue I brought up is still an issue.

dirtyvagabond · 2014-05-27T00:36:44Z

@amalloy may find this interesting, we recently talked about the parser/lexer separation considerations

amalloy · 2014-05-27T17:59:28Z

@myronahn I'm curious what sort of input causes the parser to backtrack enough to execute a shell expansion twice. I don't see any rules that would obviously cause this sort of issue.

amalloy · 2014-05-29T01:05:01Z

Anyway, we should be able to parse these just as well as bash does, and support ' and " within the command list. Supporting nested shell escapes might be a bit ambitious, but is probably not impossible either?

myronahn · 2014-05-29T18:47:58Z

@amalloy It's been a while since I've worked on this, but if I'm not mistaken, it was the original test case at the beginning of this bug that caused the backtrack (and I believe I put it in the regression test suite as well).

Addresses #118.

amalloy · 2014-07-17T01:23:42Z

We've been able to handle shell expansions containing ) for a while; I've added a test confirming this. Avoiding double-expansion of ambiguous shell commands is still an open question, but I don't think it's part of issue #118, so we should close this one.

myronahn pushed a commit that referenced this issue Jan 22, 2014

Initial fix for #118 to handle quotes better in shell commands

7eaf49e

Also memoize the calls to shell (along with file line/column) Clear memoize map when new workflow is run

myronahn pushed a commit that referenced this issue Jan 22, 2014

For #118 removed some debug statements

da59d7f

amalloy added a commit that referenced this issue Jul 17, 2014

Add a test confirming that you can use ) in shell expansions.

7daa8be

Addresses #118.

amalloy closed this as completed Jul 17, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shell substitutions with parentheses fail #118

Shell substitutions with parentheses fail #118

arowla commented Dec 24, 2013

myronahn commented Jan 19, 2014

dirtyvagabond commented Jan 21, 2014

aboytsov commented Jan 21, 2014

egamble commented Jan 21, 2014

myronahn commented Jan 22, 2014

dirtyvagabond commented May 21, 2014

myronahn commented May 26, 2014

dirtyvagabond commented May 27, 2014

amalloy commented May 27, 2014

amalloy commented May 29, 2014

myronahn commented May 29, 2014

amalloy commented Jul 17, 2014

Shell substitutions with parentheses fail #118

Shell substitutions with parentheses fail #118

Comments

arowla commented Dec 24, 2013

myronahn commented Jan 19, 2014

dirtyvagabond commented Jan 21, 2014

aboytsov commented Jan 21, 2014

egamble commented Jan 21, 2014

myronahn commented Jan 22, 2014

dirtyvagabond commented May 21, 2014

myronahn commented May 26, 2014

dirtyvagabond commented May 27, 2014

amalloy commented May 27, 2014

amalloy commented May 29, 2014

myronahn commented May 29, 2014

amalloy commented Jul 17, 2014