Parser refactor #2121

dgnorton · 2015-03-30T21:49:13Z

The big change in this PR is how identifiers are scanned / parsed. The scanner used to treat "db"."rp".cpu as one identifer. It would be stored in a single string in the AST somewhere. The scanner API exposed SplitIdent and it was up to clients consuming the AST to split identifiers into segments, strip quotes from the segments, and figure out what each segment was (db?, rp?, measurement?). This PR simplifies things for client code by splitting identifiers and stripping quotes at the earliest stage (scanning) and changing the parser to store them in appropriate fields (db, rp, etc.) in the AST.

Change Scanner.scanIdent() to scan "db"."rp".cpu as separate tokens: IDENT, DOT, IDENT, DOT, IDENT instead of scanning it as a single IDENT
Add Parser.parseSegmentedIdents() to parse segmented (DOT separated) identifiers like "db"."rp".cpu and return them as an []string
Change AST types that held segmented identifiers in a string to have individual fields. Note that type VarRef still, for now, holds a single string. This may need to be changed.
Add type HasDefaultDatabase interface to ast.go. Types like type CreateContinuousQueryStatement that have a default database should implement this interface.
Change parsing functions that work with segmented identifiers to use the new parseSegmentedIdents() function and store segments in the appropriate AST fields.
Change QuoteIdent function to only quote segments that must be quoted.
Add IdentNeedsQuotes func that returns true if an identifier would require quotes
Change Server.ExecuteQuery to check for a default database passed by the caller. If none was provided, see if the statement implements the HasDefaultDB interface and get the default database from there. This fixes a bug in statement normalization.
Remove uses of influxql.SplitIdent function outside of the parser
Change Server.expandSources to always return expanded sources in sorted order
Change ErrDatabaseNotFound and ErrRetentionPolicyNotFound to be functions that report the name of the "not found" object and the error locust

otoolep · 2015-03-31T18:07:40Z

cmd/influxd/server_integration_test.go

@@ -177,7 +178,7 @@ func write(t *testing.T, node *Node, data string) {

 // query executes the given query against all nodes in the cluster, and verifies no errors occured, and
 // ensures the returned data is as expected
-func query(t *testing.T, nodes Cluster, urlDb, query, expected string) (string, bool) {
+func query(t *testing.T, nodes Cluster, urlDb, query, expected, expectPattern string) (string, bool) {


How about making this a vargs? Then all the callers won't need to be changed.

Perhaps we should have two versions of this function query and queryRegex. You could factor out the common code, but two functions might be cleaner.

Would we also need two versions of queryAndWait then? I wasn't crazy about adding the extra parameter but it seemed like the simplest solution. If we factor into two funcs, every caller has to do the if expected != "" check. No strong preference here...just don't want to add complication.

Yeah, wasn't sure about this. We can come back it.

otoolep · 2015-03-31T18:36:48Z

Took a first pass, will re-review again soon. I like the introduction of regex matching into the test suite, very handy. I'm not keen on the new HasDefaultDB interface though, it feels a bit awkward, though I am not in the code so it may be required. I wonder if there is another way, perhaps every statement just have this call, period.

otoolep · 2015-04-08T19:56:38Z

cmd/influxd/server_integration_test.go

+		queryDb       string  // If set, is used as the "db" query param.
+		queryOne      bool    // If set, only 1 node is queried.
+		expected      string  // If 'query' is equal to the blank string, this is ignored.
+		expectPattern string  // Regexp alternative to expected field.


Minor: document that this is ignored if expected is non-blank.

otoolep · 2015-04-08T20:23:12Z

OK, I took a look at this. It's obviously a very large change but most of it is adding extra variables to functions for example. The tests haven't changed that much, so the green build gives me confidence too.

otoolep · 2015-04-08T20:24:06Z

influxql/parser.go

+
+	vr := &VarRef{Val: strings.Join(segments, ".")}
+
+	if len(segments) > 2 {


Just to be clear, if we have more than 2 segments here, it's a problem? Normally we can have up to 3 segments. Which segment is not expected to be present?

Good catch. That's probably a mistake...not sure why I did that. Will review.

OK, cool. Might indicate that a unit test is needed to check a specific code path.

Parser refactor

dgnorton added the 2 - Working label Mar 30, 2015

otoolep reviewed Mar 31, 2015
View reviewed changes

dgnorton force-pushed the fix-2015 branch 2 times, most recently from 3a8bebc to 8e59277 Compare April 8, 2015 16:39

dgnorton changed the title ~~(WIP) Parser refactor~~ Parser refactor Apr 8, 2015

otoolep reviewed Apr 8, 2015
View reviewed changes

dgnorton force-pushed the fix-2015 branch 2 times, most recently from a52bb5e to 41b4a0f Compare April 9, 2015 17:20

refactor scanning & parsing of identifiers

25cea58

dgnorton force-pushed the fix-2015 branch from 41b4a0f to 25cea58 Compare April 9, 2015 17:21

toddboom added a commit that referenced this pull request Apr 9, 2015

Merge pull request #2121 from influxdb/fix-2015

ff15388

Parser refactor

toddboom merged commit ff15388 into master Apr 9, 2015

toddboom removed the 2 - Working label Apr 9, 2015

toddboom deleted the fix-2015 branch April 9, 2015 17:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parser refactor #2121

Parser refactor #2121

dgnorton commented Mar 30, 2015

otoolep Mar 31, 2015

otoolep Mar 31, 2015

dgnorton Mar 31, 2015

otoolep Mar 31, 2015

otoolep commented Mar 31, 2015

otoolep Apr 8, 2015

otoolep commented Apr 8, 2015

otoolep Apr 8, 2015

dgnorton Apr 8, 2015

otoolep Apr 8, 2015


		vr := &VarRef{Val: strings.Join(segments, ".")}

		if len(segments) > 2 {

Parser refactor #2121

Parser refactor #2121

Conversation

dgnorton commented Mar 30, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

otoolep commented Mar 31, 2015

Choose a reason for hiding this comment

otoolep commented Apr 8, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment