Interpreter in Go

This project reflects the study of the book "Writing an Interpreter in Go", written by Thorsten Ball.

Language used - Monkey

"Without a compiler or an interpreter a programming language is nothing more than an idea or a specification."

Monkey has the following features:

  • C-like syntax
  • variable bindings
  • integers and booleans
  • arithmetic expressions
  • built-in functions
  • first-class and higher-order functions
  • closures
  • a string data structure
  • an array data structure
  • a hash data structure

Some examples of the language in use:

//some variables
let age = 1;
let name = "Monkey";
let result = 10 * (20 / 2);
//hash-map
let thorsten = {"name": "Thorsten", "age": 28};

Or something more complex:

let twice = fn(f, x) {
  return f(f(x));
};
let addTwo = fn(x) {
  return x + 2;
};
twice(addTwo, 2); // => 6

Parts of an interpreter:

  • the lexer
  • the parser
  • the Abstract Syntax Tree (AST)
  • the internal object system
  • the evaluator

The Lexer

let five = 5;
let ten = 10;
let add = fn(x, y) {
  x + y;
};
let result = add(five, ten);

From the example above we can identify tokens such as integers, keywords and variable names. We'll distinguish between types, keywords and identifiers (variable and function names), among others. We can specify these in our code by using constants: "ILLEGAL" will denote something we are not expecting and "EOF" will mark the end of the reading process.
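A minimal sketch of how such constants can be laid out in a token package (the names here are illustrative and may not match this repository exactly):

type TokenType string

type Token struct {
	Type    TokenType
	Literal string
}

const (
	ILLEGAL = "ILLEGAL" // a character or token we are not expecting
	EOF     = "EOF"     // marks the end of the reading process

	IDENT = "IDENT" // identifiers: add, result, x, y, ...
	INT   = "INT"   // integer literals: 5, 10, ...

	// a few of the keywords
	LET      = "LET"
	FUNCTION = "FUNCTION"
	RETURN   = "RETURN"
)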

The lexer will ignore whitespace since the Monkey language does not care about it. Underscores (_) are also supported as part of variable names.
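A rough sketch of how the lexer can implement both rules (the shape follows the book; the field and function names are assumptions):

// A minimal lexer shape, just enough to show the two rules.
type Lexer struct {
	input        string
	position     int  // current position in input
	readPosition int  // next position to be read
	ch           byte // character currently under examination
}

func (l *Lexer) readChar() {
	if l.readPosition >= len(l.input) {
		l.ch = 0 // 0 signals "end of input"
	} else {
		l.ch = l.input[l.readPosition]
	}
	l.position = l.readPosition
	l.readPosition++
}

// skipWhitespace advances past spaces, tabs and newlines; Monkey ignores them all.
func (l *Lexer) skipWhitespace() {
	for l.ch == ' ' || l.ch == '\t' || l.ch == '\n' || l.ch == '\r' {
		l.readChar()
	}
}

// isLetter treats '_' as a letter, which is what makes underscores
// valid inside identifiers.
func isLetter(ch byte) bool {
	return 'a' <= ch && ch <= 'z' || 'A' <= ch && ch <= 'Z' || ch == '_'
}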

The Parser

Basically, a parser turns its input into a data structure that represents that input, checking its correctness in the process. This component is responsible for reporting parser errors.

As an example:

var input = 'if (3 * 5 > 10) { return "hello"; } else { return "goodbye"; }';
var tokens = MagicLexer.parse(input);
MagicParser.parse(tokens);
{
  type: "if-statement",
  condition: {
    type: "operator-expression",
    operator: ">",
    left: {
      type: "operator-expression",
      operator: "*",
      left: { type: "integer-literal", value: 3 },
      right: { type: "integer-literal", value: 5 }
    },
    right: { type: "integer-literal", value: 10 }
  },
  consequence: {
    type: "return-statement",
    returnValue: { type: "string-literal", value: "hello" }
  },
  alternative: {
    type: "return-statement",
    returnValue: { type: "string-literal", value: "goodbye" }
  }
}

Abstract Syntax Tree

For this project, this data structure is basically just nodes connected to each other. It represents the parsed program: it mirrors the syntax almost exactly and omits irrelevant details like whitespace (these don't matter in our language). The implementation can be seen in the package ast.

ast_example

(image from 'Writing an interpreter in Go')

The image above is the representation of the expression ((1 + 2) + 3).
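The nodes that make up such a tree share a few small interfaces. A rough sketch of their shape, following the book's ast package (exact names may differ in this repository):

// Every node can report the literal of the token it was built from,
// which is mostly useful for debugging and tests.
type Node interface {
	TokenLiteral() string
}

// Statements and expressions are both nodes; the empty marker methods
// only exist so the type system keeps the two kinds apart.
type Statement interface {
	Node
	statementNode()
}

type Expression interface {
	Node
	expressionNode()
}

// Program is the root of every AST: a Monkey program is a list of statements.
type Program struct {
	Statements []Statement
}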

Parsing expressions

Parsing statements is fairly straightforward: we read from left to right, identify the keyword and from there parse the rest of the statement. Parsing expressions is a bit more complicated. One of the problems when building a parser is operator precedence. The fact that expressions can appear in many different positions is another problem we need to take care of, by applying a correct parsing procedure that is understandable and extensible from the beginning.
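A common way to deal with operator precedence (and the one used by the book's Pratt-style parser) is to give every operator a level of binding power; a sketch:

// Precedence levels, from weakest to strongest binding power.
const (
	_ int = iota
	LOWEST
	EQUALS      // ==
	LESSGREATER // > or <
	SUM         // +
	PRODUCT     // *
	PREFIX      // -x or !x
	CALL        // add(x, y)
)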

Parser structure insight:

The following struct represents the concept of a parser inside this interpreter.

type Parser struct {
	lxr    *lexer.Lexer
	errors []string

	curToken  token.Token
	peekToken token.Token

   [...]
}

The presence of a lexer is understandable for obvious reasons. Perhaps the two fields of type Token are not. These fields allow our parser to act like an iterator: the first points to the current token being looked at and the second allows us to make decisions based on what comes next.

cur_peek

(image from 'Writing an interpreter in Go')
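Advancing this pair is a simple one-step shift, roughly like the following sketch (it assumes the lexer exposes a NextToken method, as in the book):

// nextToken shifts the window: the token we were peeking at becomes the
// current one, and a fresh token is requested from the lexer.
func (p *Parser) nextToken() {
	p.curToken = p.peekToken
	p.peekToken = p.lxr.NextToken()
}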

It is also because of this two-token window that we can understand what kind of statement we are reading and parse it in the best way possible.
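A typical helper built on top of peekToken looks roughly like this (a sketch; the helper name and error wording are assumptions):

// expectPeek advances only when the next token has the expected type;
// otherwise it records a parser error and stays where it is.
func (p *Parser) expectPeek(t token.TokenType) bool {
	if p.peekToken.Type == t {
		p.nextToken()
		return true
	}
	msg := fmt.Sprintf("expected next token to be %s, got %s instead",
		t, p.peekToken.Type)
	p.errors = append(p.errors, msg)
	return false
}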

Parsing an expression routine:

diagram

We can see above a high-level representation of the flow for parsing an expression, the most complicated kind of parsing in our interpreter. Because of recursion we don't need many lines for each parsing function.

The registration of each parsing routine is done in the parser#registerparsingFns function. This allows a more general routine, parse#parseExpression, to index into the map of parsing functions with the given token type and retrieve the appropriate parsing function.
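A sketch of what that registration and lookup can look like (field and function names follow the book and are assumptions about this repository; it also assumes the parser struct carries a prefixParseFns map behind the [...] shown earlier):

type (
	prefixParseFn func() ast.Expression
	infixParseFn  func(ast.Expression) ast.Expression
)

// registerPrefix stores a parsing routine under the token type it handles.
func (p *Parser) registerPrefix(tokenType token.TokenType, fn prefixParseFn) {
	p.prefixParseFns[tokenType] = fn
}

// parseExpression indexes into the map with the current token type and
// delegates to whatever routine was registered for it; recursion inside
// those routines takes care of nested expressions.
func (p *Parser) parseExpression(precedence int) ast.Expression {
	prefix := p.prefixParseFns[p.curToken.Type]
	if prefix == nil {
		return nil
	}
	leftExp := prefix()
	// infix parsing, driven by precedence, would continue from here
	return leftExp
}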
