KED Lexer Documentation

Introduction

The KED lexer is a lexical analyzer for the KED domain-specific language (DSL), which uses Cork slang as its syntax. It is implemented in Go and was developed test-first (TDD) to keep it correct and reliable.

Lexer Architecture

The KED lexer is composed of the following components:

Tokenization: The lexer breaks the input text into individual tokens, each representing a meaningful element of the language, such as identifiers, keywords, operators, and punctuation.

State Machine: A finite state machine guides the tokenization process, transitioning between states based on the input characters. Each state corresponds to a specific token type.

State Transition Table: The state transition table defines the transitions between states, allowing the lexer to identify the correct token type for each sequence of input characters.
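
A minimal sketch of how a state machine and its transition table can fit together is shown here. The state names, character classes, and the `step` function are assumptions made for illustration and are not taken from the KED source; the real lexer may organise its states differently.

```go
package main

import "fmt"

// State is a node in a hypothetical lexer state machine.
type State int

const (
	Start    State = iota // between tokens
	InIdent               // inside an identifier or keyword
	InNumber              // inside an integer literal
	InOp                  // just read a one-character operator
	Dead                  // no valid transition from here
	numStates
)

// Class groups input bytes so the table needs only a few columns.
type Class int

const (
	Letter Class = iota
	Digit
	Operator
	Space
	Other
	numClasses
)

// classify maps a byte to its character class (ASCII only in this sketch).
func classify(c byte) Class {
	switch {
	case c == '_' || (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z'):
		return Letter
	case c >= '0' && c <= '9':
		return Digit
	case c == '+' || c == '-' || c == '*' || c == '/' || c == '=':
		return Operator
	case c == ' ' || c == '\t' || c == '\n' || c == '\r':
		return Space
	}
	return Other
}

// transitions[s][c] is the state reached from state s on character class c.
var transitions = [numStates][numClasses]State{
	Start:    {Letter: InIdent, Digit: InNumber, Operator: InOp, Space: Start, Other: Dead},
	InIdent:  {Letter: InIdent, Digit: InIdent, Operator: Dead, Space: Dead, Other: Dead},
	InNumber: {Letter: Dead, Digit: InNumber, Operator: Dead, Space: Dead, Other: Dead},
	InOp:     {Letter: Dead, Digit: Dead, Operator: Dead, Space: Dead, Other: Dead},
	Dead:     {Letter: Dead, Digit: Dead, Operator: Dead, Space: Dead, Other: Dead},
}

// step returns the next state for the current state and input byte.
func step(s State, c byte) State {
	return transitions[s][classify(c)]
}

func main() {
	s := Start
	for _, c := range []byte("x1") {
		s = step(s, c)
	}
	fmt.Println("ended in InIdent:", s == InIdent)
}
```

In this sketch a transition to `Dead` marks the point where the current token cannot be extended, which is where a lexer would emit the token and return to `Start`.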

Tokenization Process

The KED lexer tokenizes the input text according to the following steps:

Initial State: The lexer starts in the initial state, ready to process the input characters.
Character Examination: For each input character:
- Character Mapping: Check if the character matches any of the patterns associated with a token type.
- State Transition: If a match is found, transition to the corresponding state. If no match is found, remain in the current state.
- Token Generation: If the state is a terminal state, indicating the end of a token, generate a token object with the corresponding type and lexeme (the string of characters comprising the token).
Output: Collect and return all generated tokens.
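
The steps above can be sketched as a single scanning loop. This is a simplified illustration, written as a hand-coded loop rather than a table lookup; the token categories, the `Token` struct, and the `Tokenize` function are assumed names, and the sketch does not use the actual KED keyword set.

```go
package main

import "fmt"

// TokenType names the category of a token. These categories are
// illustrative, not the actual KED token set.
type TokenType string

const (
	Ident    TokenType = "IDENT"
	Int      TokenType = "INT"
	Operator TokenType = "OP"
	Illegal  TokenType = "ILLEGAL"
)

// Token pairs a type with its lexeme, the exact source text it covers.
type Token struct {
	Type   TokenType
	Lexeme string
}

func isLetter(c byte) bool {
	return c == '_' || (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z')
}
func isDigit(c byte) bool { return c >= '0' && c <= '9' }
func isSpace(c byte) bool { return c == ' ' || c == '\t' || c == '\n' || c == '\r' }
func isOp(c byte) bool    { return c == '+' || c == '-' || c == '*' || c == '/' || c == '=' }

// Tokenize scans input left to right and returns the tokens it finds.
func Tokenize(input string) []Token {
	var tokens []Token
	i := 0
	for i < len(input) {
		c := input[i]
		start := i // each iteration begins in the initial state
		switch {
		case isSpace(c):
			i++ // whitespace separates tokens and produces none
			continue
		case isLetter(c):
			// Consume the longest run of letters and digits.
			for i < len(input) && (isLetter(input[i]) || isDigit(input[i])) {
				i++
			}
			tokens = append(tokens, Token{Type: Ident, Lexeme: input[start:i]})
		case isDigit(c):
			for i < len(input) && isDigit(input[i]) {
				i++
			}
			tokens = append(tokens, Token{Type: Int, Lexeme: input[start:i]})
		case isOp(c):
			i++
			tokens = append(tokens, Token{Type: Operator, Lexeme: input[start:i]})
		default:
			// Unknown bytes are reported instead of being silently dropped.
			i++
			tokens = append(tokens, Token{Type: Illegal, Lexeme: input[start:i]})
		}
	}
	return tokens
}

func main() {
	for _, tok := range Tokenize("total = count + 42") {
		fmt.Printf("%s %q\n", tok.Type, tok.Lexeme)
	}
}
```

Emitting an `Illegal` token for unrecognised input is one possible design choice; it lets a later stage report the error with its position instead of stopping the scan early.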

TDD Approach

A test-driven development (TDD) approach was employed to ensure the correctness and reliability of the lexer. This involved the following steps:

Write a failing test: Define a test case that describes the expected behavior of the lexer for a specific input sequence.
Write minimum code: Implement the minimum amount of code necessary to pass the test case.
Refactor code: Improve the readability and maintainability of the code without changing its functionality.
Refactor tests: Ensure that the tests remain valid and effective after each refactoring step.
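
One turn of this cycle might look like the following sketch. It uses a table of cases checked by a plain helper so that it runs as a standalone program; under `go test` the same table would normally be looped over inside a `TestXxx(t *testing.T)` function that reports failures with `t.Errorf`. The names `Lexemes`, `testCase`, `runCases`, and `nextCase` are hypothetical and do not come from the KED repository.

```go
package main

import (
	"fmt"
	"strings"
)

// Lexemes is a deliberately minimal first implementation: just enough
// code to satisfy the passing cases below, and nothing more.
func Lexemes(input string) []string {
	return strings.Fields(input)
}

// testCase describes one expected behaviour for a specific input.
type testCase struct {
	name  string
	input string
	want  []string
}

// cases are the tests written so far; the minimal code passes all of them.
var cases = []testCase{
	{"empty input", "", nil},
	{"single word", "x", []string{"x"}},
	{"assignment", "x = 1", []string{"x", "=", "1"}},
	{"extra whitespace", "  x   =\t1 ", []string{"x", "=", "1"}},
}

// nextCase is the next failing test: it demands behaviour (splitting
// without spaces) that the minimal implementation does not have yet.
var nextCase = testCase{"no spaces", "x=1", []string{"x", "=", "1"}}

// equal reports whether two slices hold the same strings in order.
func equal(a, b []string) bool {
	if len(a) != len(b) {
		return false
	}
	for i := range a {
		if a[i] != b[i] {
			return false
		}
	}
	return true
}

// runCases returns one message per failing case; an empty result
// means every case passed.
func runCases(cs []testCase) []string {
	var failures []string
	for _, tc := range cs {
		if got := Lexemes(tc.input); !equal(got, tc.want) {
			failures = append(failures,
				fmt.Sprintf("%s: got %q, want %q", tc.name, got, tc.want))
		}
	}
	return failures
}

func main() {
	fmt.Printf("existing cases failed: %d\n", len(runCases(cases)))
	for _, f := range runCases([]testCase{nextCase}) {
		fmt.Println("FAIL", f)
	}
}
```

The failing `nextCase` is the prompt for the next small change to the implementation; once it passes, it joins `cases` and guards against regressions during refactoring.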

GitHub Actions Workflow Pipeline

A GitHub Actions workflow automatically checks the lexer for each commit pushed to the repository. The pipeline consists of the following steps; steps marked TODO are planned but not yet implemented:

Linting: Lint the Go source code to identify potential syntax errors and stylistic issues. (TODO)
Unit Testing: Execute all unit tests to ensure that the lexer correctly identifies and categorizes tokens.
Code Coverage: Measure the percentage of the code that is exercised by the unit tests. (TODO)
Static Analysis: Perform static analysis to identify potential security vulnerabilities and code quality issues. (TODO)

This workflow pipeline helps to maintain code quality, prevent regressions, and ensure that the lexer meets the required standards.

Conclusion

The KED lexer is the lexical analyzer for the KED DSL, developed test-first and checked by an automated GitHub Actions pipeline. It is an early stage of the language processing pipeline: the tokens it produces are what make the interpretation and manipulation of KED code possible.
