Minishell

The minishell project requires students to build their own simple shell (using bash as reference) following the established guidelines and requirements. It is the first project of the 42 cursus that is developed with another student. To make this project happen, Manuel and I paired up and put our skills to the test.

The bulk of the work was divided as follows:

Miguel:

Parsing, Expansions, Redirections, Execution and Signal Handling

Manuel:

Built-ins, Environment Variables and Signal Handling

This Readme is still in development - everything here is subject to change so take what you see here 'with a grain of salt'.

Index

I. Usage

II.Features

III. Structure

Usage

In development

Features

In development

Built-in Functions

In development

Structure

INPUT → LEXER → SYNTAX CHECKER → PARSER → EXPANDER → REDIRECTIONS → EXECUTOR

In this README, we'll look into the following example to illustrate how the input is handled throughout our minishell.

<infile cat|grep "else if" | wc -l >>$USER$NOEXIST.txt

Lexer

The lexer's function is to tokenize the received input. It does this by separating the operators from any words — adding a space before and after each operator ( | , < , >, <<, >>) . Afterwards, it procceds to split each word into tokens, respecting single ( ' ) and double ( " ) quotes.

For easier understanding in the following segments, we'll classify these tokens as pipe, redirection and word tokens depending on their content.

Syntax Checker

After the input has been tokenized, it needs to be checked in order to validate that it can be interpreted by minishell. In order for an input to be considered valid, it must follow these rules:

The input cannot start with a pipe token;
After any redirection token, there must be a word token;
Pipe tokens in a row are not allowed;
The input cannot end with a pipe or redirection token;

If these rules are followed, the tokens are sent to the parser. Otherwise, an appropriate error message is displayed, and the program exits.

Parser

The parser assigns the tokens provided by the lexer (and checked by the syntax checker) to their respective place in a struct called data so, later on, it can be interpreted by the executor. These structs form a linked list, with each node of the list representing a command.

    struct data
    {
    	char		**args;		// array of strings for the command and arguments
    	int		fd[2];		// file descriptors of a pipe (assigned later)
    	char		**infile;	// array of strings for each file to be read from, stored in order
    	char		**inflag;	// array using "0" and "1" as flags depending of the type of infile
    	char		**outfile;	// same as infile but for files to write to
    	char		**outflag;	// same as inflag but for the outfiles
    	int		fd_in;		// for the fd of an infile to be stored
    	int		fd_out;		// for the fd of an outfile to be stored
    	int		pid;		// for the pid of the fork to be stored
    	struct s_data	*next;		// pointer to the next node of the list
    	struct s_data	*prev;		// pointer to the previous node of the list
    }

Tokens are assigned following these rules:

if an operator token is found and it is a redirection (<, >, << or >>), the following token is added to the corresponding infile or outfile array and a "flag string" is added to the inflag or outflag array — 0 if it's a simple redirection, 1 if it's n here-doc or append.
if a word token is found, it is added to the args array — where the first string of that array will be the command name and all others will be the command arguments.
if an operator token is found and it is a pipe ( | ) a new node is created, and the next and prev pointers are set accordingly.

data1							data2							data3
{							{							{
	char		**args = [cat]				char		**args = [grep, "else if"]		char		**args = [wc, -l]
	int		fd[2] = [0 , 1]				int		fd[2] = [0 , 1]				int		fd[2] = [0 , 1]
	char		**infile = [infile]			char		**infile = NULL				char		**infile = NULL
	char		**inflag = [0]				char		**inflag = NULL				char		**inflag = NULL
	char		**outfile = NULL			char		**outfile = NULL			char		**outfile = [$USER$NOEXIST.txt]
	char		**outflag = NULL			char		**outflag = [1]				char		**outflag = [1]
	int		fd_in = 0				int		fd_in = 0				int		fd_in = 0
	int		fd_out = 1				int		fd_out = 1				int		fd_out = 1
	int		pid = 0					int		pid = 0					int		pid = 0
	struct s_data	*next = &data2				struct s_data	*next = &data3				struct s_data	*next = NULL
	struct s_data	*prev = NULL				struct s_data	*prev = &data1				struct s_data	*prev = &data2
}							}							}

Expander

After the linked list is constructed, each string in the args array of each node, as well as in the infile and outfile arrays, is checked for single or double quotations and for expandable arguments — arguments starting with a $ that are not in between single quotes.

Any quoted string has its quotes removed, and if an expandable argument is found, it's checked against the environment variables. If it's found, the expandable argument is replaced with the value of the variable. If it isn't found, then it's removed (from the string if it's in one, or from the array if the expandable argument is the string).

data1							data2							data3
{							{							{
	char		**args = [cat]				char		**args = [grep, "else if"]		char		**args = [wc, -l]
	int		fd[2] = [0 , 1]				int		fd[2] = [0 , 1]				int		fd[2] = [0 , 1]
	char		**infile = [infile]			char		**infile = NULL				char		**infile = NULL
	char		**inflag = [0]				char		**inflag = NULL				char		**inflag = NULL
	char		**outfile = NULL			char		**outfile = NULL			char		**outfile = [mbraga-s.txt]
	char		**outflag = NULL			char		**outflag = [1]				char		**outflag = [1]
	int		fd_in = 0				int		fd_in = 0				int		fd_in = 0
	int		fd_out = 1				int		fd_out = 1				int		fd_out = 1
	int		pid = 0					int		pid = 0					int		pid = 0
	struct s_data	*next = &data2				struct s_data	*next = &data3				struct s_data	*next = NULL
	struct s_data	*prev = NULL				struct s_data	*prev = &data1				struct s_data	*prev = &data2
}							}							}

Executor

In development

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
libft		libft
srcs		srcs
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
main.c		main.c
minishell.h		minishell.h
readline.supp		readline.supp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Minishell

Index

Usage

Features

Built-in Functions

Structure

Lexer

Syntax Checker

Parser

Expander

Executor

About

Releases

Packages

Contributors 2

Languages

mbraga-s/minishell

Folders and files

Latest commit

History

Repository files navigation

Minishell

Index

Usage

Features

Built-in Functions

Structure

Lexer

Syntax Checker

Parser

Expander

Executor

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages