Distributed-Assignment-2

Members

Utsav Basu - 20CS30057
Anamitra Mukhopadhyay - 20CS30064
Ritwik Ranjan Mallik - 20CS10049
Anand Manojkumar Parikh - 20CS10007

How to run

Before starting, permissions for docker.sock have to be updated.

sudo chmod 777 <location of docker.sock> # usually the location is /var/run/docker.sock

There is a Makefile in the ./src directory. To start everything:

make

To stop everything:

make stop

Analysis

For A1-A3, the read times and write times are checked on two different machines. The payloads can be found inside read_write_testing.

A1

	Read Time (in s)	Write Time (in s)
Machine 1	366.13	548.40
Machine 2	196.46	576.48

A2

	Read Time (in s)	Write Time (in s)
Machine 1	368.24	1107.88
Machine 2	200.48	1140.94

A3

	Read Time (in s)	Write Time (in s)
Machine 1	373.65	1021.17
Machine 2	188.34	1124.97

A4

The prerequisites and sample request-reponse pairs for the endpoints can be found inside endpoint_testing.

Data Generation

In the ./src/client/data directory, there are two files fnames.txt and lnames.txt. fnames.txt contains 942 unique first names and lnames.txt contains 995 unique last names. Using these names, generate.py creates 942 * 995 = 937290 data entries with marks generated unformly at random in the range from 1 to 100.

Main Libraries Used

Quart

There are two major specifications for interfacing web applications with web servers: WSGI (Web Server Gateway Interface) and ASGI (Asynchronous Server Gateway Interface). WSGI is a synchronous interface, meaning that it handles one request at a time per process or thread. On the other hand, ASGI supports handling multiple requests concurrently without blocking. Since the load balancer should be able to handle as many as 10,000 concurrent requests, an web application based on WSGI such as Flask is not suitable. Rather, Quart, a framework built on top of Flask supporting ASGI servers is a better choice.

asyncio

In python, asyncio is a library to write concurrent code using async/await syntax. Many asynchronous python frameworks that provide high-performance network and web-servers, database connection libraries, distributed task queues etc. are built on top of asyncio.

aiohttp

The aiohttp library is used for bulilding asynchronous HTTP client/server for asyncio and Python. It supports both Server WebSockets and Client WebSockets.

aiodocker

The aiodocker library is a simple docker HTTP API wrapper written with asyncio and aiohttp. Rather than making system calls inside the load balancer to perform docker related tasking like adding or removing server containers, aiodocker exposes functions with loads of flexibilities to perform these tasks.

fifolock

This library is used for controlling concurrent read/writes to two important data structures used in the application. One of these data structures is used for mapping requests to virtual servers by consistent hasing. The other maintains the count of failed heartbeats for the servers.

asyncpg

Asyncpg is a high-performance, asynchronous PostgreSQL database client library for Python. It is designed to provide efficient and low-latency communication with a PostgreSQL database by leveraging the power of Python's asyncio framework.

Design Choices

Different design choices have been made throughout the application, keeping in mind efficiency, functionality and understandability. Some of these are given below:

Threading vs Asyncio

Both threading and asyncio libraries in Python support concurrent programming. However, choice was made to use the asyncio library due to the following reasons:

Due to the Global Interpreter Lock, only one thread can execute Python code at once. This means that for most Python 3 implementations, the different threads do not actually execute at the same time, they merely appear to.
Threads are used to perform preemptive multi-tasking where the scheduler decides which threads to run and when. On the other hand, asyncio model allows cooperative multi-tasking where the user decides which tasks to run and when.

Cooperative Multitasking

The application runs several taks asynchronously. These are:

spawn_container: This task is used to spawn the server containers. The load balancer maintains a total of N servers at a time. If some server goes down, the load balancer detects it and spawns a new server.
remove_container: This task is used to remove the spawned server containers. The load balancer sends a signal SIGINT to the running container to stop it. If the container does not stop within STOP_TIMEOUT, it sends a SIGKILL signal to remove the container altogether.
get_heartbeat: This task is used to periodically get the heartbeats of the server containers. Whenever a new server container is added, an entry is initialized as 0. This maintains the number of failed heartbeats for that container. The load balancer checks the heartbeats at an interval of HEARTBEAT_INTERVAL seconds. Upon not receiving the heartbeat for a server, the corresponding entry is increased. When this entry reaches MAX_FAIL_COUNT, the server is assumed to be down and is restarted. This task is run as background task using Quart.
handle_flatline: This task is used to handle the flatline of the servers. When a server becomes unresponsive and is assumed to be down, this task respawns a new server. This new server replaces the failed server.

A cooperation takes place whenever there are multiple tasks to perform simultaneously. For docker related tasks such as spawning and removing a container, the set of tasks are added to to a pool and processed in batches of size DOCKER_TASK_BATCH_SIZE. This is done using a semaphore initialized to DOCKER_TASK_BATCH_SIZE. For http requests, similarly, the set of requests are processed in batches of size REQUEST_BATCH_SIZE. This is done using a semaphore initialized to REQUEST_BATCH_SIZE. Cooperation is ensured by calling await asyncio.sleep(0) inside a task in appropiate places, which gives other tasks a chance to run before it itself starts executing.

Dockerfile

Some design choices have been made during dockerizing the application. These are as follows:

The default python image is based on the Ubuntu image which installs some unnecessary packages not needed for our application. Instead, the python image based on the Alpine image is used. This image is much lighter than the earlier one.
Multistage build have been used during dockerization. This is done to reduce the image sizes for the load balancer and the server as much as possible.
Both the load balancer and server containers are deployed via a shell script deploy.sh. postgres is first run as a background task and the main process waits until it starts, using a variable pg_isready. When postgres is up and running, the main process runs the actual python file as a background task - load_balancer.py for load balancer and server.py for servers. Signal handlers are modified such that any SIGTERM or SIGINT signal to the main process is sent to the two child processes as well. The main process then waits for the two child processes.

The Cat-Dat-Vat Algorithm

Preliminaries

For ensuring distributed database consistency, the Cat-Dat-Vat algorithm is used. The name is derived from three different indices used to ensure consistency - created_at (cat), deleted_at (dat) and valid_at (vat). A server maintains two tables:

StudT(stud_id: INTEGER, stud_name: TEXT, stud_marks: INTEGER, shard_id: TEXT, created_at: INTEGER, deleted_at: INTEGER)
TermT(shard_id: TEXT, term: INTEGER)

The load balancer maintains a single table:

ShardT(stud_id_low: INTEGER, shard_id: TEXT, shard_size: INTEGER, valid_at: INTEGER)

For every shard, both the servers and the load balancer maintains a notion of time - term for servers and valid_at for the load balancer. Instead of running a distributed leader election algorithm, the load balancer is regarded as the leader and its valid_at value for a shard is assumed to be always correct. When a shard is created for the first time, it does not contain any rows and these values (terms and valid_at) are initialized with 0. Inside a shard, a row maintains created_at which denotes when the row was created and deleted_at which denotes when it was or is to be deleted. The operations sent by the load balancer to the servers is either of or can be broken down into the following types of elementary operations:

Create: C(shard_id, stud_id, vat)
Read: R(shard_id, stud_id, vat)
Update: U(shard_id, stud_id, vat)
Delete: D(shard_id, stud_id, vat)

The general operation can therefore be written as O(shard_id, stud_id, vat) where O can be C, R, U or D. In the description of the algorithm, lines starting with LB denotes that it is executed by the load balancer whereas those starting with S denotes that they are executed by the server. The return statement is essentially the statement send response to client. The actual requests and responses may contain additional payloads not relevant to the Cat-Dat-Vat algorithm. These payloads are therefore not mentioned in the description of the algorithm. In load balancer or servers, any set of database accesses are all performed within a single transaction, such that when failures occur, the changes are all rolled back and consistency is ensured. Locking of database rows is employed by postgres's SELECT ... FOR UPDATE syntax.

The Algorithm

Algorithm Cat-Dat-Vat(O, shard_id, stud_id, vat):
    LB  |    
    LB  |    servers <-- list of servers for the load balancer to send requests to
    LB  |    max_vat <-- vat
    LB  |    for server in servers:
    S   |        delete entries where dat <= vat
    S   |        delete entries where cat > vat
    S   |        update dat to ∞ where dat > vat
    S   |        if O == R:
    S   |            response = {
    S   |                "data": select entry corresponding to stud_id having cat <= vat
    S   |            }
    S   |            
    S   |            send response to load balancer
    LB  |            send response to client
    LB  |            
    S   |        else:
    S   |            term <-- select term corresponding to shard_id
    S   |            term <-- max(term, vat) + 1
    S   |            
    S   |            if O == C:
    S   |                insert data with cat = term and dat = ∞
    S   |                update term corresponding to shard_id
    S   |                response = {
    S   |                    "vat": term
    S   |                }
    S   |                
    S   |                send response to load balancer        
    LB  |                
    S   |            else if O == D:
    S   |                update dat to term corresponding to stud_id
    S   |                update term corresponding to shard_id
    S   |                response = {
    S   |                    "vat": term
    S   |                }
    S   |                
    S   |                send response to load balaner
    LB  |                
    S   |            else if O == U:
    S   |                # `U` operation is essentially a `D` operation followed by a `C` operation 
    S   |                update dat to term corresponding to stud_id
    S   |                term += 1
    S   |                insert data with cat = term and dat = ∞
    S   |                update term corresponding to shard_id
    S   |                response = {
    S   |                    "vat": term
    S   |                }
    S   |                
    S   |                send response to load balancer
    LB  |                
    LB  |            max_vat <-- max(max_vat, response["vat"])
    LB  |
    LB  |    update vat to max_vat corresponding to shard_id
    LB  |
    LB  |    send response to client

Name		Name	Last commit message	Last commit date
Latest commit History 151 Commits
plots		plots
src		src
.gitignore		.gitignore
DS_assign2_DB.pdf		DS_assign2_DB.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed-Assignment-2

Members

How to run

Analysis

A1

A2

A3

A4

Data Generation

Main Libraries Used

Quart

asyncio

aiohttp

aiodocker

fifolock

asyncpg

Design Choices

Threading vs Asyncio

Cooperative Multitasking

Dockerfile

The Cat-Dat-Vat Algorithm

Preliminaries

The Algorithm

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Distributed-Assignment-2

Members

How to run

Analysis

A1

A2

A3

A4

Data Generation

Main Libraries Used

Design Choices

Threading vs Asyncio

Cooperative Multitasking

Dockerfile

The Cat-Dat-Vat Algorithm

Preliminaries

The Algorithm

About

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Uh oh!

Uh oh!

Languages