AGRID

Agrid v0.1.4

Purpose

Agrid is a high available file storage service. Agrid can handle from 1 to several hundred storage nodes. Using command lines it's possible to store and retrieve files to/from this repository. Each file are cut and spread on nodes to be stored on high available way. a file piece is saved several time on distinct nodes. Files can be saved with associated metadata and retrieve with their metadata. File pieces are saved and retrieve in parallele way to speed up the transfer.

Go API are also available to store and retrieve files and also allows direct records file manipulations create/open, read/write, seek, close, on the cluster

File transfers are monitored on a separated process

Agrid is a docker service. It enough to pull its image freignat91/agrid:latest (or build it using make build) and create the docker service to use it on a swarm cluster. It can be scaled using docker service scale command.

Agrid uses grpc protocol (HTTP2) for communication between nodes and between nodes and clients. Under 20 nodes, all nodes are completely connected one to each other, up to 20 nodes Agrid create a grid and the communication between nodes are not direct anymore.

Agrid use Ant like behavior to found the shortest path between two nodes through the grid. The path are dynamically adapted regarding the nodes workload on the paths to stay the shortest in term of time. The grid become more efficient while it is used. see:

./docs/Agrid-grid-building.pptx
./docs/Agrid-Ant-net.pptx

.

Configuration using System Variables:

GRPCPORT: grpc server port used by nodes
NB_DUPLICATE: number of time a file is replicated in the cluster when it is stored: default 3
NB_DUPLICATE_ACK: number of acknowledge replications before a file is considered stored and acknowledge the client: default 1
NB_LINE_CONNECT: number of "line" type connection in grid: default 0 means computed automatically
NB_CROSS_CONNECT: number of "cross" type connection in grid: default 0 means computed automatically
BUFFER_SIZE: messages waiting to be treated buffer size: default 10000
PARALLEL_SENDER: maximum number of messages to be sent treated in parallel: default 100
PARALLEL_RECEIVER: maximum number of received messages treated in parallel: default 100
DATA_PATH: path in container where file data is stored: default: /data, by default for debug reason mapped on host at /tmp/agrid/data

Install

Docker 1.13 min should be installed min

pull this image or rebuild it:

clone the git project: https://github.com/freignat91/agrid
execute make install to build the agrid command line executable
execute make build to create a image freignat91/agrid:latest

agrid can't be used as a single container, it needs to be started as a service on a swarm machine (manager or worker), to create a swarm mamager on the local machine with a usable network execute:

docker swarm init --advertise-addr 127.0.0.1
docker network create -d overlay aNetwork

For instance with 5 nodes, using a publish port 30103 and the network "aNetwork" [dataPath] should exist on the host, it contains all the data saved by agrid

        docker service create --network aNetwork --name agrid \
        --publish 30103:30103 \
        --mount type=bind,source=[dataPath],target=/data \
        --replicas=5 \
        freignat91/agrid:latest

Agrid take more time to start than the "ready" docker status. Every agrid command executed before will be rejected with a message "Node not yet ready"

Resilience

For resilience reason, it's better to have a separated disk file system for each node (each node on its own VM), but for test reason it's possible to have several nodes on the same file system or have architecture with several nodes on several VMs, but the used memory on file retrieve command will be higher than normal.

Node crash

If a node crash (agrid itself, or disk file system failure or VM failure), docker will restart the node. When the new node restart, it will try to get it's previous file system or ask the other nodes to resend the blocks he handles (this last part is targeted for 0.1.5 version)

Scale out

To scale out, it's enough to use docker service scale agrid=xxx command. Agrid will recompute its grid recreating all the node connections accordingly to the number of nodes

Scale in

To scale in, it's enough to use docker service scale agrid=xxx command. Agrid will recompute its grid recreating all the node connections accordingly to the number of nodes. Warning, to do not lose files, it's important to scale in with a difference between the number of nodes lower than the NB_DUPLICATE parameter and let time to Agrid to reorganize the files blocks between each scale command

Grid simulation

To simulate nodes connections using different parameters as, node number, line connections, cross connections,and see the grid topology, use the cli command:

agrid grid simul [nodes] <--line xx > <--cross yy>

[nodes] the number of nodes
<--line xx> optionally: xx the number of line connections
<--cross yy> optionally: yy the number of cross connections

this command as not effect on the real cluster grid connections.see (./docs/Agrid-grid-building.pptx, ./docs/Agrid-Ant-net.pptx)

Users

Agrid use a "common" file space by default, everyone can access to this space. Even if files can be encrypted with a specific key, it's possible to create a user to ensure more confidentiality. Create a user, create at the same time a dedicated file space that no one can access except the user. A user can see and act only on its own file space. To authenticate a user a token a given at user creation by the cluster, this token should be provided for all commands used with a user. It's also possible to provide your own token at user creation and use it for all commandes with this user.

CLI

Agrid command lines is implemented using the Agrid Go API

common options

--verbose: display more informations messages
--server: format addr1:port,addr2:port, ... list of the cluster servers (can list less servers than really in the cluster).

create a user

agrid user create [username] <--token>

Create a user with its own file space in the cluster. This command return a token used to authenticate the user when executing any other command

[username] the user name to create
<--token> set the token for this user, without the token is computed by the server

remove a user

agrid user remove [user] <--force>

Remove a user. All files in its file space should have been removed first

[user] the user to remove, format userName:token
<--force> if this option exist then the user is removed with all its associated files, if not the user is removed only if its file space is empty.

store a new version of a file on cluster:

agrid file store [source] [target] <--thread> <--key> <--user>

[source]: the full pathname of the local file to store
[target]: the full pathname of the file in the cluster*
<--type xxx> set the file type
<--thread number>: optionally: number of threads used to store the file (default 1), each thread open a grpc connection on a distinct node and handle a part of the file (all the block n if n%nbThread==threadNumber)
<--key>: optionally: AES key to encrypt the file
<--user userName:token>: to store on the usee file space, token is given at token creation
<--meta key1=value1,key2=value2, ...>

by default, the file is stored 3 times (config NB_DUPLICATE parameter default value)

retrieve a file from cluster

Retrieve a file from cluster using duplicated blocks if some are missing

agrid file retrieve [source] [target] <--thread> <--key> <--user>

[source]: the full pathname of the file to get in cluster
[target]: the full pathname of the file to write locally
<--thread>: optionally: number of threads used to retrieve the file (default 1), each thread open a grpc connection on a distinct node and handle a part of the file (all the block n if n%nbThread==threadNumber)
<--key>: optionally: AES key to encrypt the file
<--user userName:token>: to store on the usee file space, token is given at user creation
<--meta> metadata file name, default: no metadata file
<--version nn> retrieve the version nn, default: the last version

list the files on the cluster

agrid file ls [path] <--user>

[path]: path name on the cluster to list, default /
<--user userName:token>: to store on the user file space, token is given at user creation
<--version> if exist, list all versions for all selected files

remove a file on the cluster

agrid file rm [pathname] <-r> <--user>

[pathname]: full pathname of the file to remove on the cluster
<-r>: to remove a folder recursively
<--user userName:token>: to store on the usee file space, token is given at user creation
<--version nn> remove a specific version of the file, default: remove all the versions

monitor file transfer

agrid file monitor <--type> <--user>

<--user userName:token>: to store on the usee file space, token is given at user creation
<--type xx> monitor only the file transfer with type=xx

list the cluster nodes

agrid node ls

ping a cluster node

agrid node ping |node]

[node] the node name to ping

API

Agrid is usable using Go api API github.com/freignat91/agrid/agridapi

Usage

        import "github.com/freignat91/agrid/agridapi"
        ...
        api := agridapi.New("localhost:10315")
        err, fileList := api.FileLs("/")
        ...

func (api *AgridAPI) UserCreate(name string, token string) (string, error)

Create a new user, return a token to authenticate the user Argument

name: the user name to create
token: if equal to "", the token is computed by server, if not it'sused as the user token.

func (api *AgridAPI) UserRemove(name string, force bool) error

Remove a user Argument

name: the user name to remove, format userName:token
force : if true the user is removed with all its associated files, if false the user is removed only if its file space is empty.

func (api *AgridAPI) SetUser(user string)

Set the current user and authenticate it with the token, then every api function will be executed with this user Arguements:

user: format userName:token, or "" to set the common user

func (api *AgridAPI) FileLs(folder string, version bool) ([]string, error)

List the file stored on the cluster Argument:

folder: Folder under which the files are listed
version: if true list all the versions of the selected files

func (api AgridAPI) FileStore(localFile string, clusterPathname string, meta []string, nbThread int, key string) (int, error)

Store a file on the cluster, rerturn the version stored and error if happend. Arguments:

localFile: pathname of the local file to store
clusterPathName: pathname of the file on the cluster
metadata associated to the file and stored with the file
nbThread: number of threads used to store the file (each thread open a distinc grpc connection)
key: AES key to encrypt the file on the cluster

func (api *AgridAPI) FileRetrieve(clusterPathname string, localFile string, version int, nbThread int, key string) (map[string]string, int, error)

Retrieve a file from the cluster and return its metadata, its version and error if happend Arguments:

clusterPathname: pathname of the file to get on the cluster
localFile: pathname of the file to write locally
version: to retrieve a specific version, default: the last version is retrieved
nbThread: number of threads used to store the file (each thread open a distinc grpc connection)
key: AES key to decrypt the file

func (api *AgridAPI) FileRm(clusterPathname string, version int, recursive bool) (error, bool)

Remove a file on the cluster Arguments:

clusterPathname: pathname of the file to remove on the cluster
recusive: if true remove all files under the clusterPathname
version: to remove a specifiv version, default: all versions are removed

func (api *AgridAPI) FileSetTransferEventCallback(fileType string, callbackFunction interface{}) error

Set a function callback executed at each TransferEvent Arguments:

fileType: callback only the transfer of files having type "fileType"
callbackFunction: function executed at each TransfertEvent, should have type: func(*AgridAPI.TransferEvent) error

func (api *AgridAPI) NodePing(node string, debugTrace bool) (string, error)

Ping a node Arguments:

node: node name to ping
debugTrace: if true, trace the message especially in the node logs.

func (api *AgridAPI) NodePingFromTo(node1 string, node2 string, debugTrace bool) (string, error)

Ping a node from another node Arguments:

node1: node name of the node which execute the ping
node2: targetted node name
debugTrace: if true, trace the message especially in the node logs.

func (api *AgridAPI) NodeSetLogLevel(node string, logLevel string) error

Set the logLevel on a node(s) Arguments:

node: targetted node
logLevel: error, warn, info, debug

func (api *AgridAPI) NodeLs() ([]string, error)

List the node of the cluster

func (api *AgridAPI) NodeLs() ([]string, error)

List the node of the cluster

func (api AgridAPI) CreateFile(name string, meta map[string]map, key string) AFile, error

Create a new file on cluster and return an AFile instance Argument:

key: if key != "", encrypte the data with the key (AES256)

func (api AgridAPI) OpenFile(name string, version int, key string) AFile, error

Open an existing file on cluster, move the current position to the end of file and return an AFile instance Arguments:

version: file version to retrieve, if 0, retrieve the last version
key: if key != "", encrypte the data with the key (AES256)

Functions on AFile (experimental part)

func (a *AFile) Write(data []byte) (int, error)

Write data in file at the current position

func (a *AFile) WriteAt(data []byte, at int64) (int, error)

Write data in file at the position 'at'

func (a *AFile) WriteString(data []byte) (int, error)

Write the string data in file at the current position

func (a *AFile) WriteStringAt(data []byte, at int64) (int, error)

Write the string data in file at the position 'at'

func (a *AFile) Read(data []byte) (int, error)

Read data in file at the current position

func (a *AFile) ReadAt(data []byte, at int64) (int, error)

Read data in file at the position 'at'

func (a *AFile) Seek(pos int64, whence int) (int64, error)

Move the current possition to 'pos', relative to whence:

0: from the begining of the file
1: from the current position
2: from the end of the file

func (a *AFile) getMetadata() map[string]string

return the metadata map. This map can be updated to in order to be stored with the file.

func (a *AFile) Sync() error

Save the memory data on the cluster

func (a *AFile) Close() error

Save the memory data on the cluster and free all AFile instance ressources

tests

execute: make test

Next versions

allows two (several) agrid clusters to connect one to each other by a given number of shared node connections creating this way an agrid meta cluster able to store and retrieve files with guaranty they have been replicate in each clusters.
use agrid meta-cluster to handle geo-cluster replication if each agrid cluster is hosted in different data centers and allow storage and retrieval taking automatically in account such data-center replication.
Allow any meta-cluster topologies, an agrid meta-cluster in the same region interconnected with another agrid meta-cluster on another region for instance, opening capacity to have a very large number of nodes engage in the storage/retrieve process,but still having a clear hierarchical usage of them (an optimized usage similar to a virtual quadtree)

License

Agrid is licensed under the Apache License, Version 2.0. See https://github.com/freignat91/agrid/blob/master/LICENSE for the full license text.

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
agrid		agrid
agridapi		agridapi
docs		docs
logs		logs
server		server
tests		tests
vendor		vendor
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
glide.lock		glide.lock
glide.yaml		glide.yaml
writeLog.sh		writeLog.sh

License

freignat91/agrid

Folders and files

Latest commit

History

Repository files navigation

AGRID

Purpose

Configuration using System Variables:

Install

Resilience

Node crash

Scale out

Scale in

Grid simulation

Users

CLI

common options

create a user

remove a user

store a new version of a file on cluster:

retrieve a file from cluster

list the files on the cluster

remove a file on the cluster

monitor file transfer

list the cluster nodes

ping a cluster node

API

Usage

func (api *AgridAPI) UserCreate(name string, token string) (string, error)

func (api *AgridAPI) UserRemove(name string, force bool) error

func (api *AgridAPI) SetUser(user string)

func (api *AgridAPI) FileLs(folder string, version bool) ([]string, error)

func (api *AgridAPI) FileStore(localFile string, clusterPathname string, meta *[]string, nbThread int, key string) (int, error)

func (api *AgridAPI) FileRetrieve(clusterPathname string, localFile string, version int, nbThread int, key string) (map[string]string, int, error)

func (api *AgridAPI) FileRm(clusterPathname string, version int, recursive bool) (error, bool)

func (api *AgridAPI) FileSetTransferEventCallback(fileType string, callbackFunction interface{}) error

func (api *AgridAPI) NodePing(node string, debugTrace bool) (string, error)

func (api *AgridAPI) NodePingFromTo(node1 string, node2 string, debugTrace bool) (string, error)

func (api *AgridAPI) NodeSetLogLevel(node string, logLevel string) error

func (api *AgridAPI) NodeLs() ([]string, error)

func (api *AgridAPI) NodeLs() ([]string, error)

func (api *AgridAPI) CreateFile(name string, meta map[string]map, key string) *AFile, error

func (api *AgridAPI) OpenFile(name string, version int, key string) *AFile, error

Functions on AFile (experimental part)

func (a *AFile) Write(data []byte) (int, error)

func (a *AFile) WriteAt(data []byte, at int64) (int, error)

func (a *AFile) WriteString(data []byte) (int, error)

func (a *AFile) WriteStringAt(data []byte, at int64) (int, error)

func (a *AFile) Read(data []byte) (int, error)

func (a *AFile) ReadAt(data []byte, at int64) (int, error)

func (a *AFile) Seek(pos int64, whence int) (int64, error)

func (a *AFile) getMetadata() map[string]string

func (a *AFile) Sync() error

func (a *AFile) Close() error

tests

Next versions

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

func (api AgridAPI) FileStore(localFile string, clusterPathname string, meta []string, nbThread int, key string) (int, error)

func (api AgridAPI) CreateFile(name string, meta map[string]map, key string) AFile, error

func (api AgridAPI) OpenFile(name string, version int, key string) AFile, error

Packages