Proglog

Commit logs - append-only data structure, sequenced by time.
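
To make the idea concrete, here is a minimal in-memory sketch of an append-only log, assuming records are identified by their position (offset) in the sequence. The `Record`, `Log`, `Append`, and `Read` names are illustrative and not taken from the project's code.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// Record is one entry in the log; Offset is assigned when it is appended.
type Record struct {
	Value  []byte
	Offset uint64
}

// Log is an in-memory, append-only sequence of records (illustrative only).
type Log struct {
	mu      sync.Mutex
	records []Record
}

// ErrOffsetNotFound is returned when reading past the end of the log.
var ErrOffsetNotFound = errors.New("offset not found")

// Append adds a record to the end of the log and returns its offset.
func (l *Log) Append(value []byte) uint64 {
	l.mu.Lock()
	defer l.mu.Unlock()
	off := uint64(len(l.records))
	l.records = append(l.records, Record{Value: value, Offset: off})
	return off
}

// Read returns the record stored at the given offset.
func (l *Log) Read(offset uint64) (Record, error) {
	l.mu.Lock()
	defer l.mu.Unlock()
	if offset >= uint64(len(l.records)) {
		return Record{}, ErrOffsetNotFound
	}
	return l.records[offset], nil
}

func main() {
	var l Log
	l.Append([]byte("first"))
	off := l.Append([]byte("second"))
	rec, _ := l.Read(off)
	fmt.Println(rec.Offset, string(rec.Value))
}
```

Existing records are never modified or reordered; the only write operation is adding to the end.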

http Handlers notes:

Each handler consists of three steps:

Unmarshal the request's JSON body into a struct.
Complete that endpoint's logic with the request, obtain a result.
Marshal and write the result to the response.

If a handler becomes much more complicated than this, move request and response handling into middleware and push the business logic further down the stack.
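
The three steps can be sketched roughly as follows. This is an assumption-laden illustration: the `server` type, the `ProduceRequest`/`ProduceResponse` shapes, and the `call` helper are hypothetical names, and a plain slice stands in for the real log.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
	"strings"
)

// ProduceRequest and ProduceResponse are hypothetical JSON shapes.
type ProduceRequest struct {
	Value string `json:"value"`
}

type ProduceResponse struct {
	Offset uint64 `json:"offset"`
}

// server holds a slice that stands in for the log (not safe for concurrent use).
type server struct {
	records []string
}

func (s *server) handleProduce(w http.ResponseWriter, r *http.Request) {
	// Step 1: unmarshal the request's JSON body into a struct.
	var req ProduceRequest
	if err := json.NewDecoder(r.Body).Decode(&req); err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}
	// Step 2: run the endpoint's logic and obtain a result.
	s.records = append(s.records, req.Value)
	res := ProduceResponse{Offset: uint64(len(s.records) - 1)}
	// Step 3: marshal the result and write it to the response.
	w.Header().Set("Content-Type", "application/json")
	if err := json.NewEncoder(w).Encode(res); err != nil {
		http.Error(w, err.Error(), http.StatusInternalServerError)
	}
}

// call sends body to the handler in-process and returns the status code and response body.
func call(s *server, body string) (int, string) {
	req := httptest.NewRequest(http.MethodPost, "/", strings.NewReader(body))
	rec := httptest.NewRecorder()
	s.handleProduce(rec, req)
	return rec.Code, strings.TrimSpace(rec.Body.String())
}

func main() {
	s := &server{}
	code, out := call(s, `{"value":"hello"}`)
	fmt.Println(code, out)
}
```

The handler stays thin: decoding, one call into the logic, encoding. Anything beyond that is a hint to move code elsewhere.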

protobuf & grpc notes:

Protobuf pros:

type safety
prevents schema violations
fast serialization
backward compatibility (new code can read old data structures)

The magic internal packages are used to restrict access to certain packages within a project. Packages inside an internal directory can only be imported by code within the parent directory or its subdirectories. This helps in encapsulating code and preventing it from being used outside its intended scope.

Types of gRPC RPCs:

Unary: single request and single response.
Server streaming: single request and multiple responses (streamed from server to client).
Client streaming: multiple requests (streamed from client to server) and a single response.
Bidirectional streaming: both client and server send a sequence of messages using a read-write stream.

Note: Recv() is a blocking call; it waits until a message is received or the stream is closed.
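
A rough sketch of the usual receive loop on the reading side of a stream. To keep it runnable without the gRPC library, `recordStream` and `chanStream` are hypothetical stand-ins for a generated stream type; the part that carries over is the loop shape: block in Recv(), stop on io.EOF, bail out on any other error.

```go
package main

import (
	"errors"
	"fmt"
	"io"
)

// recordStream is a hypothetical stand-in for a generated gRPC stream.
type recordStream interface {
	Recv() (string, error)
}

// chanStream fakes a stream with a channel: Recv blocks until a message
// arrives or the channel is closed, in which case it reports io.EOF.
type chanStream struct {
	ch chan string
}

func (s *chanStream) Recv() (string, error) {
	msg, ok := <-s.ch
	if !ok {
		return "", io.EOF
	}
	return msg, nil
}

// drain reads messages until the stream ends and returns everything received.
func drain(s recordStream) ([]string, error) {
	var out []string
	for {
		msg, err := s.Recv()
		if errors.Is(err, io.EOF) {
			return out, nil // stream closed cleanly
		}
		if err != nil {
			return out, err
		}
		out = append(out, msg)
	}
}

func main() {
	s := &chanStream{ch: make(chan string)}
	go func() {
		s.ch <- "a"
		s.ch <- "b"
		close(s.ch)
	}()
	msgs, err := drain(s)
	fmt.Println(msgs, err)
}
```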

Security notes:

Fav quote about securing an application: "Whenever I'm building a service, I think about what it'd be like if the data I'm trying to protect was publicly posted all over planet. Picturing this gives me the motivation to make sure that sort of thing doesn't happen to me, ..."

Security of distributed services in three steps:

Encrypt the transmitted/in-flight data against MITM (man in the middle)
- TLS - successor to SSL
- Typically, web services use one-way auth and only authenticate the server, through the handshake that's initiated when the client and server connect. It's up to the app to authenticate the user
- Certs for internal distributed systems don't need to come from a third party, one can operate a CA (cert authority) themself
Authentication to identify clients (who is who)
- There's also two-way auth, or TLS mutual auth, which is used in machine-to-machine communication and distributed systems. Both client and server use a cert to authenticate themselves
Authorization to assign the right permissions to the ID-ed clients.
- when you have a shared resource with varying levels of ownership (read/write permissions)
- ACL (access control list) is quite common. It's essentially a table of rules on what someone can or can't do.
- In an ACL, permissions are attached directly to resources; in RBAC (role-based access control), they are attached to roles
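
The "table of rules" idea behind an ACL can be sketched in a few lines. This is only an illustration with made-up names (`rule`, `ACL`, `Authorize`) and a deny-by-default policy; the project itself delegates this to the casbin package.

```go
package main

import "fmt"

// rule says that subject may perform action on object.
type rule struct {
	subject, object, action string
}

// ACL is a lookup table of allowed rules; anything not listed is denied.
type ACL struct {
	allowed map[rule]bool
}

// NewACL builds the table from a list of allowed rules.
func NewACL(rules ...rule) *ACL {
	a := &ACL{allowed: make(map[rule]bool)}
	for _, r := range rules {
		a.allowed[r] = true
	}
	return a
}

// Authorize returns nil if the subject may perform the action on the object.
func (a *ACL) Authorize(subject, object, action string) error {
	if a.allowed[rule{subject, object, action}] {
		return nil
	}
	return fmt.Errorf("%s not permitted to %s %s", subject, action, object)
}

func main() {
	acl := NewACL(
		rule{"root", "log", "produce"},
		rule{"root", "log", "consume"},
		rule{"reader", "log", "consume"},
	)
	fmt.Println(acl.Authorize("reader", "log", "consume"))
	fmt.Println(acl.Authorize("reader", "log", "produce"))
}
```

With mutual TLS in place, the subject would typically be the identity taken from the client's certificate.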

Observability notes:

Observability is the measure of how well we understand our system's internals from its outputs. It gives us the chance to look into and fix unexpected problems.

Three types of telemetry data:

Metrics

Numeric data over time that helps us define SLIs, SLOs, and SLAs. Typically, as your system / business grows, you can reduce the resolution of the metrics by making them less granular, aggregating them, and deleting irrelevant data after processing, which makes them easier on storage.

Counters

Track the num of times something happened. Often used to get rate aka how many times per time interval something happened. Requests handled per second, error rate.

Histograms

Show the distribution of data. Mainly used for measuring percentiles of request durations and sizes.

Gauges

Track the current value of something. Useful for saturation-type metrics: host's disk storage, num of load balancers compared to provider's limit.
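
To show how the three metric types differ, here is a small standard-library sketch. The `Counter`, `Gauge`, and `Histogram` types are hypothetical and far simpler than what a real metrics library provides; the bucket bounds in `main` are arbitrary.

```go
package main

import (
	"fmt"
	"sort"
	"sync/atomic"
)

// Counter only goes up: it tracks how many times something happened.
type Counter struct{ n int64 }

func (c *Counter) Inc()         { atomic.AddInt64(&c.n, 1) }
func (c *Counter) Value() int64 { return atomic.LoadInt64(&c.n) }

// Gauge tracks the current value of something and can go up or down.
type Gauge struct{ v int64 }

func (g *Gauge) Set(v int64)  { atomic.StoreInt64(&g.v, v) }
func (g *Gauge) Value() int64 { return atomic.LoadInt64(&g.v) }

// Histogram counts observations per bucket (not safe for concurrent use).
// Bucket i holds values <= bounds[i]; the last bucket holds everything larger.
type Histogram struct {
	bounds []float64
	counts []int
}

func NewHistogram(bounds ...float64) *Histogram {
	b := append([]float64(nil), bounds...)
	sort.Float64s(b)
	return &Histogram{bounds: b, counts: make([]int, len(b)+1)}
}

// Observe records one value in the first bucket whose bound is >= v.
func (h *Histogram) Observe(v float64) {
	i := sort.SearchFloat64s(h.bounds, v)
	h.counts[i]++
}

func main() {
	var requests Counter
	var diskUsedPct Gauge
	latencyMs := NewHistogram(10, 100)

	for _, d := range []float64{5, 20, 200} {
		requests.Inc()
		latencyMs.Observe(d)
	}
	diskUsedPct.Set(73)

	fmt.Println(requests.Value(), diskUsedPct.Value(), latencyMs.counts)
}
```

A rate is then the change in a counter divided by the time interval, and percentiles can be estimated from the histogram's bucket counts.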

What to measure (Google's four golden signals)?

Latency

the time it takes your service to process requests. Can be a signal to scale the system.

Traffic

the amount of demand on the service. This could be requests processed per second, num of concurrent users (for streaming), etc.

Errors

request failure rate, esp. internal server errors.

Saturation

a measure of the service's capacity. At the current ingress rate, how soon will you run out of hard drive space? How much memory does the service use compared to the available memory?

Structured logs

A set of ordered name-value pairs encoded in a consistent schema and format. They let you separate log capture from transporting, persisting, and querying. It can be good practice to connect the logs to an event-streaming platform like Kafka.
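
A minimal sketch of what "name-value pairs in a consistent schema" can look like, using only encoding/json. The `entry` fields are assumptions for illustration; the project uses the zap library for this rather than hand-rolled structs.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// entry is a hypothetical log schema: every log line carries the same
// named fields in the same order, so downstream tools can parse and query it.
type entry struct {
	Level      string `json:"level"`
	Msg        string `json:"msg"`
	Method     string `json:"method"`
	DurationMs int64  `json:"duration_ms"`
}

// encode renders one log entry as a single JSON line.
func encode(e entry) (string, error) {
	b, err := json.Marshal(e)
	return string(b), err
}

func main() {
	line, err := encode(entry{
		Level:      "info",
		Msg:        "request handled",
		Method:     "Produce",
		DurationMs: 12,
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(line)
}
```

Because each line is self-describing, the code that emits logs does not need to know where they are shipped or how they are queried later.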

Traces

capture request lifecycles and let you track requests as they flow through your system. There are services that can provide a visual representation of where the request spent its time.

Service discovery notes:

For clients to reach a server, once we have more than one instance, we put a load balancer in front of the instances. The load balancer knows the address of each node and its status. It routes the client to an appropriate instance. Load balancers are fine to use, but they come with trade-offs: they can be a SPOF (single point of failure) and they add cost, maintenance, and possibly latency.

For server-to-server communication, i.e. for internal services to communicate, we don't really need a trust boundary since the communication is internal. We still, however, need to discover the other instances and services in the system to talk to. Service discovery keeps track of the server instances, their IPs, ports, and health, and deregisters them or updates their status if they go offline.

Relying on a standalone service-discovery service transfers the burden of running it from you to your users, which is honestly not a big deal for an in-org service. It's now possible, however, to embed service discovery into your own service. This is useful because once a new node comes online, it can discover the other nodes and replicate their data, which makes the service more resilient.

Pull-based replication - periodically poll the data source to check if there's new data to consume (good in log and msg systems when consumers and workloads can differ, e.g. one runs continuously, the other every 24h).

Push-based replication - the data source pushes the data to its replicas.
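
Pull-based replication can be sketched as a replica that remembers how far it has read and asks the source for anything newer. The `source` and `replica` types are hypothetical, and a real replica would call `Poll` on a timer or in a loop rather than by hand.

```go
package main

import "fmt"

// source is a hypothetical data source that replicas poll.
type source struct {
	records []string
}

// ReadFrom returns all records at or after the given offset.
func (s *source) ReadFrom(offset int) []string {
	if offset >= len(s.records) {
		return nil
	}
	return s.records[offset:]
}

// replica keeps a local copy; its length doubles as the next offset to fetch.
type replica struct {
	records []string
}

// Poll asks the source for records the replica does not have yet and
// returns how many were copied. The replica decides when to call it.
func (r *replica) Poll(s *source) int {
	fresh := s.ReadFrom(len(r.records))
	r.records = append(r.records, fresh...)
	return len(fresh)
}

func main() {
	src := &source{records: []string{"a", "b"}}
	rep := &replica{}

	fmt.Println(rep.Poll(src)) // copies the two existing records
	fmt.Println(rep.Poll(src)) // nothing new yet

	src.records = append(src.records, "c")
	fmt.Println(rep.Poll(src)) // picks up the new record
}
```

The consumer controls the pace here, which is why this model suits consumers with very different schedules.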

Load balancing notes:

Strategies:

Server proxying (most common): the client sends its requests to a load balancer that knows the servers either by querying a service registry or by encapsulating one. It proxies the requests to the back-end services.
External load balancing (operational burden(?) + costs): the client calls an external load-balancing service that knows the servers and tells the client what address to query.
Client-side balancing (when you trust the clients, e.g. for internal use): the client takes on the responsibility of querying the service registry to learn about servers and picks a server to send the call to.

gRPC balances the calls using a DNS resolver (default) and a round-robin algorithm. Round-robin works well when all servers perform the same type of work equally. It doesn't work well for a leader-followers architecture. It also doesn't work well with a globally distributed service, as we'd want to connect clients to the instances located closest to them. Lastly, it doesn't work well if you want to direct requests to the server with, say, the lowest number of queued requests (trying to optimize for latency).

If we wanted to use a different algorithm, we could always write our own resolver (discovers servers) and picker (directs produce calls to the leader and balances consume calls across the followers).

Pickers handle the RPC balancing logic. They pick a server from the servers discovered by the Resolver to handle each RPC. Pickers can route client calls based on information about the call, client, and server.
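
The picking logic described above can be sketched without the gRPC balancer machinery. This `Picker` is a hypothetical, simplified stand-in: it works on address strings instead of gRPC sub-connections, and the method names passed to `Pick` are assumed.

```go
package main

import (
	"fmt"
	"strings"
	"sync/atomic"
)

// Picker routes calls by method: writes go to the leader, reads are
// spread round-robin across the followers.
type Picker struct {
	leader    string
	followers []string
	next      uint64 // number of follower picks handed out so far
}

// Pick returns the address that should handle the given RPC method.
func (p *Picker) Pick(method string) string {
	// Produce calls must go to the leader; so does everything else
	// when there are no followers to read from.
	if strings.Contains(method, "Produce") || len(p.followers) == 0 {
		return p.leader
	}
	n := atomic.AddUint64(&p.next, 1)
	return p.followers[int((n-1)%uint64(len(p.followers)))]
}

func main() {
	p := &Picker{
		leader:    "leader:8400",
		followers: []string{"follower-1:8400", "follower-2:8400"},
	}
	for _, m := range []string{"/log.v1.Log/Produce", "/log.v1.Log/Consume", "/log.v1.Log/Consume"} {
		fmt.Println(m, "->", p.Pick(m))
	}
}
```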

Kubernetes notes:

Kubernetes is an open source orchestration system for automating the deployment, scaling, and operation of services running in containers. It has a REST API. You give it a desired end state and it figures out how to get there from the current one.

Pods - the smallest deployable unit in K8s. All containers (processes) running in a pod share the same network namespace, IP address, interprocess communication (IPC) namespace, and volumes. Pods are logical hosts.

Nodes - the physical (or virtual) hosts that run pods; a node may run multiple pods.

Kubernetes is extensible (you can create custom resources and controllers).

Controllers - control loops that watch the state of the resources and make changes where needed. Kubernetes is made up of many controllers.

kubectl is a common way of interacting with Kubernetes.

Kind (K8s in Docker) - lets you run local K8s clusters using Docker containers as nodes.

// kind create cluster
// kubectl cluster-info
// kubectl cluster-info dump
// kind load docker-image github.com/innazh/proglog:0.0.1

Services - expose an app as a network service. In the definition, we need to specify which Pods the service applies to and how to access them.

ClusterIP - default service type. The service is only reachable within the K8s cluster, has an internal cluster IP.
NodePort - the service is available outside the cluster. Service is exposed on each node's IP on a static port.
LoadBalancer - exposes the service externally using a cloud provider's load balancer. Automatically creates the ClusterIP and NodePort services behind the scenes.
ExternalName - lets you alias a DNS name.

Helm notes:

Helm is a package manager for K8s that enables you to distribute and install services in K8s. Packages == charts. Think "npm for Kubernetes". A chart defines all the resources required to run a service in a K8s cluster. Just like npm, Helm makes it easier for others to run your application.

A release - an instance of running a chart. It's like a 'process'.

Repositories are used to share charts. You can add repos to install packages from them.

// helm repo add x https://...
// helm install x x

The Raft Consensus Algorithm

https://raft.github.io/

Raft is typically used for leader election as well as replication. Consensus is a fundamental problem in fault-tolerant distributed systems, since a number of different services need to agree on values/state. A consensus algorithm ensures that all servers agree on the order of log entries/operations. The recommended cluster sizes are 3 (to handle 1 failure) and 5 (to handle 2 failures).
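
The cluster-size recommendation follows from majority arithmetic: a cluster of n servers needs a majority (quorum) to make progress, so it can lose the rest. A small sketch with illustrative function names:

```go
package main

import "fmt"

// Quorum is the smallest majority of a cluster with n servers.
func Quorum(n int) int {
	return n/2 + 1
}

// ToleratedFailures is how many servers can fail while a majority remains.
func ToleratedFailures(n int) int {
	return (n - 1) / 2
}

func main() {
	for _, n := range []int{3, 4, 5} {
		fmt.Printf("servers=%d quorum=%d tolerated failures=%d\n",
			n, Quorum(n), ToleratedFailures(n))
	}
}
```

Going from 3 to 4 servers raises the quorum without tolerating any more failures, which is why odd sizes are the ones recommended.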

The order of building / operations in this project:

Chapter 1:

We defined the model of a Log and access methods.
We defined an HTTP server, a method to create it, routes, and the handlers' names and signatures.
Request and response structs (since we're receiving requests and sending responses, that have to be marshalled/unmarshalled.)
Implement the handlers
main.go logic to run the server

Chapter 2: Protocol Buffers

Define protos & make sure they compile.
Learning opportunity: can write protobuf extensions/plugins.

Chapter 3: Write a Log Package

Create a store for our log files (a wrapper around a 'file' in our case)
Code up the read and write methods to persist our records
Test file
Write out the index struct and logic, test file
Segment logic (so that we can split our log into segments when one gets too big), test file
Code the Log + test
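
The segment idea can be sketched in memory: the log writes to an active segment and rolls over to a new one when the active segment is full. This is a simplification with assumed names; here "too big" means a record count (`maxRecords`), whereas a file-backed implementation would more likely limit bytes.

```go
package main

import "fmt"

// segment holds a contiguous run of records starting at baseOffset.
type segment struct {
	baseOffset uint64
	records    [][]byte
}

// Log appends to the last (active) segment and rolls over when it is full.
type Log struct {
	maxRecords int // per-segment limit; must be at least 1
	segments   []*segment
}

func NewLog(maxRecords int) *Log {
	return &Log{maxRecords: maxRecords, segments: []*segment{{baseOffset: 0}}}
}

// Append stores the value and returns its offset in the whole log.
func (l *Log) Append(value []byte) uint64 {
	active := l.segments[len(l.segments)-1]
	if len(active.records) >= l.maxRecords {
		// The active segment is full: start a new one right after it.
		active = &segment{baseOffset: active.baseOffset + uint64(len(active.records))}
		l.segments = append(l.segments, active)
	}
	active.records = append(active.records, value)
	return active.baseOffset + uint64(len(active.records)) - 1
}

// Read finds the segment that contains the offset and returns the record.
func (l *Log) Read(offset uint64) ([]byte, bool) {
	for _, s := range l.segments {
		if offset >= s.baseOffset && offset < s.baseOffset+uint64(len(s.records)) {
			return s.records[offset-s.baseOffset], true
		}
	}
	return nil, false
}

func main() {
	l := NewLog(2)
	for _, v := range []string{"a", "b", "c", "d", "e"} {
		l.Append([]byte(v))
	}
	rec, ok := l.Read(3)
	fmt.Println(len(l.segments), string(rec), ok)
}
```

Splitting into segments also makes it cheap to drop old data: whole segments can be removed without touching the rest.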

Chapter 4: Add gRPC service

Add grpc Log service, declare methods, response and request objects
Compile the code and see it generate log_grpc.pb.go
Implement a grpc server that will implement the Log Service and define its methods
Error handling
Swap out the concrete Log structure / object our server depends on to an interface
Create a gRPC server and register it (NewGRPCServer)
Tests!

Chapter 5: Security

Create a cert issuer authority using CloudFlare's open source lib
Define the configs and write out the makefile cmds to generate certs
Add a /config dir to take care of retrieving the cert files and parsing them
Add grpc opts to our server so it can handle a creds opt to handle tls conns
Add ACL by adding policy and model, use casbin pkg to enforce it
Add an interceptor / middleware to our grpc server to extract cert's cn for the server to check
In the test cases / when instantiating the server, we now define the Authorizer interface and voila!

Chapter 6: Observability

Add libs for logging, metrics, and tracing (OpenCensus, zap)
Set it up at the start of the server
Wrap the created and configured log in the middleware
Before instantiating the server, setup the files/output for tracing and metrics via LogExporter
Close the files as a part of graceful shutdown

Chapter 7: Service discovery

We're using Serf, HashiCorp's library for tracking the state of our cluster and passing the info from one node to another
Defined handler interface which can keep track of the members that leave or join
Defined the functionality and config for the nodes inside the cluster: we're listening on join, leave, and fail events for the nodes
Test file that implements the handler which just keeps track of the members (it doesn't have to be complicated at this point)
Build replication (implements the handler)
Build an Agent that orchestrates and sets up the entire service instance, visual representation of the service is in the img below
Test that sets up a cluster with 3 nodes and verifies that the other servers replicate the record we write to one of them.

Problem with the current replication implementation: the servers replicate each other in a cycle, so the same records get replicated indefinitely.

Chapter 8: Coordinating services with Consensus

Get hashicorp's raft
Create a DistributedLog that consists of Raft, Log, LogStore, and Config
Extend our Config struct to include a RaftConf struct
Write methods to init Raft and bootstrap the cluster
Write DistributedLog's API, and implement all interfaces required by Raft
Integrate our discovery layer with Raft (Impl discovery.Handler's interface for DistributedLog)
Set up multiplexing to run multiple services on one port

Chapter 9: Discover services and Load Balance

Add a GetServers() endpoint that clients can call to get the servers' information (this is basically our server-attached service registry)
Code loadbalance.Resolver (implements grpc's resolver & builder interfaces), register it, and write a test
Code loadbalance.Picker and impl the tests
Update the client in our agent test to use our resolver, and add the wait for replication there

Chapter 10: Deploy with Kubernetes

Install k8s & kind, get a cluster running
Write a CLI interface that can run our program using cobra, viper
Dockerfile, makefile cmd to build it
Load the image into the locally running Kind cluster
Create a helm package for easier deploy
Deploy our package locally!
