Repo for the Distributed Systems course, Spring 2023, at NEU.
Demo day on results.
- Donglin Xu's presentation (pretty cool). Peter's feedback:
- when doing a presentation, always explain the business case (even if only for 30 seconds).
- it's good to explain the architecture and how the data flows through it
- when presenting a table, always explain what you are measuring (and it's best to combine it with a graph)
- **always exclude the error cases from your performance data** (failed requests return fast, so including them makes the latency look especially good)
Peter:
- Wheel of the week: every week a team will be randomly selected to present their dashboard, their numbers, and the problems they see. It is not a punishment; most people actually enjoy it.
- what's the throughput of the system end-to-end (vs. the throughput of the server or of the client)? e.g. how many swipes per second actually get resolved? Always come back to the customer's perspective.
- at Amazon, the latency numbers are what they look at most often.
- To monitor performance, one way is the AWS monitoring dashboard; another good way is logging: log the start time with a request id, and on the client side find a way to log the completion time with the same id so you can match them up and plot. (A minimal sketch of this idea is below.)
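A minimal sketch of that id-matching idea (class name and log format are invented for illustration; in the homework setup the server line would be written from the servlet and the client lines from the load-testing client):

```java
import java.util.UUID;

// Toy end-to-end timing: tag a request with an id, log the start and completion
// times against that id, then join the log lines on the id to plot per-request latency.
public class TimingLogDemo {
    public static void main(String[] args) throws InterruptedException {
        String requestId = UUID.randomUUID().toString();

        // Client side: note the send time.
        long clientStart = System.currentTimeMillis();
        System.out.println("CLIENT_START," + requestId + "," + clientStart);

        Thread.sleep(50); // stand-in for network + server processing time

        // Server side: log arrival/processing with the same id (in real code this
        // line goes to the server's log, e.g. from a servlet filter).
        System.out.println("SERVER_SEEN," + requestId + "," + System.currentTimeMillis());

        // Client side: log completion with the same id.
        long clientEnd = System.currentTimeMillis();
        System.out.println("CLIENT_END," + requestId + "," + clientEnd);
        System.out.println("latency(ms)=" + (clientEnd - clientStart));
    }
}
```

Joining the CLIENT_START and CLIENT_END rows on the id gives per-request latency; comparing them with the server-side rows shows how much of it is network vs. processing.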
Donglin:
- runs a k8s (Kubernetes) server at home to do the monitoring
The lecture this Friday features guest speaker Peter Smith. See the Obsidian note for the takeaways.
Here are the notes on doing & learning Homework 4 (~ #LittlesLaw, #multithread)
- Little's Law states that the average number of items in a system (inventory) equals the average arrival rate multiplied by the average time each item spends in the system, i.e. L = λ * W (worked example below).
- λ = throughput (requests/second) = average arrival rate of requests = total number of requests / wall time.
- L = average number of requests in flight in the system.
- W = latency (seconds) = average response time per request.
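A quick worked example plugging numbers into Little's Law (the numbers are invented for illustration, not taken from the homework):

```latex
L = \lambda W, \qquad
\lambda = 500~\text{req/s},\; W = 0.2~\text{s}
\;\Rightarrow\; L = 500 \times 0.2 = 100~\text{requests in the system on average}
```

Rearranged as W = L / λ, the same law lets you sanity-check measured latency: if the client keeps about 100 requests outstanding and observes 500 req/s of throughput, the average response time has to be about 0.2 s.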
-
Peter's coming next week! Topic: microservices.
-
20 mins working on HW2 and HW3.
- metrics, logging, multi-threading!
-
MapReduce paper
- They changed the file system! They designed a specific file system (GFS) for this MapReduce workload.
- More task-specific file systems should be developed! (A toy sketch of the map/reduce phases is below.)
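A toy, single-process sketch of the two phases the paper describes, using its canonical word-count example (my own illustration; real MapReduce shards the map and reduce work across many machines and reads/writes GFS):

```java
import java.util.*;
import java.util.stream.*;

// Word count: the map phase emits (word, 1) pairs, the shuffle/reduce phase
// groups the pairs by word and sums the counts.
public class WordCountSketch {
    public static void main(String[] args) {
        List<String> documents = List.of("the quick brown fox", "the lazy dog", "the fox");

        // Map phase: each document emits (word, 1) pairs.
        List<Map.Entry<String, Integer>> intermediate = documents.stream()
                .flatMap(doc -> Arrays.stream(doc.split(" ")))
                .map(word -> Map.entry(word, 1))
                .collect(Collectors.toList());

        // Shuffle + reduce phase: group by key and sum the values.
        Map<String, Integer> counts = intermediate.stream()
                .collect(Collectors.groupingBy(Map.Entry::getKey,
                        Collectors.summingInt(Map.Entry::getValue)));

        System.out.println(counts); // e.g. {the=3, fox=2, quick=1, ...}
    }
}
```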
Intro and recap:
- First, start with the homework:
- the Google tutorial
- PODC papers
- AWS
- DSFP = Distributed Systems for Fun and Profit (in the syllabus resources)
-
Motivation, class structure, and expectations (AWS).
-
Failure modes (surviving them). Redundancy (which then raises the persistence problem).
-
Things happen at the millisecond level, so how do we keep data synced? Even worse, our clocks might not be exactly the same! (cf. Google's atomic clocks)
-
It all matters: the network, clocks, nodes, communication, client-server architecture.
-
Lock-free data structures -> driving performance improvements in distributed systems (small illustration below).
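A tiny illustration of what "lock-free" buys you, using Java's AtomicLong (my own toy example, not course code): incrementAndGet is implemented as a compare-and-swap loop, so contending threads never block on a lock.

```java
import java.util.concurrent.atomic.AtomicLong;

// Two threads hammer one counter; correctness comes from CAS, not from synchronized.
public class LockFreeCounterDemo {
    public static void main(String[] args) throws InterruptedException {
        AtomicLong counter = new AtomicLong(0);
        Runnable work = () -> {
            for (int i = 0; i < 100_000; i++) {
                counter.incrementAndGet(); // CAS retry loop, no lock held
            }
        };

        Thread t1 = new Thread(work);
        Thread t2 = new Thread(work);
        t1.start();
        t2.start();
        t1.join();
        t2.join();

        System.out.println(counter.get()); // 200000
    }
}
```

The same idea shows up server-side whenever many request-handling threads update shared counters or metrics: no thread ever parks waiting for a lock, which helps throughput under contention.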
-
Two-phase commit (2PC). Three-phase commit (3PC).
-
Consensus algorithms (Raft).
-
Data-intensive computing.
-
Scalability issue.
-
Queuing, caching, monitoring.
-
Distributed database issues. More design patterns for data-intensive computing.
-
Microservices vs. APIs: the key point is being dynamic (more elastic, components can work concurrently), and (think module vs. object) it is more about the scalability factor, being able to swap components out/in.
-
Nonfunctional requirements
-
Performance, performance, performance.
-
collecting performance data itself generates a lot of data
-
analyzing all that data is also challenging
-
Scale: more, more, more, with dynamic spikes.
-
Security: data and access control
-
Availability: hopefully 24/7.
-
Scalability!
- being able to increase and decrease capacity on demand.
- one of the key motivations for a distributed system is to accommodate increased load (and to scale down to save cost when traffic is slow)
- pre-warming.
- Amdahl's Law: how parallelizable the work is ~ how much benefit it can receive from a distributed system (formula and worked example below).
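The formula behind that bullet, with invented numbers for illustration:

```latex
\text{Speedup}(N) = \frac{1}{(1 - p) + \frac{p}{N}}, \qquad
p = \text{parallelizable fraction},\; N = \text{number of nodes}
```

For example, if p = 0.9 of the work can be parallelized and N = 10 nodes are used, the speedup is 1 / (0.1 + 0.09) ≈ 5.3x; even with unlimited nodes the speedup is capped at 1 / (1 - p) = 10x, which is why the serial fraction dominates how much a distributed system can help.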
-
Tradeoffs
- throughput vs. cost
- performance vs. availability
- you need to be able to measure these to discuss them
Make video tutorials for the modular homework steps (sketches for the servlet and consumer steps follow the list):
-
1. Create an EC2 instance to host Tomcat.
-
2. Create a servlet locally via IntelliJ.
-
3. Host the servlet on the EC2 Tomcat.
-
4. Create an EC2 instance to host RabbitMQ.
-
5. Create and deploy the consumer on EC2.
-
6. Create an EC2 instance to do the load balancing.
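Two hedged sketches for the tutorial steps above (class names, paths, hosts, and queue names are placeholders I made up, not the actual assignment code).

Step 2, a minimal servlet using the javax.servlet API (i.e. Tomcat 9 or earlier; Tomcat 10+ renames the package to jakarta.servlet):

```java
import java.io.IOException;
import javax.servlet.annotation.WebServlet;
import javax.servlet.http.HttpServlet;
import javax.servlet.http.HttpServletRequest;
import javax.servlet.http.HttpServletResponse;

// Responds to GET /hello with a plain-text body; package the project as a WAR
// and drop it into Tomcat's webapps/ directory on the EC2 instance.
@WebServlet("/hello")
public class HelloServlet extends HttpServlet {
    @Override
    protected void doGet(HttpServletRequest req, HttpServletResponse resp) throws IOException {
        resp.setContentType("text/plain");
        resp.getWriter().write("hello from Tomcat on EC2");
    }
}
```

Step 5, a minimal RabbitMQ consumer using the official Java client (com.rabbitmq:amqp-client):

```java
import com.rabbitmq.client.Channel;
import com.rabbitmq.client.Connection;
import com.rabbitmq.client.ConnectionFactory;
import com.rabbitmq.client.DeliverCallback;

// Connects to the RabbitMQ EC2 host, declares a durable queue, and prints
// every message it receives. autoAck=true keeps the sketch short; real code
// usually acks manually after processing.
public class QueueConsumer {
    public static void main(String[] args) throws Exception {
        ConnectionFactory factory = new ConnectionFactory();
        factory.setHost("YOUR_RABBITMQ_EC2_HOST"); // placeholder

        Connection connection = factory.newConnection();
        Channel channel = connection.createChannel();
        channel.queueDeclare("swipe_queue", true, false, false, null);

        DeliverCallback callback = (consumerTag, delivery) -> {
            String message = new String(delivery.getBody(), "UTF-8");
            System.out.println("received: " + message);
            // ... process the message here ...
        };
        channel.basicConsume("swipe_queue", true, callback, consumerTag -> { });
    }
}
```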