Hotspot Tracker

Overview

The Hotspot Tracker efficiently tracks the top N most frequently requested keys across a distributed system. It uses sharding and min-heaps to maintain performance and scalability.

Design Choices

Assumptions

RecordRequest will be most frequently called method.
Efficient concurrent access is crucial for a multi-threaded access.
Accurately track top N keys by frequency

Min-Heap

Min-heap is used within each shard as basic data structure to efficiently track the top N keys by frequency. The min-heap ensures that operations for maintaining the top N keys are logarithmic in complexity, providing an efficient way to manage frequent updates.

Trade-offs:

Memory Usage vs. Performance: Using a heap data structure ensures that the tracker operates efficiently even with frequent updates. While this incurs some memory overhead for maintaining the heap, the performance gains from logarithmic operations make it a suitable choice.

Sharding

Sharding distributes the load across multiple sub-trackers, reducing contention and improving concurrency. By breaking the data into smaller, manageable pieces, each shard can operate independently, which enhances parallel processing and reduces bottlenecks.

Trade-offs::

Memory Usage vs. Performance: While sharding improves concurrency and reduces contention, it increases memory usage because each shard maintains its own data structures. However, the performance benefits from reduced contention outweigh the increased memory overhead. While latency increased for individual operations but concurrent operations improved. refer bench.md

FNV Hash

The FNV hash function is chosen for key partitioning because it provides a good distribution of hash values, reducing the likelihood of hash collisions. This helps in evenly distributing keys across shards.

Trade-offs:

Complexity vs. Distribution Quality: While FNV is relatively simple and fast, it provides a good balance between complexity and the quality of distribution. This ensures that keys are evenly spread across shards, minimizing contention and maximizing concurrency.

Usage

Import

import "github.com/aayush993/htracker"

Initialization

ht := htracker.NewHotspotTracker(10, 4) // Track top 10 keys across 4 shards
ht.RecordRequest("key1")
ht.RecordRequest("key2")

hotspots := ht.GetHotspots()

fmt.Println(hotspots)

isHotspot := ht.IsHotspot("key1")
fmt.Println(isHotspot)

go test -v

go test -bench=.

Pending Improvements

Read Cache

Trying to implement a cache mechanism to periodically update the list of hotspots. This caching strategy will balance the need for performance with the requirement for accuracy, ensuring that the system does not become a bottleneck while still providing up-to-date hotspot information.

ht := htracker.NewHotspotTracker(4, 4).WithCache(1 * time.Microsecond)

Trade-offs:

Staleness vs. Performance: The cache may introduce slight staleness in hotspot data, but it is expected to reduce the performance overhead of frequently updating the hotspot list.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
bench.md		bench.md
go.mod		go.mod
htracker.go		htracker.go
htracker_test.go		htracker_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hotspot Tracker

Overview

Design Choices

Assumptions

Min-Heap

Trade-offs:

Sharding

Trade-offs::

FNV Hash

Trade-offs:

Usage

Import

Initialization

Pending Improvements

Read Cache

Trade-offs:

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hotspot Tracker

Overview

Design Choices

Assumptions

Min-Heap

Trade-offs:

Sharding

Trade-offs::

FNV Hash

Trade-offs:

Usage

Import

Initialization

Pending Improvements

Read Cache

Trade-offs:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages