Skip to content

Files

Latest commit

 

History

History
46 lines (17 loc) · 3.59 KB

scalable-data-streaming-with-amazon-kinesis.md

File metadata and controls

46 lines (17 loc) · 3.59 KB

Scalable Data Streaming with Amazon Kinesis

> Home

Chapter 4: Kinesis Data Streams

Amazon API Gateway allows developers to easily create an HTTP endpoint to send records to KDS. API Gateway also has support for authentication to secure access to the endpoint. (link)

The combination of the order guarantee and the failing record creates a scenario referred to as a poison pill. (link)

Utilizing an HTTP/2 WebSocket event stream, the message delivery from producer to consumer can be reduced to as little as 70 milliseconds (link)

With the GetRecords API, a consumer can request a batch of records of up to 10,000 records per shard (link)

You can use the AWS Cloud9 environment, described in the Technical requirements section, to install the Linux-based Kinesis Agent. (link)

You can check that the record is in the stream with the AWS CLI. The first command will get shard-id, and the second command will get ShardIterator. Lastly, we will execute get-records with the value of $SHARD_ITERATOR: (link)

When a record is written to the stream, the record cannot be altered or deleted. (link)

Shards can be dynamically added or removed through resharding. The actual sharding process within a stream, such as the splitting and merging of data, is fully managed by Amazon KDS: https (link)

A shard provides an ingestion capacity of 1 MB/per second and 1,000 PUT records per second. The consumption capacity is 2 MB/per second and 2,000 GET records per second (link)

The KCL will transparently track the record's sequence number in an Amazon DynamoDB table (link)

Using an AWS Cloud9 development environment: As an alternative to setting these up in your local development environment, you can create an AWS Cloud9 development environment: https (link)

You want to use KDS when you need to deliver near real-time sub-second performance (link)

Amazon KDS is a managed, massively scalable, durable, and low latency real-time data streaming service used by many of the largest data pipelines in the cloud. (link)

> Home