Internal Service Error caused subsequent kinesis requests to fail #30

Open
akshaykailaje opened this issue Oct 23, 2015 · 3 comments

Hi

We use the Kinesis Producer Library (v0.10.1) to push events to our Kinesis stream. On 10/20/2015 at 22:32 Pacific Time, we saw a steep rise in errors from the Kinesis Producer Library. The errors appear to have been triggered by an "Internal Service Error" on the Kinesis side, but we didn't see any outages on the AWS service health dashboard.
The errors cleared only after we restarted our Tomcat server.

We also saw a spike in PutRecord latency at the same time, which seems to have been caused by the errors.
[Screenshot: PutRecord latency spike]

Can you please give us insight into why these errors would occur?

Initial error:

Error while logging Kinesis Record - attempts=5, attemptDetails={"errorMessage":"Internal service failure.","duration":5571,"errorCode":"InternalFailure","successful":false,"delay":10013},{"errorMessage":"Internal service failure.","duration":2213,"errorCode":"InternalFailure","successful":false,"delay":4425},{"errorMessage":"Internal service failure.","duration":41,"errorCode":"InternalFailure","successful":false,"delay":5002},{"errorMessage":"Expired while waiting in HttpClient queue","duration":55568920,"errorCode":"Exception","successful":false,"delay":-55566188},{"errorMessage":"Record has reached expiration","duration":0,"errorCode":"Expired","successful":false,"delay":0}

Subsequent Errors:
Error while logging Kinesis Record - attempts=6, attemptDetails={"errorMessage":"Internal service failure.","duration":5571,"errorCode":"InternalFailure","successful":false,"delay":9602},{"errorMessage":"Internal service failure.","duration":2213,"errorCode":"InternalFailure","successful":false,"delay":4425},{"errorMessage":"Internal service failure.","duration":41,"errorCode":"InternalFailure","successful":false,"delay":5002},{"errorMessage":"Expired while waiting in HttpClient queue","duration":55568920,"errorCode":"Exception","successful":false,"delay":-55566188},{"errorMessage":"Expired while waiting in HttpClient queue","duration":55569332,"errorCode":"Exception","successful":false,"delay":-55568920},{"errorMessage":"Record has reached expiration","duration":0,"errorCode":"Expired","successful":false,"delay":0}
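
For reference, a minimal sketch of how these per-attempt details can be read from the KPL result future, assuming the standard amazon-kinesis-producer Java API (the wrapper class below is hypothetical, not our actual logging code):

```java
import java.nio.ByteBuffer;
import java.util.concurrent.ExecutionException;

import com.amazonaws.services.kinesis.producer.Attempt;
import com.amazonaws.services.kinesis.producer.KinesisProducer;
import com.amazonaws.services.kinesis.producer.UserRecordFailedException;
import com.amazonaws.services.kinesis.producer.UserRecordResult;

public class AttemptLogger {
    public static void putAndLog(KinesisProducer producer, String stream,
                                 String partitionKey, byte[] payload) {
        try {
            // Blocking on the future keeps the sketch short; real code would
            // attach a callback to the returned ListenableFuture instead.
            UserRecordResult result =
                producer.addUserRecord(stream, partitionKey, ByteBuffer.wrap(payload)).get();
            System.out.println("put succeeded, sequence number " + result.getSequenceNumber());
        } catch (ExecutionException e) {
            if (e.getCause() instanceof UserRecordFailedException) {
                UserRecordResult failed = ((UserRecordFailedException) e.getCause()).getResult();
                // Each Attempt carries the same fields as the attemptDetails above:
                // error code, error message, duration and delay for every retry.
                for (Attempt a : failed.getAttempts()) {
                    System.err.printf("errorCode=%s errorMessage=%s duration=%d delay=%d%n",
                        a.getErrorCode(), a.getErrorMessage(), a.getDuration(), a.getDelay());
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}
```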


perryn commented Jul 14, 2016

Hi @akshaykailaje,

We are seeing similar behaviour - did you ever get to the bottom of this?

Cheers
Perryn


pfifer commented Feb 15, 2017

Internal service errors indicate that something has gone wrong inside the Kinesis service. There is always a chance you will see them; normally they recover automatically. Looking at the errors you provided, it looks like there may have been an issue that was causing requests to take longer than expected. The long request times may have caused the internal service failures and the subsequent record expiration.

Are you seeing consistent levels of Internal Service Failures?
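
If record expiration is the main symptom, one setting worth checking is the record TTL and request timeout on the producer configuration, since records that cannot be sent before the TTL elapses are dropped as "Expired". A minimal sketch, assuming the KPL Java configuration API; the values are illustrative, not recommendations:

```java
import com.amazonaws.services.kinesis.producer.KinesisProducer;
import com.amazonaws.services.kinesis.producer.KinesisProducerConfiguration;

public class ProducerFactory {
    public static KinesisProducer create() {
        KinesisProducerConfiguration config = new KinesisProducerConfiguration()
            // Region of the target stream (placeholder value).
            .setRegion("us-east-1")
            // How long a record may wait inside the KPL before it is dropped as
            // "Expired". The default is 30000 ms; a larger value gives retries more
            // room to ride out a transient slowdown, at the cost of staler data.
            .setRecordTtl(120000)
            // Per-request timeout in ms; requests exceeding this are retried.
            .setRequestTimeout(6000);
        return new KinesisProducer(config);
    }
}
```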


rakhu commented Sep 1, 2018

We are getting these failures frequently with the 0.12.9 jar on Windows 2012 R2 Standard (VMware, Intel 2 GHz, 4 processors, 6 GB RAM).
During these failures the CPU goes to 100% and brings down the Windows server. As a workaround I call destroy() in the onFailure callback (flushSync() followed by destroy() didn't help), but that way I lose all of the outstanding records. Can you help me find a way to solve this? At the very least, I need a way to get the failed messages back so I can reprocess them.

Note: all default Kinesis configuration values are used.
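
A minimal sketch of one way to keep failed payloads for reprocessing instead of destroying the producer, assuming the KPL Java API and Guava futures (the class and retry queue here are hypothetical):

```java
import java.nio.ByteBuffer;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

import com.amazonaws.services.kinesis.producer.KinesisProducer;
import com.amazonaws.services.kinesis.producer.UserRecordResult;
import com.google.common.util.concurrent.FutureCallback;
import com.google.common.util.concurrent.Futures;
import com.google.common.util.concurrent.ListenableFuture;
import com.google.common.util.concurrent.MoreExecutors;

public class RetryingPutter {
    // Failed payloads land here and can be re-submitted by a separate thread.
    private final BlockingQueue<byte[]> retryQueue = new LinkedBlockingQueue<>();
    private final KinesisProducer producer;
    private final String streamName;

    public RetryingPutter(KinesisProducer producer, String streamName) {
        this.producer = producer;
        this.streamName = streamName;
    }

    public void put(final String partitionKey, final byte[] payload) {
        ListenableFuture<UserRecordResult> f =
            producer.addUserRecord(streamName, partitionKey, ByteBuffer.wrap(payload));
        Futures.addCallback(f, new FutureCallback<UserRecordResult>() {
            @Override
            public void onSuccess(UserRecordResult result) {
                // Record was delivered; nothing to do.
            }

            @Override
            public void onFailure(Throwable t) {
                // Keep the original bytes instead of destroying the producer,
                // so the record can be re-submitted later.
                retryQueue.offer(payload);
            }
        }, MoreExecutors.directExecutor()); // older Guava versions have a two-argument overload
    }

    public byte[] takeFailedRecord() throws InterruptedException {
        return retryQueue.take();
    }
}
```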
