Hadoop connector tries to connect to a non-data Elasticsearch node and fails #368

@tzhang101

Hi, I am running a Spark Streaming job on YARN and using your package to write to Elasticsearch.
It mostly works, but I found the following two issues.

  1. Your package discovers the AWS private IP addresses of the Elasticsearch nodes and tries to connect to them.
    I have to set up IP forwarding to map each private IP to its public IP for the writes to work.
  2. It also picks a non-data node in the Elasticsearch cluster to connect to, and the connection fails. The detailed error message follows.
    Can you help fix this or advise on a workaround? We need those non-data nodes for security and routing purposes. Thanks.

Tian

15/01/31 03:11:12 INFO httpclient.HttpMethodDirector: I/O exception (java.net.ConnectException) caught when processing request: Connection timed out
15/01/31 03:11:12 INFO httpclient.HttpMethodDirector: Retrying request
15/01/31 03:12:15 INFO httpclient.HttpMethodDirector: I/O exception (java.net.ConnectException) caught when processing request: Connection timed out
15/01/31 03:12:15 INFO httpclient.HttpMethodDirector: Retrying request
15/01/31 03:13:18 INFO httpclient.HttpMethodDirector: I/O exception (java.net.ConnectException) caught when processing request: Connection timed out
15/01/31 03:13:18 INFO httpclient.HttpMethodDirector: Retrying request
15/01/31 03:14:21 ERROR rest.NetworkClient: Node [Connection timed out] failed (10.218.189.172:9200); selected next node [50.16.116.16:9200]

10.218.189.172 is the non-data node. After this failure I can no longer write to Elasticsearch unless I restart the Spark Streaming job.
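
A possible workaround for both points, sketched under assumptions and not confirmed for the connector version used in this report: elasticsearch-hadoop documents the settings `es.nodes`, `es.port`, `es.nodes.discovery` and `es.nodes.wan.only`, which together restrict the connector to the addresses it is given instead of the private addresses the cluster advertises. The helper name `es_wan_settings` is hypothetical, and the address is the public one that appears in the log above.

```python
def es_wan_settings(public_nodes, port=9200):
    """Build elasticsearch-hadoop settings that pin the connector to the listed nodes.

    Assumption: the connector version in use supports es.nodes.wan.only.
    """
    if not public_nodes:
        raise ValueError("at least one node address is required")
    return {
        # Publicly reachable addresses only; private IPs are never contacted.
        "es.nodes": ",".join(public_nodes),
        "es.port": str(port),
        # Do not ask the cluster for additional nodes to connect to.
        "es.nodes.discovery": "false",
        # Route every request through the declared nodes.
        "es.nodes.wan.only": "true",
    }


settings = es_wan_settings(["50.16.116.16"])
```

Each key and value would then be passed to the Spark configuration (for example with `SparkConf.set(key, value)`) before the streaming context is created. Whether this also keeps the job away from the non-data node depends on which addresses are listed in `es.nodes`.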
