[CI] task ':plugins:repository-hdfs:haHdfs2Fixture' failing #107105

Open
astefan opened this issue Apr 4, 2024 · 4 comments
Labels
:Data Management/HDFS HDFS repository issues low-risk An open issue or test failure that is a low risk to future releases Team:Data Management Meta label for data/management team >test-failure Triaged test failures from CI

Comments

@astefan
Contributor

astefan commented Apr 4, 2024

CI Link

https://gradle-enterprise.elastic.co/s/qj4m5zeidpqji

Repro line

none

Does it reproduce?

Yes

Applicable branches

7.17

Failure history

No response

Failure excerpt

java.io.IOException: java.lang.RuntimeException: Could not resolve Kerberos principal name: java.net.UnknownHostException: i-0827c96753d4ac016.us-west-2.compute.internal: i-0827c96753d4ac016.us-west-2.compute.internal: Name or service not known
    at org.apache.hadoop.http.HttpServer2.<init>(HttpServer2.java:396)
    at org.apache.hadoop.http.HttpServer2.<init>(HttpServer2.java:111)
    at org.apache.hadoop.http.HttpServer2$Builder.build(HttpServer2.java:325)
    at org.apache.hadoop.hdfs.server.namenode.NameNodeHttpServer.start(NameNodeHttpServer.java:155)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.startHttpServer(NameNode.java:842)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:693)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:906)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:885)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1626)
    at org.apache.hadoop.hdfs.MiniDFSCluster.createNameNode(MiniDFSCluster.java:1162)
    at org.apache.hadoop.hdfs.MiniDFSCluster.createNameNodesAndSetConf(MiniDFSCluster.java:1037)
    at org.apache.hadoop.hdfs.MiniDFSCluster.initMiniDFSCluster(MiniDFSCluster.java:830)
    at org.apache.hadoop.hdfs.MiniDFSCluster.<init>(MiniDFSCluster.java:485)
    at org.apache.hadoop.hdfs.MiniDFSCluster$Builder.build(MiniDFSCluster.java:444)
    at hdfs.MiniHDFS.main(MiniHDFS.java:114)
Caused by: java.lang.RuntimeException: Could not resolve Kerberos principal name: java.net.UnknownHostException: i-0827c96753d4ac016.us-west-2.compute.internal: i-0827c96753d4ac016.us-west-2.compute.internal: Name or service not known
    at org.apache.hadoop.security.AuthenticationFilterInitializer.getFilterConfigMap(AuthenticationFilterInitializer.java:90)
    at org.apache.hadoop.http.HttpServer2.getFilterProperties(HttpServer2.java:508)
    at org.apache.hadoop.http.HttpServer2.constructSecretProvider(HttpServer2.java:499)
    at org.apache.hadoop.http.HttpServer2.<init>(HttpServer2.java:388)
    ... 14 more
Caused by: java.net.UnknownHostException: i-0827c96753d4ac016.us-west-2.compute.internal: i-0827c96753d4ac016.us-west-2.compute.internal: Name or service not known
    at java.base/java.net.InetAddress.getLocalHost(InetAddress.java:1647)
    at org.apache.hadoop.security.SecurityUtil.getLocalHostName(SecurityUtil.java:230)
    at org.apache.hadoop.security.SecurityUtil.replacePattern(SecurityUtil.java:198)
    at org.apache.hadoop.security.SecurityUtil.getServerPrincipal(SecurityUtil.java:153)
    at org.apache.hadoop.security.AuthenticationFilterInitializer.getFilterConfigMap(AuthenticationFilterInitializer.java:87)
    ... 17 more
Caused by: java.net.UnknownHostException: i-0827c96753d4ac016.us-west-2.compute.internal: Name or service not known
    at java.base/java.net.Inet6AddressImpl.lookupAllHostAddr(Native Method)
    at java.base/java.net.InetAddress$PlatformNameService.lookupAllHostAddr(InetAddress.java:933)
    at java.base/java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1519)
    at java.base/java.net.InetAddress$NameServiceAddresses.get(InetAddress.java:852)
    at java.base/java.net.InetAddress.getAllByName0(InetAddress.java:1509)
    at java.base/java.net.InetAddress.getLocalHost(InetAddress.java:1642)
    ... 21 more
2024-04-04 12:47:20,663 INFO  [main] hdfs.MiniDFSCluster (MiniDFSCluster.java:shutdown(1789)) - Shutting down the Mini HDFS Cluster

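
The innermost cause in the trace is `InetAddress.getLocalHost()` failing to resolve the worker's own hostname, which Hadoop's `SecurityUtil` needs when building the Kerberos principal. A minimal sketch for checking that resolution on a CI worker, independent of Hadoop, might look like the following. The class name `LocalHostCheck` and the method `describeLocalHost` are hypothetical and not part of the test fixture; only the `java.net` calls come from the trace.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical diagnostic, not part of the HDFS test fixture.
public class LocalHostCheck {

    // Attempts the same lookup that fails in the trace and reports the outcome.
    static String describeLocalHost() {
        try {
            InetAddress local = InetAddress.getLocalHost();
            return "resolved: " + local.getHostName() + " -> " + local.getHostAddress();
        } catch (UnknownHostException e) {
            // This is the exception type at the bottom of the reported trace.
            return "unresolved: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(describeLocalHost());
    }
}
```

If this prints an `unresolved:` line on an affected worker, the problem would appear to lie in the host's name resolution rather than in the fixture's Hadoop configuration.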
@astefan astefan added >test-failure Triaged test failures from CI :Data Management/HDFS HDFS repository issues labels Apr 4, 2024
@elasticsearchmachine
Collaborator

Pinging @elastic/es-data-management (Team:Data Management)

@elasticsearchmachine elasticsearchmachine added blocker Team:Data Management Meta label for data/management team labels Apr 4, 2024
@masseyke
Member

masseyke commented Apr 5, 2024

It looks like this happens very frequently on 7.17, but I haven't seen any failures in 8.x for months. @breskeby tells me that they've done a lot of work on stabilizing this in 8. I'm tempted to mark this as won't fix because it's not worth the effort to backport the test fixture improvements. What do you think, @breskeby @dakrone @jbaiera?

@dakrone
Member

dakrone commented Apr 23, 2024

@masseyke it's marked as reproducing. Is it something that would affect production systems at all?

@dakrone dakrone added low-risk An open issue or test failure that is a low risk to future releases and removed blocker labels Apr 23, 2024
@masseyke
Member

The fixes were in the test fixture, not in production code.
