Skip to content

[CI] HdfsRepositoryTests class failing #127288

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
elasticsearchmachine opened this issue Apr 23, 2025 · 2 comments · Fixed by #127534
Closed

[CI] HdfsRepositoryTests class failing #127288

elasticsearchmachine opened this issue Apr 23, 2025 · 2 comments · Fixed by #127534
Assignees
Labels
:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs low-risk An open issue or test failure that is a low risk to future releases Team:Distributed Coordination Meta label for Distributed Coordination team >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

elasticsearchmachine commented Apr 23, 2025

Build Scans:

Reproduction Line:

undefined

Applicable branches:
8.19

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

undefined

Issue Reasons:

  • [8.19] 5 consecutive failures in class org.elasticsearch.repositories.hdfs.HdfsRepositoryTests
  • [8.19] 17 consecutive failures in step openjdk17_checkpart1_java-matrix
  • [8.19] 18 consecutive failures in step openjdk17_checkpart1_java-fips-matrix
  • [8.19] 19 consecutive failures in step openjdk21_checkpart1_java-matrix
  • [8.19] 18 consecutive failures in step graalvm-ce17_checkpart1_java-matrix
  • [8.19] 88 failures in class org.elasticsearch.repositories.hdfs.HdfsRepositoryTests (14.6% fail rate in 602 executions)
  • [8.19] 16 failures in step openjdk22_checkpart1_java-matrix (94.1% fail rate in 17 executions)
  • [8.19] 17 failures in step openjdk17_checkpart1_java-matrix (100.0% fail rate in 17 executions)
  • [8.19] 18 failures in step openjdk17_checkpart1_java-fips-matrix (100.0% fail rate in 18 executions)
  • [8.19] 19 failures in step openjdk21_checkpart1_java-matrix (100.0% fail rate in 19 executions)
  • [8.19] 18 failures in step graalvm-ce17_checkpart1_java-matrix (100.0% fail rate in 18 executions)
  • [8.19] 19 failures in pipeline elasticsearch-periodic (100.0% fail rate in 19 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs >test-failure Triaged test failures from CI labels Apr 23, 2025
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

@elasticsearchmachine elasticsearchmachine added needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Distributed Coordination Meta label for Distributed Coordination team labels Apr 23, 2025
@nielsbauman
Copy link
Contributor

Same as #127290

@JeremyDahlgren JeremyDahlgren self-assigned this Apr 29, 2025
JeremyDahlgren added a commit to JeremyDahlgren/elasticsearch that referenced this issue Apr 29, 2025
Changes "ForkJoinPool-" to "ForkJoinPool." in the
Thread getName().startsWith() checks in
HdfsClientThreadLeakFilter.  This resolves the
"There are still zombie threads that couldn't be terminated"
errors in the Hdfs IT tests.

Closes elastic#127290
Closes elastic#127289
Closes elastic#127288
Closes elastic#127287
JeremyDahlgren added a commit to JeremyDahlgren/elasticsearch that referenced this issue Apr 29, 2025
Changes "ForkJoinPool-" to "ForkJoinPool." in the
Thread getName().startsWith() checks in
HdfsClientThreadLeakFilter.  This resolves the
"There are still zombie threads that couldn't be terminated"
errors in the Hdfs IT tests.

Closes elastic#127290
Closes elastic#127289
Closes elastic#127288
Closes elastic#127287
@JeremyDahlgren JeremyDahlgren added low-risk An open issue or test failure that is a low risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Apr 29, 2025
JeremyDahlgren added a commit to JeremyDahlgren/elasticsearch that referenced this issue May 4, 2025
…#127534)

Adds the ForkJoinPool.commonPool-worker- prefix to the
Thread getName().startsWith() checks in HdfsClientThreadLeakFilter.
This resolves the
"There are still zombie threads that couldn't be terminated"
errors in the Hdfs IT tests.

Closes elastic#127290
Closes elastic#127289
Closes elastic#127288
Closes elastic#127287

(cherry picked from commit 4408e38)
JeremyDahlgren added a commit to JeremyDahlgren/elasticsearch that referenced this issue May 4, 2025
…#127534)

Adds the ForkJoinPool.commonPool-worker- prefix to the
Thread getName().startsWith() checks in HdfsClientThreadLeakFilter.
This resolves the
"There are still zombie threads that couldn't be terminated"
errors in the Hdfs IT tests.

Closes elastic#127290
Closes elastic#127289
Closes elastic#127288
Closes elastic#127287

(cherry picked from commit 4408e38)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs low-risk An open issue or test failure that is a low risk to future releases Team:Distributed Coordination Meta label for Distributed Coordination team >test-failure Triaged test failures from CI
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants