[CI] APMYamlTestSuiteIT test {yaml=/20_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} failing #118875

Open
elasticsearchmachine opened this issue Dec 17, 2024 · 7 comments
Assignees
Labels
:Data Management/Data streams Data streams and their lifecycles low-risk An open issue or test failure that is a low risk to future releases Team:Data Management Meta label for data/management team >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Collaborator

elasticsearchmachine commented Dec 17, 2024

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:apm-data:yamlRestTest" --tests "org.elasticsearch.xpack.apmdata.APMYamlTestSuiteIT" -Dtests.method="test {yaml=/20_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline}" -Dtests.seed=BD8F3887A7A3E5C5 -Dtests.locale=ce-Cyrl-RU -Dtests.timezone=ROK -Druntime.java=17

Applicable branches:
8.17

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.AssertionError: Failure at [/20_metrics_ingest:32]: field [hits.hits.1.fields.event\.ingested.0] is null
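For readers unfamiliar with the path in this message: the YAML REST tests address response fields with a dotted path, where `\.` escapes a literal dot inside a field name. The following is a minimal sketch of that lookup, not the test framework's actual implementation. The helper names `split_path` and `resolve` and the sample response are hypothetical, made up purely for illustration.

```python
import re


def split_path(path):
    """Split a dotted path on unescaped dots, then unescape '\\.' to '.'."""
    parts = re.split(r"(?<!\\)\.", path)
    return [p.replace("\\.", ".") for p in parts]


def resolve(doc, path):
    """Walk dicts by key and lists by integer index; return None if anything is missing."""
    current = doc
    for key in split_path(path):
        if isinstance(current, list):
            index = int(key)
            current = current[index] if index < len(current) else None
        elif isinstance(current, dict):
            current = current.get(key)
        else:
            return None
        if current is None:
            return None
    return current


# Hypothetical search response: the second hit has no event.ingested value.
response = {
    "hits": {
        "hits": [
            {"fields": {"event.ingested": ["2024-12-17T00:00:00.000Z"]}},
            {"fields": {}},
        ]
    }
}

first = resolve(response, r"hits.hits.0.fields.event\.ingested.0")
second = resolve(response, r"hits.hits.1.fields.event\.ingested.0")
```

Under these assumptions, the failing assertion corresponds to the second lookup: the hit at index 1 came back without an `event.ingested` field, so the path resolves to null.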

Issue Reasons:

  • [8.17] 3 failures in test test {yaml=/20_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} (0.6% fail rate in 501 executions)
  • [8.17] 2 failures in pipeline elasticsearch-periodic-platform-support (11.8% fail rate in 17 executions)
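The percentages quoted in these reasons appear to be plain failures-over-executions ratios rounded to one decimal place. A minimal sketch of that arithmetic follows; the `fail_rate` helper is a hypothetical name, and the rounding rule is inferred from the numbers in this thread rather than taken from the triage automation itself.

```python
def fail_rate(failures, executions):
    """Return the failure rate as a percentage, rounded to one decimal place."""
    return round(100.0 * failures / executions, 1)


# Figures taken from the issue and mute reasons in this thread.
test_rate_817 = fail_rate(3, 501)      # test failures on 8.17
pipeline_rate_817 = fail_rate(2, 17)   # elasticsearch-periodic-platform-support on 8.17
```

The same calculation is consistent with the mute reasons reported later for 8.x (2 in 356) and 8.16 (4 in 665, and 2 in 21 for the pipeline).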

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Data Management/Data streams Data streams and their lifecycles >test-failure Triaged test failures from CI labels Dec 17, 2024
elasticsearchmachine added a commit that referenced this issue Dec 17, 2024
…0_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} #118875
@elasticsearchmachine
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 2 failures in test test {yaml=/20_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} (0.6% fail rate in 356 executions)

Build Scans:

@elasticsearchmachine elasticsearchmachine added Team:Data Management Meta label for data/management team needs:risk Requires assignment of a risk label (low, medium, blocker) labels Dec 17, 2024
@elasticsearchmachine
Collaborator Author

Pinging @elastic/es-data-management (Team:Data Management)

@dakrone
Member

dakrone commented Dec 17, 2024

@inge4pres I believe this is related to the recent APM work. Can you take a look at it?

@dakrone dakrone added low-risk An open issue or test failure that is a low risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Dec 17, 2024
@elasticsearchmachine
Collaborator Author

This has been muted on branch 8.16

Mute Reasons:

  • [8.16] 4 failures in test test {yaml=/20_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} (0.6% fail rate in 665 executions)
  • [8.16] 2 failures in pipeline elasticsearch-periodic (9.5% fail rate in 21 executions)

Build Scans:

elasticsearchmachine added a commit that referenced this issue Jan 30, 2025
…0_metrics_ingest/Test metrics-apm.app-* setting event.ingested via ingest pipeline} #118875
@inge4pres
Contributor

> @inge4pres I believe this is related to the recent APM work. Can you take a look at it?

Any hints on why the ingest pipeline would behave differently than before?

We introduced a change in how we populate APM data in 8.16 and 8.17 with #119209 and #119208, and we haven't seen such failures happen.

The same change is also in main, so should we also expect failures for 8.18?

The failure message is suspicious:

java.lang.AssertionError: Failure at [/20_metrics_ingest:32]: field [hits.hits.1.fields.event\.ingested.0] is null

I don't see how it could be null, since we populate it with the ingest pipeline.

@simitt
Contributor

simitt commented Feb 24, 2025

Nothing in https://github.com/elastic/elasticsearch/blob/8.17/x-pack/plugin/apm-data/src/main/resources/ingest-pipelines/apm%40pipeline.yaml#L27:L36 looks wrong to me and the pipeline is called (e.g. for traces in https://github.com/elastic/elasticsearch/blob/8.17/x-pack/plugin/apm-data/src/main/resources/ingest-pipelines/traces-apm%40pipeline.yaml#L42).
@dakrone I don't think we do anything really special here, do you spot anything obvious? Or could there be cases where the pipeline is not properly loaded?
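One way to test the "pipeline not properly loaded" hypothesis would be to poll for the pipeline before indexing documents. The sketch below is only an illustration under stated assumptions: `wait_for_pipeline` and the `exists` callable are hypothetical, and the pipeline id `apm@pipeline` is inferred from the file name linked above. In practice `exists` could wrap a `GET _ingest/pipeline/<id>` request, which returns 404 when the pipeline is absent.

```python
import time


def wait_for_pipeline(exists, pipeline_id, timeout_s=10.0, interval_s=0.2,
                      sleep=time.sleep, clock=time.monotonic):
    """Poll exists(pipeline_id) until it returns True or the timeout elapses.

    `exists` is a caller-supplied (hypothetical) callable that reports whether
    the pipeline is currently installed. Returns True if the pipeline was seen,
    False if the deadline passed first.
    """
    deadline = clock() + timeout_s
    while True:
        if exists(pipeline_id):
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


# Illustrative stand-in for a real cluster check: the pipeline "appears"
# on the third poll.
calls = []


def fake_exists(pipeline_id):
    calls.append(pipeline_id)
    return len(calls) >= 3


loaded = wait_for_pipeline(fake_exists, "apm@pipeline", sleep=lambda _: None)
```

If a guard like this removed the intermittent null, that would point toward a race between pipeline installation and the first indexed document, rather than toward the pipeline definition itself.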

@dakrone
Member

dakrone commented Mar 4, 2025

It's possible. We've opened #101768, as this is something we have seen before, though rarely.


4 participants