Fleet search using wait_for_checkpoints can fail if the node executing the search is recovering

Currently searches can potentially be executed in an `INITIALIZING` shard: https://github.com/elastic/elasticsearch/blob/b90f374339dfd9595649940205b0e0b64a5e667b/server/src/main/java/org/elasticsearch/cluster/routing/OperationRouting.java#L249-L259

This means that the shard could be going through recovery and the requested checkpoint through the `wait_for_checkpoint` might be greater than the current max seq no throwing the following exception:
```
Cannot wait for unissued seqNo checkpoint [wait_for_checkpoint=1299, max_issued_seqNo=0]
```

If the shard where the search request gets executed is `INITIALIZING` we should wait until it moves to `STARTED` or even consider if we should just avoid executing search requests with `wait_for_checkpoints` in such shards.

	private ShardIterator shardRoutings(
	IndexShardRoutingTable indexShard,
	@Nullable ResponseCollectorService collectorService,
	@Nullable Map<String, Long> nodeCounts
	) {
	if (useAdaptiveReplicaSelection) {
	return indexShard.activeInitializingShardsRankedIt(collectorService, nodeCounts);
	} else {
	return indexShard.activeInitializingShardsRandomIt();
	}
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fleet search using wait_for_checkpoints can fail if the node executing the search is recovering #130555

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Fleet search using wait_for_checkpoints can fail if the node executing the search is recovering #130555

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions