You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Executing ESQL queries normally requires that the coordinator node and data nodes communicate: the coordinator sends logical plans to the data nodes, the data nodes send pages with results back to the coordinator.
In both directions, the transport message size seems to be unbounded, and there also seems to be no circuit breaker; we've seen cases where particularly large logical plans caused gigabytes of data to be in buffered in the NettyAllocator.
While some issues were addressed in #112008, #111447 and #111973, we should find other situations where this can happen, test it and fix it if needed. I.e.
Try to provoke huge LogicalPlans in other ways and test this.
Try to provoke huge pages to be sent from data nodes, e.g. super many columns, or with individual values that are huge etc.
This is similar to our HeapAttack tests, but distributed.
The text was updated successfully, but these errors were encountered:
This relates to a general problem where transport messages can become arbitrarily large; we should at least double check if this should maybe be solved in Elasticsearch more generally, rather than specifically for ESQL.
Executing ESQL queries normally requires that the coordinator node and data nodes communicate: the coordinator sends logical plans to the data nodes, the data nodes send pages with results back to the coordinator.
In both directions, the transport message size seems to be unbounded, and there also seems to be no circuit breaker; we've seen cases where particularly large logical plans caused gigabytes of data to be in buffered in the NettyAllocator.
While some issues were addressed in #112008, #111447 and #111973, we should find other situations where this can happen, test it and fix it if needed. I.e.
This is similar to our HeapAttack tests, but distributed.
The text was updated successfully, but these errors were encountered: