-
Notifications
You must be signed in to change notification settings - Fork 25.2k
ESQL: Fix count optimization with pushable union types #127225
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ESQL: Fix count optimization with pushable union types #127225
Conversation
Hi @alex-spies, I've created a changelog YAML for you. |
8e4867a
to
c90cacf
Compare
Pinging @elastic/es-analytical-engine (Team:Analytics) |
Ensure that the code path with pushdown is actually covered.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interesting! I did not know the bug was about Lucene pushdown of STATS.
x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/action/EsqlCapabilities.java
Outdated
Show resolved
Hide resolved
// This means that an EsRelation[field1, field2, field3] where field1 and field 3 are missing will be replaced by | ||
// Project[field1, field2, field3] <- keeps the ordering intact | ||
// \_Eval[field1 = null, field3 = null] | ||
// \_EsRelation[field2] | ||
// \_EsRelation[field1, field2, field3] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the field is missing, would it be in the EsRelation at all?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes! Because we're in the local optimizer, the field does exist overall, and thus is put into the EsRelation after the field caps call on the coordinator. But on the local node it's missing! (Or in the search stats that this optimization run uses, to be more precise) This optimizer rule applies exclusively to such fields.
@@ -94,15 +94,19 @@ private Tuple<List<Attribute>, List<EsStatsQueryExec.Stat>> pushableStats( | |||
// check if regular field | |||
else { | |||
if (target instanceof FieldAttribute fa) { | |||
var fName = fa.name(); | |||
var fName = fa.fieldName(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wow! Is this the fix? I imagine this could have impacts in many places, so could this fix other bugs we've not noticed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep, this is the fix. Simple oversight to use the correct field name - in the past, the field attribute name and the field name were the same, but union types had to break with this pattern.
I do not see other situations that this may fix, too, because the specific stats-pushdown optimization only triggers in a narrow slice of queries, anyway.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What I'm wondering is if we could make this mistake again and if we could prevent it.
Also, should we insert filters here using #fieldName()
?
Line 58 in 9e0a5af
if (field.foldable() == false && field instanceof FieldAttribute fa && stats.isIndexed(fa.name())) { |
query = QueryBuilders.existsQuery(fieldName); | ||
} | ||
} | ||
} | ||
if (fieldName != null) { | ||
if (count.hasFilter()) { | ||
// Note: currently, it seems like we never actually perform stats pushdown if we reach here. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems the question of what exactly gets pushed down and why is subtle. I wonder if we want some documentation on this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When we touch this rule again, I think we should add unit tests that demonstrate exactly what is pushed down and how.
I also found that union type filters don't seem to be pushed to Lucene, yet - maybe we could improve the documentation as part of that work?
If you agree, I can open up an issue for this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// Note: currently, it seems like we never actually perform stats pushdown if we reach here.
We evolved to this. Before the aggs filter were extracted into an upstream filter and pushed down, this code was "alive".
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep yep, and if we manage to evolve the optimization to multiple stats, this code could come alive again.
There's an argument to be made that it should be deleted as long as it's dead - but properly double checking that this code path really is never ever used atm was beyond the scope that I could allocate to this bug fix.
So leaving a comment was the next best thing for the time being, I think :)
When pushing down STATS count(field::type) to Lucene for a union-typed field, use the correct field name in the Lucene query and not the synthetic attribute name $$field$converted_to$type.
When pushing down STATS count(field::type) to Lucene for a union-typed field, use the correct field name in the Lucene query and not the synthetic attribute name $$field$converted_to$type.
When pushing down STATS count(field::type) to Lucene for a union-typed field, use the correct field name in the Lucene query and not the synthetic attribute name $$field$converted_to$type.
// For any missing field, place an Eval right after the EsRelation to assign null values to that attribute (using the same name | ||
// id!), thus avoiding that InsertFieldExtrations inserts a field extraction later. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
query = QueryBuilders.existsQuery(fieldName); | ||
} | ||
} | ||
} | ||
if (fieldName != null) { | ||
if (count.hasFilter()) { | ||
// Note: currently, it seems like we never actually perform stats pushdown if we reach here. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// Note: currently, it seems like we never actually perform stats pushdown if we reach here.
We evolved to this. Before the aggs filter were extracted into an upstream filter and pushed down, this code was "alive".
@@ -94,15 +94,19 @@ private Tuple<List<Attribute>, List<EsStatsQueryExec.Stat>> pushableStats( | |||
// check if regular field | |||
else { | |||
if (target instanceof FieldAttribute fa) { | |||
var fName = fa.name(); | |||
var fName = fa.fieldName(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What I'm wondering is if we could make this mistake again and if we could prevent it.
Also, should we insert filters here using #fieldName()
?
Line 58 in 9e0a5af
if (field.foldable() == false && field instanceof FieldAttribute fa && stats.isIndexed(fa.name())) { |
When pushing down STATS count(field::type) to Lucene for a union-typed field, use the correct field name in the Lucene query and not the synthetic attribute name $$field$converted_to$type.
When pushing down STATS count(field::type) to Lucene for a union-typed field, use the correct field name in the Lucene query and not the synthetic attribute name $$field$converted_to$type.
Oh, too late.
I will follow up on this. |
@bpintea , your comment #127225 (comment) is an excellent find. Dang, we missed another usage of Thank you for following up on this! Maybe let's also add a javadoc comment to To remove the sharp edge, we could consider refactoring methods using the actual field name to use an |
I think it's worth tracking this in an issue. Opened #127521 |
Fix #127200