ESQL: Fix LOOKUP attribute shadowing #109807

alex-spies · 2024-06-17T11:42:49Z

Fix #109392

This makes attribute shadowing of LOOKUP consistent with ENRICH, DISSECT/GROK and EVAL.

elasticsearchmachine · 2024-06-17T16:37:22Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

elasticsearchmachine · 2024-06-17T16:37:22Z

Hi @alex-spies, I've created a changelog YAML for you.

alex-spies · 2024-06-17T16:37:47Z

x-pack/plugin/esql/qa/testFixtures/src/main/resources/dissect.csv-spec

@@ -14,6 +14,43 @@ foo bar | null | null
 ;


+shadowing


Added analogous csv tests for LOOKUP, ENRICH, DISSECT, GROK and EVAL, since they should all handle attribute shadowing identically.

alex-spies · 2024-06-17T16:39:48Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plan/logical/Lookup.java

@@ -39,7 +38,7 @@ public class Lookup extends UnaryPlan {
 /**
 * References to the input fields to match against the {@link #localRelation}.
 */
- private final List<NamedExpression> matchFields;


These are always attributes (unresolved or resolved); the full generality of NamedExpression (including Alias etc.) makes reasoning about the match fields later down the road needlessly hard.

alex-spies · 2024-06-17T16:40:59Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plan/logical/join/Join.java

+ List<Attribute> fieldsAddedFromRight = removeCollisionsWithMatchFields(rightOutput, matchFieldSet, matchFieldNames);
+ yield mergeOutputAttributes(makeNullable(makeReference(fieldsAddedFromRight)), leftOutput);


Using mergeOutputAttributes correctly here is the crux of this PR.
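To make the shadowing rule concrete, here is a minimal, self-contained sketch. It does not use the actual Elasticsearch classes: `Attr` and `ShadowingSketch.merge` are hypothetical stand-ins. The ordering it produces (surviving input columns first, newly added columns last) is an assumption modelled on how EVAL treats a redefined column, not a copy of mergeOutputAttributes.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class ShadowingSketch {
    /** Hypothetical stand-in for a plan attribute: a column name plus the command that produced it. */
    record Attr(String name, String origin) {}

    /**
     * Merges newly added columns into the child's output.
     * A child column whose name matches an added column is shadowed (dropped);
     * the added columns are appended after the surviving child columns.
     * Assumes the names in {@code added} are unique.
     */
    static List<Attr> merge(List<Attr> added, List<Attr> childOutput) {
        Set<String> addedNames = new HashSet<>();
        for (Attr a : added) {
            addedNames.add(a.name());
        }

        List<Attr> output = new ArrayList<>(childOutput.size() + added.size());
        for (Attr c : childOutput) {
            if (addedNames.contains(c.name()) == false) {
                output.add(c); // not shadowed, keep in its original position
            }
        }
        output.addAll(added); // shadowing columns win and go last
        return output;
    }

    public static void main(String[] args) {
        List<Attr> left = List.of(new Attr("emp_no", "FROM"), new Attr("language_name", "FROM"));
        List<Attr> right = List.of(new Attr("language_name", "LOOKUP"));
        System.out.println(merge(right, left));
    }
}
```

In this sketch the `language_name` column coming from the lookup side replaces the one from the input, which is the behaviour the csv tests compare across LOOKUP, ENRICH, DISSECT, GROK and EVAL.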

alex-spies · 2024-06-17T16:44:16Z

I'm a little conflicted on whether some logical plans should have unit tests. Arguably, we do catch inconsistent logical plan outputs either via the dependency checker or in csv tests; on the other hand, computing the output, especially for Join, is complex enough that I'd normally want to unit-test it specifically.
For this PR, I stayed consistent with the existing test infrastructure and only tested via csv tests.

astefan

LGTM

astefan · 2024-06-18T12:52:30Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plan/logical/join/Join.java

- }
- }
- return results;
+ return attributes.stream().filter(attr -> matchFields.contains(attr) || matchFieldNames.contains(attr.name()) == false).toList();


We tend to avoid using streams, except in tests, as they're an unnecessary runtime complication.
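A loop-based equivalent of the stream expression in the diff could look like the sketch below. The predicate is copied from the `+` line; everything else is an assumption made for illustration: `Attr` is a hypothetical stand-in for Attribute, `filterCollisions` is a made-up method name, and the parameter types are guessed from the call site. It is not the code that was committed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

class RemoveCollisionsSketch {
    /** Hypothetical stand-in for Attribute: same name, different id means a different attribute. */
    record Attr(String name, int id) {}

    /**
     * Keeps an attribute if it is itself one of the match fields,
     * or if its name does not collide with any match field name.
     */
    static List<Attr> filterCollisions(List<Attr> attributes, Set<Attr> matchFields, Set<String> matchFieldNames) {
        List<Attr> result = new ArrayList<>(attributes.size());
        for (Attr attr : attributes) {
            if (matchFields.contains(attr) || matchFieldNames.contains(attr.name()) == false) {
                result.add(attr);
            }
        }
        return result;
    }
}
```

The loop allocates one list and avoids the stream pipeline, at the cost of a few more lines.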

nik9000 · 2024-06-20T12:56:13Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/plan/physical/HashJoinExec.java

 public HashJoinExec(PlanStreamInput in) throws IOException {
 super(Source.readFrom(in), in.readPhysicalPlanNode());
 this.joinData = new LocalSourceExec(in);
- this.matchFields = in.readNamedWriteableCollectionAsList(NamedExpression.class);
+ this.matchFields = (List<Attribute>) (List) in.readNamedWriteableCollectionAsList(NamedExpression.class);


Since we haven't cut a release with this and we don't allow these constructs in serverless yet, I think you can change this to in.readNamedWriteableCollectionAsList(Attribute.class). There isn't any code that'll ever send anything that isn't an Attribute.

Also, if these were always Attribute subclasses in practice then this'd be safe here too - it's just on the read side.
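The difference between the double cast in the diff and reading with the narrower class can be shown with a small standalone sketch. The types below are hypothetical stand-ins declared locally, not the ESQL classes, and `readChecked` only illustrates per-element checking in general; how readNamedWriteableCollectionAsList validates its elements is not shown in this thread.

```java
import java.util.ArrayList;
import java.util.List;

class UncheckedCastSketch {
    // Hypothetical stand-ins for the real expression hierarchy.
    interface NamedExpression { String name(); }
    record Attribute(String name) implements NamedExpression {}
    record Alias(String name) implements NamedExpression {}

    /** Mirrors the double cast from the diff: it compiles, but nothing is checked at runtime. */
    @SuppressWarnings({ "unchecked", "rawtypes" })
    static List<Attribute> castBlindly(List<NamedExpression> in) {
        return (List<Attribute>) (List) in;
    }

    /** Checks every element while copying, so a non-Attribute fails here rather than at a later use. */
    static List<Attribute> readChecked(List<NamedExpression> in) {
        List<Attribute> out = new ArrayList<>(in.size());
        for (NamedExpression e : in) {
            out.add(Attribute.class.cast(e)); // throws ClassCastException on an Alias
        }
        return out;
    }
}
```

With the blind cast, a stray non-Attribute would only surface when some later code reads the element as an Attribute; with the checked variant it surfaces at read time. If only Attributes are ever sent, as the comment argues, both behave the same.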

alex-spies added 6 commits June 17, 2024 13:42

Add basic shadowing tests

4790c31

WIP: add some comments

760d99b

Remove variable shadowing for RIGHT/CROSS JOIN

4c3930d

Simplify Join/Lookup matchfields

14f76bc

Add more tests

75df3a9

Make shadowing work correctly

b59e79e

elasticsearchmachine added the v8.15.0 label Jun 17, 2024

alex-spies added 7 commits June 17, 2024 14:46

Fix csv test bwc

953201f

Align Lookup.output() with Join.output()

465d634

Simplify Join.computeOutput()

182c476

Remove obsolete Join.mergeOutput

447156f

Fix makeReference

51d4a92

More enrich tests

7260775

Make shadowing csv tests consistent

4b6d9e8

alex-spies added >bug :Analytics/ES|QL AKA ESQL labels Jun 17, 2024

alex-spies marked this pull request as ready for review June 17, 2024 16:36

Merge remote-tracking branch 'upstream/main' into fix-lookup-var-shad…

2a90ff1

…owing

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Jun 17, 2024

Update docs/changelog/109807.yaml

3c42d26

alex-spies requested review from nik9000 and astefan June 17, 2024 16:42

astefan approved these changes Jun 18, 2024

Merge remote-tracking branch 'upstream/main' into fix-lookup-var-shad…

ad7595e

…owing

nik9000 approved these changes Jun 20, 2024

nik9000 mentioned this pull request Jun 20, 2024

ESQL: LOOKUP followups #109353

Open

10 tasks

alex-spies added 4 commits June 20, 2024 15:00

Unstreamify

123e8d6

Deserialize as Attributes

ed1025e

Remove leftover

5b4b8c6

Add non-limit-0 cases to enrich shadowing tests

91eb5ab

alex-spies added the auto-merge Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jun 20, 2024

alex-spies mentioned this pull request Jun 20, 2024

ESQL: Fix Join references #109989

Merged

alex-spies added 2 commits June 25, 2024 09:57

Merge remote-tracking branch 'upstream/main' into fix-lookup-var-shad…

74cdf43

…owing

Update EsqlCapabilities

8ce8d0a

elasticsearchmachine merged commit 5b959b9 into elastic:main Jun 25, 2024
15 checks passed

alex-spies deleted the fix-lookup-var-shadowing branch June 25, 2024 09:07
