Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hashjoin indexing #1728

Merged
merged 31 commits into from
Apr 27, 2023
Merged

hashjoin indexing #1728

merged 31 commits into from
Apr 27, 2023

Conversation

jycor
Copy link
Contributor

@jycor jycor commented Apr 21, 2023

When we have plan of the following format:

InSubquery
    ...
    CrossJoin
        Left: Table
        Right: SubqueryAlias
            OuterScopeVisibility: true
            ...
            HashJoin
                HashLookup
                    source: ...
                    target: TableAlias
...

The indexes we assign to GetFields during analysis don't align with the indexes of the actual columns in each row during execution time. This is a result of StripNode, PrependNode, and the nested Joins with SubqueryAlias.

This error wasn't caught sooner as the incorrect indexes are too low, so they never threw IndexOutOfBounds errors and just returned potentially incorrect results instead.

The fix was to correct these indexes at analysis time.
Firstly, SubqueryAlias nodes with OuterScopeVisibilty = true inside joins need to see the left sibling node (in addition to the parent nodes). So Scope was modified to include some new fields, specifically for sibling nodes. Additionally, the file finalizeSubquery was changed to track the parent as well, so we could detect when we're analyzing a SubqueryAlias on the right side of a join, and add the left child to the scope.

Additionally, pushdownFilters was modified to not undo all the changes to the Analyzer for HashLookups.

At runtime, the PrependRow nodes cache the rows outside the InSubquery, while the buildJoinIter for CrossJoin would include both the outside and the left row. This meant that some parts of the inner HashJoin would receive extra columns while others didn't. The fix here was to alter the scope.InJoin depending on which parts of HashJoin we were building.

Lastly, to have these changes not break for PreparedStatements, we just needed to not redo finalizeUnions in postPrepared, as we don't replan joins in postPrepared, so we don't know if we're in a join or not, and the correct indexes are set in prePrepared.

Along the way, we discovered a query that panics, but the cause is different than the purpose of this fix, and it panicked the same way before these changes, so it is left as a skipped test.

Fix for: dolthub/dolt#5714

Copy link
Contributor

@max-hoffman max-hoffman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The PR summary is really good. The fixes are painful but maybe the only way to do this short of rewriting indexing. A comment pointing out how this fixes one layer of bugs from a larger general class of problems would be good. And then one more pass making sure we only target a specific edge case and are not causing collateral damage to currently working queries would be good. @zachmu this could use a pass, this is mostly my design and I hate it but I don't know a better way short of rewriting indexing.

Comment on lines +53 to +59
var joinParent *plan.JoinNode
var selFunc transform.SelectorFunc = func(c transform.Context) bool {
if jp, ok := c.Node.(*plan.JoinNode); ok {
joinParent = jp
}
return true
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this seems like it could have unpredictable effects. any reason why c.Parent.(*plan.JoinNode) below does not work?

var newSqa sql.Node
var same2 transform.TreeIdentity
var err error
if sqa.OuterScopeVisibility && joinParent != nil && !joinParent.Op.IsLeftOuter() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

!joinParent.Op.IsLeftOuter() seems suspicious, why would any of the indexing be dependent on the parent being a left join? We should probably add a test for left joins.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar to above comment, I think we should make sure sqa is the second child of the parent join. None of this generalizes, there are still holes in the logic, but we have some control over not making new bugs. Test to make sure we don't screw up the same case but with subquery alias on the left would be helpful.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The joinParent.Op.IsLeftOuter() check might've been added to prevent breaking other queries in a previous iteration, but it looks like it isn't needed now. I'll remove it.

var err error
if sqa.OuterScopeVisibility && joinParent != nil && !joinParent.Op.IsLeftOuter() {
if stripChild, ok := joinParent.Right().(*plan.StripRowNode); ok && stripChild.Child == sqa {
subScope := scope.newScopeInJoin(joinParent.Children()[0])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should add a comment that this only fixes 1-degree of join nesting, and that a proper fix would re-index the entire tree at the end of analysis. If the subquery has more than 1 sibling, or a sibling higher in the join tree, we currently do not detect it. We probably don't need tests for those edge cases yet. Just gotta hope no one hits them soon.

@@ -7522,6 +7522,37 @@ SELECT * FROM my_cte;`,
{2},
},
},
{
Query: `
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

a comment tagging the original issue might be worthwhile

Copy link
Member

@zachmu zachmu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My one general comment is that while this is fine tactically, it seems like we are way overdue for general transform logic in the analyzer to consider sibling nodes correctly. Similar bugs are going to continue biting us until we fix this.

@@ -180,10 +180,18 @@ func (b *ExecBuilder) buildHashJoin(j *hashJoin, input sql.Schema, children ...s
if err != nil {
return nil, err
}
oldInJoin := false
if j.g.m.scope != nil {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rather than temporarily monkeying with the scope like this, why not pass a modified copy directly to this function?


joinlessScopeLen := scopeLen
if scope.inJoin {
scope.SetJoin(false)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment. Use these as values to compute things you need, rather than temporarily modifying

outerAttrs, err := b.buildFilters(j.g.m.scope, j.right.relProps.OutputCols(), expression.Tuple(j.outerAttrs))
tmpScope := j.g.m.scope
if tmpScope != nil {
tmpScope = tmpScope.newScopeInJoin(nil)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could we have slightly more expressive methods? like j.g.m.scope.newScopeNoJoin()?

Copy link
Contributor

@max-hoffman max-hoffman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jycor jycor merged commit 42232b2 into main Apr 27, 2023
@jycor jycor deleted the james/cte3 branch July 21, 2023 17:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants