Nested loop join

A nested loop join is a naive algorithm that joins two sets by using two nested loops.[1] Join operations are important for database management.

Algorithm

Two relations and are joined as follows:

algorithm nested_loop_join is
    for each tuple r in R do
        for each tuple s in S do
            if r and s satisfy the join condition then
                yield tuple <r,s>

This algorithm will involve nr*bs+ br block transfers and nr+br seeks, where br and bs are number of blocks in relations R and S respectively, and nr is the number of tuples in relation R.

The algorithm runs in I/Os, where and is the number of tuples contained in and respectively and can easily be generalized to join any number of relations ...

The block nested loop join algorithm is a generalization of the simple nested loops algorithm that takes advantage of additional memory to reduce the number of times that the relation is scanned.

Index join variation

If the inner relation has an index on the attributes used in the join, then the naive nest loop join can be replaced with an index join.

algorithm index_join is
    for each tuple r in R do
        for each tuple s in S in the index lookup do
            yield tuple <r,s>

The time complexity for this variation improves from O(M * N) to O(M * log N).

gollark: How do CC programs link any CC APIs?
gollark: This seems like makefiles but Lua-y.
gollark: What's an alfons?
gollark: Hmm, my compile process has become an unholy abomination consisting of a python script, shellscript, a Node.js program for Lua bundling, and Perl for some reason.
gollark: Deploying orbital laser strike.

See also

References


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.