hcplantern's garden

Search

Recent Notes

9-设计模式
Dec 26, 2024
8-中间件
Dec 26, 2024
7-Spring
Dec 26, 2024
12-消息队列
Dec 26, 2024
11-设计题
Dec 26, 2024

❯

❯

❯

Hash Join

Sep 22, 20231 min read

database

Naive Hash Join

Naive Hash Join
Naive Hash Join performs an in-memory hash table build and then join.

Prerequisite: Relation $R$ can fit into memory (that is, having $R being \leq B - 2 pages big$ ). And this often not going to be possible, so solution is Grace Hash Join.

It’s basic idea is that we read all pages of relation $R$ , building an in-memory hash table, and then read in each records of $S$ to look it up in $R^{'} s$ table.
Link to original

Grace Hash Join

Grace Hash Join

Partitioning phase: We try to split R and S into partitions. Each partition has $R_{i}$ and $S_{i}$ (i.e. partition i of $R$ and partition i of $S$ ) and make sure either $R_{i}$ or $S_{i} \leq B - 2$ pages. If not, recursively do partition.

Make sure records with same hash value are in the same partition.

Build & Probe Phase: Load the smaller partition into memory and build an in-memory hash table. Perform a Naive Hash Join with the larger partition in the pair.

I/O COST:

First phase: read + write both relations

$2 ([R] + [S])$ I/Os

Second phase: read both relations, forward output

$[R] + [S]$ I/Os

Total cost of 2-pass hash join = $3 ([R] + [S])$

Link to original

Naive Hash Join
Grace Hash Join

Backlinks

Join

Graph View

Created with Quartz v4.2.3 © 2024

GitHub
Blog
RSS