Fusion tree

A fusion tree is a type of tree data structure that implements an associative array on w-bit integers. It uses O(n) space and performs searches in O(log_w n) time, which is asymptotically faster than a traditional self-balancing binary search tree, and actually better than the van Emde Boas tree when w is large. It achieves this speed by exploiting certain constant-time operations that can be done on a machine word. Fusion trees were invented in 1990 by Michael Fredman and Dan Willard.^[1]

Several advances have been made since Fredman and Willard's original 1990 paper. In 1999^[2] it was shown how to implement fusion trees under the AC⁰ model, in which multiplication no longer takes constant time. A dynamic version of fusion trees using hash tables was proposed in 1996^[3] which matched the O(log_w n) runtime in expectation. Another dynamic version using exponential tree was proposed in 2007^[4] which yields worst-case runtimes of O(log_w n + log log u) per operation, where u is the size of the largest key. It remains open whether dynamic fusion trees can achieve O(log_w n) per operation with high probability.

How it works

A fusion tree is essentially a B-tree with branching factor of w^1/5 (any small exponent is also possible), which gives it a height of O(log_w n). To achieve the desired runtimes for updates and queries, the fusion tree must be able to search a node containing up to w^1/5 keys in constant time. This is done by compressing ("sketching") the keys so that all can fit into one machine word, which in turn allows comparisons to be done in parallel. The rest of this article will describe the operation of a static Fusion Tree; that is, only queries are supported.

Sketching

Sketching is the method by which each w-bit key at a node containing k keys is compressed into only k-1 bits. Each key x may be thought of as a path in the full binary tree of height w starting at the root and ending at the leaf corresponding to x. To distinguish two paths, it suffices to look at their branching point (the first bit where the two keys differ). All k paths together have k-1 branching points, so at most k-1 bits are needed to distinguish any two of the k keys.

Visualization of the sketch function.

An important property of the sketch function is that it preserves the order of the keys. That is, sketch(x) < sketch(y) for any two keys x < y.

Approximating the sketch

If the locations of the sketch bits are b₁ < b₂ < ··· < b_r, then the sketch of the key x_w-1···x₁x₀ is the r-bit integer $x_{b_r}x_{b_{r-1}}\cdots x_{b_1}$ .

With only standard word operations, such as those of the C programming language, it is difficult to directly compute the sketch of a key in constant time. Instead, the sketch bits can be packed into a range of size at most r⁴, using bitwise AND and multiplication. The bitwise AND operation serves to clear all non-sketch bits from the key, while the multiplication shifts the sketch bits into a small range. Like the "perfect" sketch, the approximate sketch preserves the order of the keys.

Some preprocessing is needed to determine the correct multiplication constant. Each sketch bit in location b_i will get shifted to b_i + m_i via a multiplication by m = $\textstyle\sum_{i=1}^r$ 2^m_i. For the approximate sketch to work, the following three properties must hold:

b_i + m_j are distinct for all pairs (i, j). This will ensure that the sketch bits are uncorrupted by the multiplication.
b_i + m_j is a strictly increasing function of i. That is, the order of the sketch bits is preserved.
(b_r + m_r) - (b₁ - m₁) ≤ r⁴. That is, the sketch bits are packed into a range of size at most r⁴.

An inductive argument shows how the m_i can be constructed. Let m₁ = w − b₁. Suppose that 1 < t ≤ r and that m₁, m₂... m_t-1 have already been chosen. Then pick the smallest integer m_t such that both properties (1) and (2) are satisfied. Property (1) requires that m_t ≠ b_i − b_j + m_l for all 1 ≤ i, j ≤ r and 1 ≤ l ≤ t-1. Thus, there are less than tr² ≤ r³ values that m_t must avoid. Since m_t is chosen to be minimal, (b_t + m_t) ≤ (b_t-1 + m_t-1) + r³. This implies Property (3).

The approximate sketch is thus computed as follows:

Mask out all but the sketch bits with a bitwise AND.
Multiply the key by the predetermined constant m. This operation actually requires two machine words, but this can still by done in constant time.
Mask out all but the shifted sketch bits. These are now contained in a contiguous block of at most r⁴ < w^4/5 bits.

For the rest of this article, sketching will be taken to mean approximate sketching.

Parallel comparison

The purpose of the compression achieved by sketching is to allow all of the keys to be stored in one w-bit word. Let the node sketch of a node be the bit string

1sketch(x₁)1sketch(x₂)...1sketch(x_k)

We can assume that the sketch function uses exactly b ≤ r⁴ bits. Then each block uses 1 + b ≤ w^4/5 bits, and since k ≤ w^1/5, the total number of bits in the node sketch is at most w.

A brief notational aside: for a bit string s and nonnegative integer m, let s^m denote the concatenation of s to itself m times. If t is also a bit string st denotes the concatenation of t to s.

The node sketch makes it possible to search the keys for any b-bit integer y. Let z = (0y)^k, which can be computed in constant time (multiply y by the constant (0^b1)^k). Note that 1sketch(x_i) - 0y is always positive, but preserves its leading 1 iff sketch(x_i) ≥ y. We can thus compute the smallest index i such that sketch(x_i) ≥ y as follows:

Subtract z from the node sketch.
Take the bitwise AND of the difference and the constant (10^b)^k. This clears all but the leading bit of each block.
Find the most significant bit of the result.
Compute i, using the fact that the leading bit of the i-th block has index i(b+1).

Desketching

For an arbitrary query q, parallel comparison computes the index i such that

sketch(x_i-1) ≤ sketch(q) ≤ sketch(x_i)

Unfortunately, the sketch function is not in general order-preserving outside the set of keys, so it is not necessarily the case that x_i-1 ≤ q ≤ x_i. What is true is that, among all of the keys, either x_i-1 or x_i has the longest common prefix with q. This is because any key y with a longer common prefix with q would also have more sketch bits in common with q, and thus sketch(y) would be closer to sketch(q) than any sketch(x_j).

The length longest common prefix between two w-bit integers a and b can be computed in constant time by finding the most significant bit of the bitwise XOR between a and b. This can then be used to mask out all but the longest common prefix.

Note that p identifies exactly where q branches off from the set of keys. If the next bit of q is 0, then the successor of q is contained in the p1 subtree, and if the next bit of q is 1, then the predecessor of q is contained in the p0 subtree. This suggests the following algorithm:

Use parallel comparison to find the index i such that sketch(x_i-1) ≤ sketch(q) ≤ sketch(x_i).
Compute the longest common prefix p of q and either x_i-1 or x_i (taking the longer of the two).
Let l-1 be the length of the longest common prefix p.
1. If the l-th bit of q is 0, let e = p10^w-l. Use parallel comparison to search for the successor of sketch(e). This is the actual predecessor of q.
2. If the l-th bit of q is 1, let e = p01^w-l. Use parallel comparison to search for the predecessor of sketch(e). This is the actual successor of q.
Once either the predecessor or successor of q is found, the exact position of q among the set of keys is determined.

References

↑ M. L. Fredman and D. E. Willard. BLASTING through the information theoretic barrier with FUSION TREES. Proceedings of the twenty-second annual ACM symposium on Theory of Computing, 1–7, 1990.
↑ A. Andersson, P. B. Miltersen, and M. Thorup. Fusion trees can be implemented with AC0 instructions only. Theoretical Computer Science, 215:337–344, 1999.
↑ R. Raman. Priority queues: Small, monotone, and trans-dichotomous. Algorithms - ESA 1996, 121–137, 1996.
↑ A. Andersson and M. Thorup. Dynamic ordered sets with exponential search trees. Journal of the ACM, 54:3:13, 2007.

MIT CS 6.897: Advanced Data Structures: Lecture 4, Fusion Trees, Prof. Erik Demaine (Spring 2003)
MIT CS 6.897: Advanced Data Structures: Lecture 5, More fusion trees; self-organizing data structures, move-to-front, static optimality, Prof. Erik Demaine (Spring 2003)
MIT CS 6.851: Advanced Data Structures: Lecture 13, Fusion Tree notes, Prof. Erik Demaine (Spring 2007)
MIT CS 6.851: Advanced Data Structures: Lecture 12, Fusion Tree notes, Prof. Erik Demaine (Spring 2012)

[1] M. L. Fredman and D. E. Willard. BLASTING through the information theoretic barrier with FUSION TREES. Proceedings of the twenty-second annual ACM symposium on Theory of Computing, 1–7, 1990.

[2] A. Andersson, P. B. Miltersen, and M. Thorup. Fusion trees can be implemented with AC0 instructions only. Theoretical Computer Science, 215:337–344, 1999.

[3] R. Raman. Priority queues: Small, monotone, and trans-dichotomous. Algorithms - ESA 1996, 121–137, 1996.

[4] A. Andersson and M. Thorup. Dynamic ordered sets with exponential search trees. Journal of the ACM, 54:3:13, 2007.

[1]

[2]

[3]

[4]

v t e Tree data structures
Search trees (dynamic sets/associative arrays)	2–3 2–3–4 AA (a,b) AVL B B+ B* B^x (Optimal) Binary search Dancing HTree Interval Order statistic (Left-leaning) Red-black Scapegoat Splay T Treap UB Weight-balanced
Heaps	Binary Binomial Fibonacci Leftist Pairing Skew Van Emde Boas
Tries	Ctrie C-trie (compressed ADT) Hash Radix Suffix Ternary search X-fast Y-fast
Spatial data partitioning trees	BK BSP Cartesian Hilbert R k-d (implicit k-d) M Metric MVP Octree Priority R Quad R R+ R* Segment VP X
Other trees	Cover Exponential Fenwick Finger Fractal tree index Fusion Hash calendar iDistance K-ary Left-child right-sibling Link/cut Log-structured merge Merkle PQ Range SPQR Top

Fusion tree

Contents

How it works

Sketching

Approximating the sketch

Parallel comparison

Desketching

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools