Skip to content

Commit 2f885b3

Browse files
authored
Update SPATIAL_BTREE.md
Add performance section
1 parent fe855fe commit 2f885b3

File tree

1 file changed

+21
-33
lines changed

1 file changed

+21
-33
lines changed

docs/SPATIAL_BTREE.md

+21-33
Original file line numberDiff line numberDiff line change
@@ -112,90 +112,78 @@ but is functionally more similar to the
112112
[R-tree](https://en.wikipedia.org/wiki/R-tree) and the
113113
[UB-tree](https://en.wikipedia.org/wiki/UB-tree).
114114

115-
<h3>R-tree</h3>
116-
<p>
115+
### R-tree
116+
117117
Like the R-tree each child rectangle is the minimum bounding
118118
rectangle of the entire child tree.
119-
</p>
120119

121-
<p>
122120
A difference is that the R-tree stores all items at the leaf level, just
123121
like a B+Tree. While the Spatial B-tree stores items in the branches and leaves,
124122
just like a standard B-tree.
125-
</p>
126123

127-
<p>
128124
Another difference is that during insertion the R-tree and it's variants, such
129125
as the R*tree, go to great lengths to determine the best ordering of the branch
130126
rectangles and items. Whenever a new item is inserted into an R-tree, from root
131127
to leaf, a complicated algorithm is used to choose the best child node to insert
132128
the item into. Depending the quality of that algorithm, which isn't always
133129
identical with every implementations, the performance of inserting and
134130
searching can vary greatly.
135-
</p>
136131

137-
<p>
138132
The Spatial B-tree on the other hand inserts items exactly like a standard
139-
B-tree, by ordering on the item's key. As <a href="#key-order">stated above</a>,
133+
B-tree, by ordering on the item's key. As [stated above](#key-order),
140134
this means that you must choose your keys wisely.
141-
<p>
142135

143-
<p>
144136
One R-tree variant worth noting is the
145-
<a href="https://en.wikipedia.org/wiki/Hilbert_R-tree">Hilbert R-tree</a>, which
137+
[Hilbert R-tree](https://en.wikipedia.org/wiki/Hilbert_R-tree), which
146138
stores items in linear order using a Hilbert curve. This provides excellent
147139
search performance compared to other R-trees, and its ordering of items is very
148140
similar to a Spatial B-tree using a Hilbert curve in its key. But the
149141
structure is a bit more complicated that a traditional R-tree, it must
150142
track both LHVs (Largest Hilbert Value) and MBRs (Minimum Bounding Rectangle)
151143
for leaves and branches. This leads to extra work to maintain. And insertions
152144
and deletions are generally less efficient than a Spatial B-tree.
153-
</p>
154-
155145

156-
<h3>UB-tree</h3>
157-
<p>
146+
### UB-tree
158147
The Spatial B-tree and UB-tree both store items linearly based on the key.
159-
</p>
160148

161-
<p>
162149
The UB-tree stores all items in the leaves (just like the R-tree), while the
163150
Spatial B-tree stores items in branches and leaves, like a standard B-tree.
164-
</p>
165151

166-
<p>
167152
Another difference is that the UB-tree is designed to order on a Z-order curve,
168153
while the Spatial B-tree doesn't care, leaving it up to you what the ordering
169154
is. This opens up the Spatial B-tree to different strategies, such as Z-order
170155
or Hilbert or something else.
171-
</p>
172156

173-
<p>
174157
Also the UB-tree does not store the MBRs (Minimum Bounding Rectangle) and
175158
thus cannot scan the tree for intersections like an R-tree and Spatial B-tree.
176159
Instead it needs to use an algorithm which basically looks
177160
at ranges of the Z-curve to find nearby nodes that overlap a target area.
178161
Effectively working kind of like the
179-
<a href="https://www.ibm.com/docs/en/db2/11.5?topic=concepts-geohashes-geohash-covers">Geohash covers</a>
162+
[Geohash covers](https://www.ibm.com/docs/en/db2/11.5?topic=concepts-geohashes-geohash-covers)
180163
algorithm.
181-
</p>
182164

183-
<p>
184165
In general the Spatial B-tree is designed to search like an R-tree but have the
185166
simplicity of a standard B-tree.
186-
</p>
187167

188-
<p>
189168
One more thing, the Spatial B-tree and UB-tree guarantee stable ordering
190169
of items, meaning that no matter what the order of inserts and deletes for a
191170
specific set of items might be, those items will always be returned in the same
192171
order when searching. R-tree ordering is unstable. This may be an important
193172
detail if you desire deterministic results.
194-
</p>
195173

196-
<h2>Implementation</h2>
174+
## Performance
175+
176+
The Spatial B-tree is as fast as a standard B-tree for inserts and deletes, which
177+
generally beats the R-tree. And is as fast as a Hilbert R-tree for searches when
178+
using hilbert curves.
179+
180+
Much depends on the quality of the implementation when measuring the performance of
181+
these kinds of data structures.
182+
183+
Here are some [benchmark results](https://github.com/tidwall/bgen#performance) comparing the Spatial B-tree to an
184+
R-tree with hilbert ordered inserts.
185+
And here's a fast C library for calculating a hilbert curve. [tidwall/curve](https://github.com/tidwall/curve).
186+
187+
## Implementation
197188

198-
<p>
199-
You can use the Spatial B-tree today using the
200-
<a href="https://github.com/tidwall/bgen">Bgen B-tree Generator</a> for C.
201-
</p>
189+
You can use the Spatial B-tree today using the [bgen: B-tree generator for C](https://github.com/tidwall/bgen).

0 commit comments

Comments
 (0)