Motivation
- A hash table is an implementation of Set/Map aiming for average (expected) constant-time operations.
Suppose we have an array:
data:image/s3,"s3://crabby-images/ddebc/ddebc1ed299f3d4bb688905b22e2638d4dccd84f" alt=""
And we want to store a bunch of elements in it:
$$ \text{cat}, \space \text{bat}, \space \text{tap}, \space \text{mad}, \text{dam}, \space \text{nap}, \space \text{pat} $$
What will we do?
Well, we most likely store them sequentially, one after another.
data:image/s3,"s3://crabby-images/19b7c/19b7cc289eb63708898f42cd3d3e58ec980638cc" alt=""
Here, insertion is fast (constant time), and searching is slow (linear time).
We can get a logarithmic-time search (via binary search) if we keep the data sorted, but that means extra work for insertion (linear time).
data:image/s3,"s3://crabby-images/40cac/40cac30ccdd18a634201c456eefd67e2d1f7dd83" alt=""
We could organize the data in a (balanced) binary search tree to get logarithmic-time insert/remove and find.
data:image/s3,"s3://crabby-images/a5638/a56386ceb3252da0c6e212a12ba864a459ae4980" alt=""
But is there any way to get constant time for all the core operations?
A hash table is a data structure aiming for average (expected) constant-time operations.