"Grokking": Generalisation Beyond Overfitting

An unusual result on small algorithmic datasets.

27th June 2023