Joined 6/30/2020, 10:26:13 AM has 540 karma
DeepSeek's Multi-Head Latent Attention
Micrograd.jl
Covering All Birthdays
Generative transformer from first principles in Julia
Radix Tree in Julia
The Weiler-Atherton polygon clipping algorithm
Implementing the Gzip-kNN Classification Paper
How (not) to compare 2D scatter plots
Denoising Diffusion models from first principle in Julia
Building a transformer in Julia
Quaternions
Pinging the World from South Africa