LLM Hacker News 4h ago 2 min read
HN upvoted MacMind because it shrinks transformer mystique to something inspectable: 1,216 parameters in HyperTalk on a Macintosh SE/30. The demo learns bit-reversal for FFT using embeddings, positional encoding, self-attention, backpropagation and gradient descent.