Writing

GPUs, RAG, embeddings, and the rabbit holes in between — published here first, sometimes mirrored to Medium.