Sebastian Raschka

Sebastian Raschka is a machine learning researcher and educator known for his accessible technical writing and his ability to explain frontier AI concepts clearly without sacrificing depth.

Notable Work

Views on Architecture

Raschka’s consistent position is that the transformer architecture is fundamentally unchanged since GPT-2. Modern frontier models are still autoregressive, decoder-only transformers built from attention and feed-forward (FFN) blocks. The core is the same; the innovations are engineering tweaks for efficiency and capacity: mixture-of-experts (MoE), grouped-query attention (GQA), multi-head latent attention (MLA), RMSNorm, and SwiGLU.
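
To make the "same core, swapped components" claim concrete, here is a minimal pure-Python sketch of one pre-norm decoder block in which the normalization and FFN are pluggable. It is an illustration under stated assumptions, not Raschka's code or any model's actual implementation: attention is single-head with identity Q/K/V projections, learned scale and shift parameters are omitted, and all names (decoder_block, swiglu_ffn, the toy sizes D and H) are hypothetical.

```python
import math
import random


def layer_norm(x, eps=1e-5):
    # GPT-2-style norm: center on the mean, divide by the standard deviation.
    # (Learned scale/shift omitted for brevity.)
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def rms_norm(x, eps=1e-5):
    # RMSNorm: divide by the root-mean-square only, with no mean-centering.
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v / rms for v in x]


def matvec(W, x):
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def gelu(v):
    # tanh approximation of GELU
    return 0.5 * v * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (v + 0.044715 * v ** 3)))


def silu(v):
    return v / (1.0 + math.exp(-v))


def gelu_ffn(x, W_in, W_out):
    # Classic two-matrix FFN: expand, apply GELU, project back.
    return matvec(W_out, [gelu(h) for h in matvec(W_in, x)])


def swiglu_ffn(x, W_gate, W_up, W_out):
    # SwiGLU: a SiLU-activated gate multiplies a second linear "up" projection.
    gate = [silu(g) for g in matvec(W_gate, x)]
    up = matvec(W_up, x)
    return matvec(W_out, [g * u for g, u in zip(gate, up)])


def causal_attention(xs):
    # Single-head attention with identity Q/K/V projections (an assumption made
    # for brevity). Position t attends only to positions 0..t.
    d = len(xs[0])
    out = []
    for t, q in enumerate(xs):
        scores = [sum(a * b for a, b in zip(q, xs[s])) / math.sqrt(d) for s in range(t + 1)]
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([sum(w / total * xs[s][i] for s, w in enumerate(weights)) for i in range(d)])
    return out


def decoder_block(xs, norm, ffn):
    # Pre-norm residual block: x + attn(norm(x)), then x + ffn(norm(x)).
    attn = causal_attention([norm(x) for x in xs])
    xs = [[a + b for a, b in zip(x, y)] for x, y in zip(xs, attn)]
    return [[a + b for a, b in zip(x, ffn(norm(x)))] for x in xs]


# Toy sizes and random weights, purely illustrative.
rng = random.Random(0)
D, H = 4, 8


def rand_matrix(rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


tokens = [[rng.uniform(-1, 1) for _ in range(D)] for _ in range(3)]
W_in, W_out = rand_matrix(H, D), rand_matrix(D, H)
W_gate, W_up = rand_matrix(H, D), rand_matrix(H, D)

# Same block skeleton, different plug-in components.
gpt2_style = decoder_block(tokens, layer_norm, lambda x: gelu_ffn(x, W_in, W_out))
modern_style = decoder_block(tokens, rms_norm, lambda x: swiglu_ffn(x, W_gate, W_up, W_out))
```

Both calls go through the identical decoder_block skeleton; only the norm and ffn arguments differ. That is the sense in which RMSNorm and SwiGLU are tweaks rather than a new architecture. MoE, GQA, and MLA are not shown here; they change how the FFN and attention sub-layers are computed internally while leaving the same residual structure in place.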

He frames progress toward “AGI” as continuous improvement (“amplification not paradigm change”) rather than a step-function transition.


Sources: fridman-lambert-raschka-2026-state-of-ai | raschka-2024-build-llm-from-scratch