My Knowledge Base
Tag: deep-learning
21 items with this tag.
Apr 28, 2026
AI Scaling Laws
ai
scaling
deep-learning
compute
training
inference
agentic-ai
rlvr
Apr 28, 2026
Attention Mechanism
ai
attention
transformer
llm
deep-learning
self-attention
multi-head-attention
Apr 28, 2026
Automatic Differentiation
machine-learning
deep-learning
calculus
software
autograd
Apr 28, 2026
Backpropagation
machine-learning
deep-learning
neural-networks
optimization
calculus
Apr 28, 2026
Chain Rule
calculus
machine-learning
deep-learning
mathematics
Apr 28, 2026
Computational Graph
machine-learning
deep-learning
autograd
software
Apr 28, 2026
GPT Architecture
ai
gpt
transformer
llm
deep-learning
architecture
Apr 28, 2026
Gradient Descent
machine-learning
optimization
neural-networks
deep-learning
Apr 28, 2026
Large Language Models (LLMs)
ai
llm
deep-learning
transformer
neural-network
moe
rlvr
Apr 28, 2026
Loss Function
machine-learning
deep-learning
optimization
neural-networks
Apr 28, 2026
Mixture of Experts (MoE)
ai
llm
architecture
transformer
moe
deep-learning
compute
Apr 28, 2026
Multi-Layer Perceptron (MLP)
machine-learning
deep-learning
neural-networks
Apr 28, 2026
Neural Network
machine-learning
deep-learning
neural-networks
optimization
Apr 28, 2026
Pretraining (LLMs)
ai
llm
pretraining
training
deep-learning
next-token-prediction
Apr 28, 2026
Transformer Architecture
ai
transformer
architecture
llm
attention
deep-learning
Apr 28, 2026
Word Embeddings
ai
llm
embeddings
nlp
deep-learning
positional-encoding
Apr 28, 2026
Andrej Karpathy
ai
deep-learning
neural-networks
person
Apr 28, 2026
Micrograd
software
machine-learning
autograd
deep-learning
python
Apr 28, 2026
PyTorch
software
machine-learning
deep-learning
framework
python
Apr 28, 2026
Karpathy 2022 — Micrograd: Building Backpropagation from Scratch
neural-networks
backpropagation
autograd
machine-learning
deep-learning
Apr 28, 2026
Build a Large Language Model (From Scratch) — Raschka (2024)
ai
llm
transformer
gpt
pytorch
attention
tokenization
pretraining
deep-learning