LLM vs JEPAMay 19, 2026 Part 5: Planning in Latent Space Action-conditioned JEPA on the bouncing ball. Add direction as input to the predictor and the bimodal-future problem disappears. Brute-force MPC in embedding space drives a goal-image rollout.
machine-learningpythonpytorchjepaself-supervised-learningworld-modelsplanning
Read article
LLM vs JEPAMay 18, 2026 Part 4: From Representations to World Models Running DINOv2 on a real image to see what a production joint-embedding encoder learns. Click a patch, get a similarity heatmap. No labels, no fine-tuning.
machine-learningpythonpytorchjepaself-supervised-learningcomputer-visiondino
Read article
LLM vs JEPAMay 17, 2026 Part 3: Predict Embeddings, Not Pixels Joint embedding training, the representation collapse problem, and how Barlow Twins, VICReg, and LeJEPA's SIGReg fix it. Tested on the same bouncing-ball toy from Part 2.
machine-learningpythonpytorchjepaself-supervised-learningvicreg
Read article
LLM vs JEPAMay 16, 2026 Part 2: Why Pixel Prediction Goes Blurry Yann LeCun says generative models fail on video because the future is ambiguous. I tested the claim with a 700K-parameter PyTorch model on a synthetic bouncing ball.
llmmachine-learningpythonpytorchcomputer-visionjepaself-supervised-learning
Read article
March 16, 2026 Attention Is All You Need: Building the Original Transformer that Started the LLM Revolution Attention Is All You Need replaced RNNs with self-attention and changed everything. I built the original encoder-decoder transformer from scratch and trained it to translate English to French.
machine-learningpythonpytorchnlptransformers
Read article