Tagged: computer-vision
Showing 1–5 of 7 articles
Part 4: From Representations to World Models
Running DINOv2 on a real image to see what a production joint-embedding encoder learns. Click a patch, get a similarity heatmap. No labels, no fine-tuning.

Part 2: Why Pixel Prediction Goes Blurry
Yann LeCun says generative models fail on video because the future is ambiguous. I tested the claim with a 700K-parameter PyTorch model on a synthetic bouncing ball.

Converting Custom Annotations to YOLO Keypoint Format
My corner annotator outputs JSON. YOLO wants normalised bounding boxes with keypoints in a specific folder structure. Bridging the two formats.

Corner Annotation, Board Scraping, and Training YOLO
No off-the-shelf model could find board corners reliably. I built a manual annotator, scraped training data, and trained a YOLO model on the results.

From Roboflow to SAM 3 on Rented GPUs
Classical CV couldn't handle real-world Go board photos. I tried Roboflow, Meta's SAM 3, Ultralytics, Google Colab, and Vast.ai. Each solved one problem and introduced another.