Claude/iterative training plan tbjy5 by fanrado · Pull Request #2 · fanrado/dinoLearning

fanrado · 2026-04-17T19:28:58Z

No description provided.

Implements the iterative training plan from the README as a runnable shell script with Pass 1 (full augmentations, shape learning, 200 epochs) and Pass 2 (minimal augmentations, color learning, 100 epochs). Supports --pass1-only and --pass2-only flags for independent execution. Includes k-NN evaluation after both passes for side-by-side comparison. https://claude.ai/code/session_01PxMmQiYRaGMb1ZXsULGVoN
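The flag handling described above can be sketched as follows. The real script is a shell script; this is only a Python illustration of the control flow. The function name `select_passes`, the `EPOCHS` table, and the choice to make the two flags mutually exclusive are assumptions, not taken from the PR.

```python
import argparse

# Epoch counts come from the commit message; everything else is illustrative.
EPOCHS = {"pass1": 200, "pass2": 100}


def select_passes(argv):
    """Return the passes to run for the given argument list (hypothetical helper)."""
    parser = argparse.ArgumentParser(description="Iterative training sketch")
    # Assumption: the two flags cannot be combined.
    group = parser.add_mutually_exclusive_group()
    group.add_argument("--pass1-only", action="store_true", help="run only Pass 1")
    group.add_argument("--pass2-only", action="store_true", help="run only Pass 2")
    args = parser.parse_args(argv)
    if args.pass1_only:
        return ["pass1"]
    if args.pass2_only:
        return ["pass2"]
    # Default: run both passes in order.
    return ["pass1", "pass2"]


for name in select_passes(["--pass2-only"]):
    print(f"{name}: {EPOCHS[name]} epochs")
```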

main_dino.py loads ImageFolder directly from the root (no train/val split in the directory structure). eval_knn.py handles the 80/20 split internally via sklearn train_test_split. Removed the /train suffix from --data_path in both torchrun calls in run_dino_iterative.sh and corrected the matching README commands and dataset layout description. https://claude.ai/code/session_01PxMmQiYRaGMb1ZXsULGVoN
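As a rough illustration of an 80/20 split drawn from a single root folder: the commit says eval_knn.py uses sklearn's train_test_split, while the sketch below swaps in a standard-library shuffle so it runs without extra dependencies. The helper name `split_indices`, the seed, and the sample count are hypothetical.

```python
import random


def split_indices(n_samples, test_fraction=0.2, seed=0):
    """Shuffle sample indices and split them into train and test parts.

    Stand-in for sklearn's train_test_split, not the code in eval_knn.py.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = round(n_samples * test_fraction)
    # The first n_test shuffled indices form the held-out set.
    return indices[n_test:], indices[:n_test]


# Hypothetical dataset size, for illustration only.
train_idx, test_idx = split_indices(1000)
print(len(train_idx), len(test_idx))
```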

…rado/dinoLearning into claude/iterative-training-plan-tbjy5

Added an option to load pre-trained weights. This is useful for the current approach, which trains in two stages.

- run_dino_iterative.sh: increase batch size from 32 to 64 for both passes; double learning rates accordingly (P1: 0.0000625 -> 0.000125, P2: 0.00000625 -> 0.0000125) to maintain linear LR scaling
- run_visualize_attention.sh: point checkpoint to epoch-40 snapshot (checkpoint0040.pth), switch input image to img.png, reduce image size from 1440x1440 to 960x960, set attention threshold to 0.3
- plot_training_metrics.ipynb: update notebook cell outputs/parameters
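The linear scaling rule behind these learning-rate changes can be checked with a few lines of arithmetic. This is a minimal sketch: the helper name `scale_lr` is hypothetical, and only the batch sizes and learning rates are taken from the commit message.

```python
import math


def scale_lr(old_lr, old_batch, new_batch):
    """Linear scaling: the learning rate grows in proportion to the batch size."""
    return old_lr * new_batch / old_batch


# Values from the commit message: batch size 32 -> 64 for both passes.
p1_lr = scale_lr(0.0000625, old_batch=32, new_batch=64)
p2_lr = scale_lr(0.00000625, old_batch=32, new_batch=64)

# Doubling the batch size doubles each learning rate.
assert math.isclose(p1_lr, 0.000125)
assert math.isclose(p2_lr, 0.0000125)
```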

…tImages Outlines 8 steps to improve k-NN/linear accuracy beyond the current 88.96 (20-NN) baseline, including dataset cleaning, dropping the broken Pass 2 self-sup in favour of a linear probe, hyperparameter fixes (LR, out_dim, teacher_temp), patch size 8, augmentation calibration, self-distillation, and evaluation/infra upgrades.

claude and others added 11 commits April 10, 2026 20:02

Merge branch 'claude/iterative-training-plan-tbjy5' of github.com:fan…

16c5719

…rado/dinoLearning into claude/iterative-training-plan-tbjy5

Added an option to enable or disable color augmentations.

f7e5449

Added an option to load pre-trained weights. This is useful for the current approach, which trains in two stages.

save the output of eval_knn

8b57e09

load checkpoint model from the OUTPUT_DINO_ITERATIVE

b9d2672

plot features from the output of eval_knn

d7dccaf

plot training metrics

d10c694

adjusted the learning rate based on the batch size

c65165f

fanrado merged commit 24e60c6 into main Apr 17, 2026


fanrado merged 11 commits into main from claude/iterative-training-plan-tbjy5


2 participants
