ViT Tiny R S16 P8 AugReg

timm vision
image

A tiny hybrid Vision Transformer (ViT-Tiny) that uses a ResNet-style convolutional stem (the "R" in the name) with patch size 8, pretrained on ImageNet-21k with the AugReg augmentation and regularization recipe.
