ViT Tiny Patch16 224

JSON →
timm vision
image

A tiny Vision Transformer (ViT) pretrained on ImageNet-21k with augmentation regularization and fine-tuned on ImageNet-1k for image classification.

vision