ViT Tiny Patch16 224
JSON →A tiny Vision Transformer (ViT) pretrained on ImageNet-21k with augmentation regularization and fine-tuned on ImageNet-1k for image classification.
Capabilities
vision
A tiny Vision Transformer (ViT) pretrained on ImageNet-21k with augmentation regularization and fine-tuned on ImageNet-1k for image classification.