Swin V2 Tiny (Window 8, 256 resolution, ImageNet-1k)

JSON →
timm vision
image

A tiny Swin Transformer V2 model with 8x8 window size and 256x256 resolution trained on ImageNet-1k for image classification.

vision