Depth Anything ViT-L/14

JSON →
depth-anything vision
image

A monocular depth estimation model using a Vision Transformer (ViT-L/14) backbone from the original Depth Anything release.

vision
releasedJan 2024