Moondream

JSON →

A vision language model for image analysis, including captioning, VQA, and object detection.

uvx (Recommended

★ 47 GitHub stars