Rapid-MLX

JSON →
library 0.6.80 ·python
verified Jun 7, 2026

Rapid-MLX provides AI inference on Apple Silicon with a drop-in OpenAI-compatible API. It claims 2-4x speedups over Ollama. Current version is 0.6.80, under active development with a weekly release cadence.