Rapid-MLX

library 0.6.80 ·python

✓ verified Jul 3, 2026

Rapid-MLX provides AI inference on Apple Silicon with a drop-in OpenAI-compatible API. It claims 2-4x speedups over Ollama. Current version is 0.6.80, under active development with a weekly release cadence.

Traffic · last 30 days stale · no recent hits · indexed Sun Jun 07 · updated Sat Jul 11

total hits 9

actors 3 distinct systems

last hit 15d ago AhrefsBot

GPTBot

3

ByteDance

1

Humans

4

top countries 🇸🇬 Singapore · 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada

Resources

githubgithub.com/raullenchai/Rapid-MLX ↗

homepagegithub.com/raullenchai/Rapid-MLX ↗

API endpoints

full doc /v1/registry/rapid-mlx

install /v1/registry/rapid-mlx/install