Ring Flash Attention

library 0.1.8 ·python

✓ verified Jul 3, 2026

Ring attention implementation with flash attention for efficient long-context LLM training. Supports distributed memory and compute parallelism. Current version: 0.1.8, actively maintained on GitHub, weekly releases.

Traffic · last 30 days stale · no recent hits · indexed Sun Jun 07 · updated Sat Jul 11

total hits 9

actors 2 distinct systems

last hit 16d ago AhrefsBot

GPTBot

3

Humans

5

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · 🇪🇸 Spain

Resources

githubgithub.com/zhuzilin/ring-flash-attention ↗

homepagegithub.com/zhuzilin/ring-flash-attention ↗

API endpoints

full doc /v1/registry/ring-flash-attn

install /v1/registry/ring-flash-attn/install