LM Dataformat
JSON →LM Dataformat (lm-dataformat) is a Python utility designed for efficient storage and reading of files specifically tailored for large language model (LLM) training. It provides functionalities to archive data with associated metadata and stream documents for processing. The current version is 0.0.20, but the project appears to be abandoned, with no active development or maintenance since its last release in 2021 and last GitHub commit over six years ago.
Traffic · last 30 days ↑1000% vs prev 7d
total hits 18
actors 7 distinct systems
last hit 19h ago human
top countries 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada · BD · 🇩🇪 Germany
API endpoints
full doc /v1/registry/lm-dataformat
compatibility /v1/registry/lm-dataformat/compatibility