LM Dataformat

library 0.0.20 ·python abandoned

✓ verified May 26, 2026

LM Dataformat (lm-dataformat) is a Python utility designed for efficient storage and reading of files specifically tailored for large language model (LLM) training. It provides functionalities to archive data with associated metadata and stream documents for processing. The current version is 0.0.20, but the project appears to be abandoned, with no active development or maintenance since its last release in 2021 and last GitHub commit over six years ago.

Traffic · last 30 days ↑1000% vs prev 7d · indexed Fri Apr 17 · updated Mon Jun 01

total hits 18

actors 7 distinct systems

last hit 19h ago human

MetaBot

GPTBot

Script

Search engines

Humans

top countries 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada · BD · 🇩🇪 Germany

Resources

githubgithub.com/leogao2/lm_dataformat ↗

packagepypi.org/project/lm-dataformat/ ↗

API endpoints

full doc /v1/registry/lm-dataformat

install /v1/registry/lm-dataformat/install

compatibility /v1/registry/lm-dataformat/compatibility