DeepEcho Synthetic Data Generator

0.8.1 · active · verified Fri Apr 17

DeepEcho is a Python library within the SDV ecosystem for generating sequential synthetic data from real-world datasets using Generative Adversarial Networks (GANs). It's designed for data that has a temporal or sequential component, such as time series or event logs. Currently at version 0.8.1, the library maintains an active development pace with frequent updates and bug fixes, often aligning with the broader SDV ecosystem's release cycle.

Common errors

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to use DeepEcho to generate synthetic sequential data. It uses `sdv.datasets.demo.get_sequential_demo` to obtain sample sequential data and its corresponding metadata, which is essential for DeepEcho to correctly model the data's structure. The `DeepEcho` model is then initialized with the metadata, fitted to the real data, and finally used to sample new synthetic sequences.

from deepecho.models import DeepEcho
from sdv.datasets.demo import get_sequential_demo

# Get demo data for sequential modeling from SDV
metadata, data = get_sequential_demo()

# Initialize and fit the DeepEcho model
# Metadata is crucial for DeepEcho to understand the sequential structure
model = DeepEcho(metadata=metadata)
model.fit(data)

# Generate 100 rows of synthetic sequential data
synthetic_data = model.sample(num_rows=100)

print(synthetic_data.head())

view raw JSON →