SDO is a schema creator first. You describe the structure — field names, types, constraints, relationships — and realistic records come out the other end. No scripts, no wrangling, no maintenance.
Most tools ask you to write generation scripts. SDO asks you to describe structure. Once you have the schema, the data follows automatically — at any volume.
Add fields, choose types, set constraints — all in a drag-and-drop editor. No code, no YAML, no config files. The schema you design is exactly what gets generated. Preview sample output before committing to a full run.
Start from a ready-made schema (Bank Account, Patient Record, Transaction Log) and customise it to your needs. Open any template directly in the Schema Creator with one click.
Define parent-child relationships with referential integrity. Child records reference parent field values and respect inherited date ranges. Model real data hierarchies without writing joins.
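To make the parent-child idea concrete, here is a minimal Python sketch of what referential integrity and inherited date ranges mean for generated records. It is an illustration only, not SDO's generator: the field names (account_id, opened_on, closed_on, posted_on) and the generate function are hypothetical.

```python
import random
from datetime import date, timedelta


def random_date(start: date, end: date, rng: random.Random) -> date:
    """Pick a date uniformly in the closed range [start, end]."""
    return start + timedelta(days=rng.randint(0, (end - start).days))


def generate(n_parents: int, children_per_parent: int, seed: int = 0):
    """Build parent rows, then child rows that point back at them."""
    rng = random.Random(seed)
    parents, children = [], []
    for pid in range(1, n_parents + 1):
        opened = random_date(date(2020, 1, 1), date(2022, 12, 31), rng)
        closed = opened + timedelta(days=rng.randint(30, 365))
        parents.append({"account_id": pid, "opened_on": opened, "closed_on": closed})
        for _ in range(children_per_parent):
            children.append({
                # Foreign key copied from the parent, so it always resolves.
                "account_id": pid,
                # Child date drawn inside the parent's own date range.
                "posted_on": random_date(opened, closed, rng),
            })
    return parents, children


accounts, transactions = generate(n_parents=3, children_per_parent=4)
```

Every transaction carries an account_id that exists in accounts, and its posted_on date falls between that account's opened_on and closed_on.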
Declare fields that intelligently branch based on a sibling field's value. Gender-correlated names, status-conditional amounts, role-based email domains — expressed as a schema rule, never as a script. The worker resolves dependency order automatically.
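One way to picture automatic dependency ordering is a topological sort over the fields. The sketch below uses Python's standard graphlib module. The rule format (field name mapped to its dependencies and a generator function) is an assumption made for illustration and is not SDO's schema syntax.

```python
import random
from graphlib import TopologicalSorter

NAMES = {"F": ["Alice", "Maria"], "M": ["Bob", "James"]}
DOMAINS = {"staff": "corp.example", "contractor": "partner.example"}

# Hypothetical rule format: field -> (fields it depends on, generator).
# Declared out of order on purpose; the sort fixes the evaluation order.
RULES = {
    "email": (("first_name", "role"),
              lambda row, rng: f"{row['first_name'].lower()}@{DOMAINS[row['role']]}"),
    "first_name": (("gender",), lambda row, rng: rng.choice(NAMES[row["gender"]])),
    "gender": ((), lambda row, rng: rng.choice(["F", "M"])),
    "role": ((), lambda row, rng: rng.choice(["staff", "contractor"])),
}


def resolve_order(rules):
    """Return field names so that every field comes after its dependencies."""
    graph = {field: set(deps) for field, (deps, _) in rules.items()}
    return list(TopologicalSorter(graph).static_order())


def generate_row(rules, rng):
    """Fill one record, evaluating fields in dependency order."""
    row = {}
    for field in resolve_order(rules):
        _, make = rules[field]
        row[field] = make(row, rng)
    return row


order = resolve_order(RULES)
row = generate_row(RULES, random.Random(42))
```

With this approach a circular rule (two fields that depend on each other) raises graphlib.CycleError instead of silently producing bad data.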
One schema. Four production-ready dbt layers — automatically. Bronze casts, silver validation, gold aggregations, and a quality monitoring model are all generated and wired together. No hand-editing, no boilerplate.
_ingested_at · _pipeline_run_id
_quality_score · dedup
validity_rate · below_threshold · per-run history
Workers run behind your firewall — no inbound ports required. They poll the orchestrator, generate locally, and push results back. Compute scales independently of the API surface.
Submit a task and get a task ID immediately. Large results stream back in chunks — no gateway timeouts, no memory pressure on the client side. Poll at your own pace.
Workers authenticate with per-request HMAC-SHA256 signatures. Replay-protected with a 5-minute timestamp window. No long-lived credentials sitting on worker machines.
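For readers who want to see what per-request signing with a timestamp window looks like, here is a minimal sketch using Python's standard hmac and hashlib modules. The message layout (method, path, timestamp, body hash joined by newlines) and the function names are assumptions. The actual wire format, and how a worker obtains its signing key, are not described here.

```python
import hashlib
import hmac
import time

MAX_SKEW_SECONDS = 300  # the 5-minute timestamp window


def sign(secret: bytes, method: str, path: str, body: bytes, timestamp: int) -> str:
    """Compute an HMAC-SHA256 signature over a canonical request string."""
    # Assumed canonical form; the real layout may differ.
    message = "\n".join(
        [method, path, str(timestamp), hashlib.sha256(body).hexdigest()]
    ).encode()
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def verify(secret: bytes, method: str, path: str, body: bytes,
           timestamp: int, signature: str, now=None) -> bool:
    """Reject stale timestamps first, then compare signatures in constant time."""
    now = int(time.time()) if now is None else now
    if abs(now - timestamp) > MAX_SKEW_SECONDS:
        return False
    expected = sign(secret, method, path, body, timestamp)
    return hmac.compare_digest(expected, signature)


secret = b"example-signing-key"  # placeholder value for the sketch
ts = 1_700_000_000
sig = sign(secret, "POST", "/results", b'{"rows": 10}', ts)
```

A captured request replayed more than five minutes after its timestamp fails the first check, and any change to the body, path, or timestamp fails the second.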
The heavy lifting happens on the worker side. You only ever touch the schema.
Use the visual Schema Creator or POST a JSON schema with field types, constraints, and weighted distributions.
The orchestrator creates a task and returns a task ID in milliseconds. No waiting at submit time — the call is non-blocking.
A compute worker polls, claims the task, generates every record against your schema, and pushes the result back.
Poll the status endpoint or watch the dashboard. Download as JSON, stream via the paginated data API, or consume it in your pipeline.
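The four steps above can be sketched end to end in Python. To keep the example runnable without a network, an in-memory class stands in for the orchestrator. Every name here (InMemoryOrchestrator, submit, claim, page, the status strings) is hypothetical and does not describe the real API or SDK.

```python
import random
import uuid


class InMemoryOrchestrator:
    """Stand-in for the real service so the flow runs locally."""

    def __init__(self):
        self.tasks = {}

    def submit(self, schema: dict, count: int) -> str:
        # Returns a task ID immediately; nothing is generated yet.
        task_id = uuid.uuid4().hex
        self.tasks[task_id] = {"schema": schema, "count": count,
                               "status": "queued", "records": []}
        return task_id

    def claim(self):
        # A worker polls for the next queued task.
        for task_id, task in self.tasks.items():
            if task["status"] == "queued":
                task["status"] = "running"
                return task_id, task
        return None

    def complete(self, task_id: str, records: list) -> None:
        self.tasks[task_id].update(status="done", records=records)

    def status(self, task_id: str) -> str:
        return self.tasks[task_id]["status"]

    def page(self, task_id: str, offset: int, limit: int) -> list:
        return self.tasks[task_id]["records"][offset:offset + limit]


def worker_step(orch: InMemoryOrchestrator, rng: random.Random) -> bool:
    """Claim one task, generate its records, push the result back."""
    claimed = orch.claim()
    if claimed is None:
        return False
    task_id, task = claimed
    records = [{name: rng.choice(values) for name, values in task["schema"].items()}
               for _ in range(task["count"])]
    orch.complete(task_id, records)
    return True


def fetch_all(orch: InMemoryOrchestrator, task_id: str, page_size: int = 100):
    """Read results page by page so the client never holds one huge response."""
    offset = 0
    while True:
        chunk = orch.page(task_id, offset, page_size)
        if not chunk:
            break
        yield from chunk
        offset += len(chunk)


orch = InMemoryOrchestrator()
task_id = orch.submit({"status": ["open", "closed"]}, count=250)
status_before = orch.status(task_id)      # still queued right after submit
worker_step(orch, random.Random(0))       # the worker does the heavy lifting
rows = list(fetch_all(orch, task_id))     # the client reads in pages
```

The submitter and the worker never talk to each other directly. Both only call the orchestrator, which is why workers need no inbound ports.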
Every type accepts options. Mix platform asset word lists with inline weighted values — the schema language is expressive by default.
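As an illustration of type options, the sketch below shows a small schema in which one field draws from a named word list and another uses inline weighted values. The key names (asset, values, weight) and the WORD_LISTS stand-in for platform assets are assumptions, not the documented schema language.

```python
import json
import random

# Stand-in for platform-hosted word lists (hypothetical content).
WORD_LISTS = {"cities": ["Leeds", "Porto", "Osaka"]}

# Hypothetical schema: one field per key, options vary by field.
SCHEMA = {
    "city": {"type": "string", "asset": "cities"},
    "status": {
        "type": "string",
        "values": [
            {"value": "active", "weight": 80},
            {"value": "dormant", "weight": 15},
            {"value": "closed", "weight": 5},
        ],
    },
}


def sample(options: dict, rng: random.Random) -> str:
    """Draw one value: uniform from a word list, or weighted from inline values."""
    if "asset" in options:
        return rng.choice(WORD_LISTS[options["asset"]])
    values = [v["value"] for v in options["values"]]
    weights = [v["weight"] for v in options["values"]]
    return rng.choices(values, weights=weights, k=1)[0]


rng = random.Random(7)
records = [{name: sample(opts, rng) for name, opts in SCHEMA.items()}
           for _ in range(1000)]
payload = json.dumps(SCHEMA)  # the JSON body a client would submit
```

Over a large run the status column approaches the declared 80/15/5 split, while city stays uniform across the word list.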
The Schema Creator is free to use. Generation costs apply when you run a task. Start with the demo, no sign-in needed.
Try the Schema Creator immediately. No account, no credit card.
Unlimited schemas, unlimited records. Pay only for what you generate.
Local generation via SDK. Zero egress costs. For high-volume or air-gapped environments.
pip install sdo client SDK