The lab
The data behind the smell fingerprints.
We fixed the data contracts before the first hardware run. Real bench, chamber, and field recordings are being collected now.
Data contracts first
The contract every dataset must follow
Before the first hardware run, we fixed the data contracts every real dataset must follow: row schema, feature shapes, run-bundle manifests, scenario-held-out splits, and the adversarial checks that gate every result. That is the standard the bench now records against.
Schema: fixed sample period · scenario-held-out splits · manifest + samples per run bundle · filenames carry no labels.
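As a rough illustration of two of these rules, the sketch below holds out whole scenarios from training and checks that a filename does not contain a label. It is not the lab's actual tooling: the `scenario_id` and `label` field names, the function names, and the sample rows are all hypothetical, chosen only to make the idea concrete.

```python
def split_by_scenario(rows, held_out_scenarios):
    """Split rows so that held-out scenarios never appear in the training set.

    `rows` is an iterable of dicts; the "scenario_id" key is an assumed
    field name, not taken from the real row schema.
    """
    held_out = set(held_out_scenarios)
    train, test = [], []
    for row in rows:
        # Every row of a held-out scenario goes to test, never to train.
        (test if row["scenario_id"] in held_out else train).append(row)
    return train, test


def filename_leaks_label(filename, labels):
    """Return True if any label string appears in the filename (case-insensitive)."""
    name = filename.lower()
    return any(label.lower() in name for label in labels)


# Hypothetical sample rows, for illustration only.
rows = [
    {"scenario_id": "s1", "label": "coffee"},
    {"scenario_id": "s1", "label": "ethanol"},
    {"scenario_id": "s2", "label": "coffee"},
    {"scenario_id": "s3", "label": "ethanol"},
]

train, test = split_by_scenario(rows, held_out_scenarios={"s3"})

# No scenario may appear on both sides of the split.
train_ids = {r["scenario_id"] for r in train}
test_ids = {r["scenario_id"] for r in test}
assert train_ids.isdisjoint(test_ids)
```

Splitting by scenario rather than by row means a model is evaluated on conditions it has never seen, and keeping labels out of filenames removes one easy path for label leakage into a pipeline.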
Bench & field
Collecting
Hardware datasets — in progress
First bench recordings are being captured under the AER protocols. Dataset cards will be published here as runs pass their truth gates.
Access & further reading
For dataset access or collaboration, contact [email protected].
- Truth gates: the contracts and checks every dataset passes.
- Drift & reality: why calibration data is the long-term moat.