Lab / Datasets
The lab

The data behind the smell fingerprints.

We fixed the data contracts before the first hardware run. Real bench, chamber, and field recordings are being collected now.

Data contracts first

The contract every dataset must follow

Before the first hardware run, we fixed the data contracts every real dataset must follow — row schema, feature shapes, run-bundle manifests, scenario-held-out splits, and the adversarial checks that gate every result. That is the standard the bench now records against.

Schema: fixed sample period · scenario-held-out splits · manifest + samples per run bundle · filenames carry no labels.

Bench & field

Collecting
Hardware datasets — in progress

First bench recordings are being captured under the AER protocols. Dataset cards will be published here as runs pass their truth gates.

Access & further reading

For dataset access or collaboration, contact [email protected].

  • Truth gates — the contracts and checks every dataset passes.
  • Drift & reality — why calibration data is the long-term moat.