Vitacore Research Collaborative

Resources

Datasets are research outputs.

Datasets are increasingly the primary commercial output of research institutions. UK Biobank is valued at over £10 billion in potential economic contribution. Vitacore operates a deliberate three-phase dataset strategy from the founding sprint onward.

01

Use

Access existing open datasets

  • PhysioNet · MIMIC-IV — ICU physiological data
  • UK Biobank — approved researcher access
  • ICBHI — respiratory sound database
  • OpenNeuro — neuroimaging for tremor and neurology
  • USGS Earthquake Hazards — seismic signal processing

Value: enables publications without primary data collection cost.

02

Annotate

Add value to existing datasets

  • Formal state annotations on physiological time-series
  • Sepsis trajectory state labels for ICU databases
  • Specification-annotated ECG and acoustic datasets
  • Published as Vitacore datasets alongside papers

Value: annotated datasets become citable, reusable resources — cited independently of the accompanying papers.

03

Generate

Original Vitacore datasets

  • Wearable physiological data — clinical study with NHS partner
  • Acoustic emission from materials and geology partners
  • RF sensing vital signs in clinical environments
  • Formal verification trace data from clinical AI systems

Value: proprietary datasets under controlled access — open to academic research, licensed to commercial users.

Dataset governance principles

  1. Open by default for academic research
  2. FAIR principles: Findable, Accessible, Interoperable, Reusable
  3. Participant consent and ethics approval for all clinical data
  4. Commercial access requires licence agreement and fee
  5. Vitacore retains dataset IP; contributing researchers credited
  6. Zenodo or Figshare for Phase 1–2; controlled infrastructure for Phase 3