About · VoxAtlas

What VoxAtlas does

VoxAtlas is a quick, ~90-second voice check that looks for changes in your voice vs. your own baseline. It is not a diagnosis. When there’s a noticeable change, it indicates where the shift seems to come from:

Laryngeal (voice source/quality: hoarseness, breathiness)
Pulmonary (breath support/phrase length: pauses, phrasing)
Neuro (timing/coordination: “pa-ta-ka” regularity)
Affect (prosody: pitch variability / monotony)

What’s novel here

Within-person, circadian-aware baselining. We compare you to you, and we bucket history by time-of-day to respect normal daily drift.
Compact, multi-task protocol. Quiet → sustained “aaah” → DDK (“pa-ta-ka”) → short read → cough/sniff → one sentence—broad coverage in < 2 minutes.
Transparent feedback. You see audio quality, system scores (0–1), and feature z-scores once a baseline exists.
Privacy-first. Features are computed locally on your device/Pi; raw audio isn’t saved unless you opt in.

Signals we analyze (at a glance)

Source/quality: spectral flatness, spectral centroid/roll-off; F0 (pitch) mean & variability (pYIN).
Timing/breathing: pause ratio, articulation/onset rate, breath-group cues from the short read.
Neuromotor: DDK rate and irregularity from “pa-ta-ka”.

We then compute robust z-scores vs your baseline (by hour-bin) and summarize which system changed.

How your data are used

On device: features are computed in the browser and on the Pi.
By default, no raw audio is stored. Only numeric features + session summaries are kept locally to build your baseline.
Opt-in audio saving: enable if you want better troubleshooting/research; you can delete the local database anytime.
Network: If you enable external access (e.g., Tailscale Funnel), traffic is HTTPS/TLS. No analytics, no third-party trackers.

Example result (with baseline)

Status: Watch (confidence 0.82). Small change from your usual morning voice, mainly in the laryngeal signal. Quality OK (SNR 23 dB, clipping 0.2%).

Why: ↑ spectral flatness (+1.6σ), slight ↑ DDK irregularity (+0.6σ).
What to do: Re-check tomorrow. If you feel unwell or it persists, consider a medical check.

VoxAtlas is a research prototype and not a medical device.

Selected references

Mauch & Dixon. pYIN: A Fundamental Frequency Estimator Using Probabilistic Threshold Distributions. ICASSP 2014. PDF
Bhattacharya et al. Coswara: A respiratory sounds and symptoms dataset. Scientific Data, 2023. Article
Allison et al. Automated diadochokinesis analysis (DDK). 2022. PMC
Tanchip et al. Validating automatic DDK methods in ALS. 2022. PMC
Garrett & Healey. Fluctuations in voices of normal speakers across the day. 1987. PubMed
Zacharia et al. Effect of circadian cycle on voice. 2018. Journal
Dubnov. Generalization of Spectral Flatness Measure. (tech report). PDF
Zeng et al. Acoustic & prosodic features in lung-cancer speech (pause ratio, etc.). 2023. PMC