Skip to content

Latest commit

 

History

History
34 lines (20 loc) · 2.35 KB

File metadata and controls

34 lines (20 loc) · 2.35 KB

Data Sources for Training Wake Words

Generated Samples

Piper sample generator uses text-to-speech to generate many wake word samples. We also generate adversarial phrase samples created using openWakeWord.

Augmentation Sources

We apply several augments to the generated samples. We use the following sources for background audio samples:

We reverberate the samples with room impulse responses from BIRD: Big Impulse Response Dataset.

Ambient Noises for Negative Samples

We use a variety of sources of ambient background noises as negative samples during training.

Ambient Speech

Ambient Background

Validation and Test Sets

We generate positive and negative samples solely for validation and testing. We augment these samples in the same way as the training data. We split the FSDK50K, FMA, and WHAM! datasets 90-10 into training and testing sets (they are not in the validation set). We estimate the false accepts per hour during training with the VOiCES validation set and DiPCo - Dinner Party Corpus (Community Data License Agreement – Permissive Version 1.0 License.) We test the false accepts per hour in streaming mode after training with the DiPCo set.