Skip to content
Open
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 3 additions & 5 deletions applications/KWS_Phoneme/auxiliary_files/README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,11 @@
# Auxiliary Files to help Download and Prepare the Data
# Python scripts to help download and down-sample the additive noise data from YouTube videos

## YouTube Additive Noise
Run the following commands to download the CSV Files to download the YouTube Additive Noise Data :

```
wget http://storage.googleapis.com/us_audioset/youtube_corpus/v1/csv/balanced_train_segments.csv
```
Followed by the extraction script to download the actual data :
Following the download of the CSV file, run the extraction script to download the actual audio data :
```
python download_youtube_data.py --csv_file=/path/to/csv_file.csv --target_folder=/path/to/target/folder/
```
Expand All @@ -17,8 +16,7 @@ The downloaded files would need to be converted to 16KHz for our pipeline. Pleas
```
python convert_sampling_rate.py --source_folder=/path/to/csv_file.csv --target_folder=/path/to/target/16KHz_folder/ --fs=16000 --log_rate=100
```
The script can convert the sampling rate of any .wav file to the specified --fs. But for our applications, we use 16KHz only.<br/>
Choose the log rate for how often the log should be printed for the sample rate conversion. This will print a string every log_rate iterations.
The script can convert the sampling rate of any .wav file to the specified --fs. But for our applications, we use 16KHz only. Choose the log rate for how often the log should be printed for the sample rate conversion. This will print a string every log_rate iterations.

Copyright (c) Microsoft Corporation. All rights reserved.
Licensed under the MIT license.
Loading