🦠 SVM-Based Cough Detector

Welcome to the SVM-Based Cough Detector project! This tool uses machine learning to automatically detect coughs in audio recordings.

🚀 What It Does

Loads and Segments Audio: Takes in .wav files and fetch positive (cough) segments based on the labels, and negative (not cough) segments that has the similar length distribution as positive samples.
Extracts Features: Pulls out MGCC features from each segment and aggregates them with mean and standard deviation to create uniform feature vectors.
Trains an SVM Model: Uses extracted features to train a Support Vector Machine (SVM) that can differentiate between cough and non-cough sounds.
Predicts coughs in audio collected by myself: Applies the trained model to new audio files to identify and timestamp cough events.
Store results: Generates label files where coughs are detected in the audio waveform.

📋 How to inference

Clone the Repository:

git clone https://github.com/yourusername/svm-cough-detector.git
cd svm-cough-detector

Install Dependencies: (recommend to do it in a virtual enviroment)
```
pip install -r requirements.txt
```

Inference:

python3 src/inference.py --model svm --input-dir dir/contains/one/data.wav

Want to record your own audio files?

python3 src/record_audio.py --output-dir dir/to/save/the/recording --duration 5

File type: The audio file is saved in .wav formate with sample rate at 16KHz and is monophonic sound.
File name: The audio file is saved as data.wav

Usage example:

# Record a 5 second audio and save it to my_data/positive/cough_example
python3 src/record_audio.py --output-dir my_data/positive/cough_example --duration 5

# Use the recorded audio for inferencing
python3 src/inference.py --model svm --input-dir my_data/positive/cough_example

🛠️ How It Works

1. Audio Preprocessing

Loading: Utilizes librosa to load audio files at a consistent sampling rate.
Segmentation: Splits audio into positive and negative segments. During this step, we try to keep both side has simialr distrubution in length and number of samaples.

2. Feature Extraction

MFCCs: Extracts Mel-Frequency Cepstral Coefficients (MFCCs) from each segment.
Aggregation: Calculates the mean and standard deviation of MFCCs to create a fixed-length feature vector for each segment.

3. Model Training

Scaling: Standardizes features using StandardScaler for better SVM performance.
SVM Training: Trains an SVM classifier with hyperparameter tuning using GridSearchCV to find the best settings.

4. Prediction Pipeline

Segmenting New Audio: Processes new .wav files under my_data folder. Breaks audio signal down into fixed-size (0.1s) chunks
Feature Processing: Extracts and scales features from new segments.
Classification: Predicts whether each segment contains a cough.
Mapping: Associates predictions with their corresponding time frames and filters out non-cough segments.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data/raw		data/raw
model/svm		model/svm
my_data		my_data
src		src
.gitignore		.gitignore
AED.ipynb		AED.ipynb
LICENSE		LICENSE
README.md		README.md
data_exploration.ipynb		data_exploration.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🦠 SVM-Based Cough Detector

🚀 What It Does

📋 How to inference

🛠️ How It Works

1. Audio Preprocessing

2. Feature Extraction

3. Model Training

4. Prediction Pipeline

About

Uh oh!

Releases

Packages

Languages

License

essencewxx/acoustic-event-detection

Folders and files

Latest commit

History

Repository files navigation

🦠 SVM-Based Cough Detector

🚀 What It Does

📋 How to inference

🛠️ How It Works

1. Audio Preprocessing

2. Feature Extraction

3. Model Training

4. Prediction Pipeline

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages