SpeechRecognition

The purpose of this project is to determine whether the use of feature engineering in convolutional neural networks, namely the pre-processing of vocal spectrograms by the extraction of maximum-energy ridges from a time-frequency matrix using a penalized forward-backward greedy algorithm, can significantly improve the performance of a voice recognition classifier. The model is a closed-vocabulary (categorizing from a set of listed words), speaker independent (no training required to match a speaker’s vocal idiosyncrasies) model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

SpeechRecognition

Files

README.md

Latest commit

History

README.md

File metadata and controls

SpeechRecognition