SpeechRecognition

The purpose of this project is to determine whether feature engineering can significantly improve the performance of a convolutional neural network used as a voice recognition classifier. Specifically, vocal spectrograms are pre-processed by extracting maximum-energy ridges from a time-frequency matrix with a penalized forward-backward greedy algorithm. The model is closed-vocabulary (it categorizes input from a set of listed words) and speaker-independent (it requires no training to match a speaker's vocal idiosyncrasies).
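
The ridge-extraction step described above might look roughly like the following. This is a minimal sketch and not the project's actual code: it assumes the time-frequency matrix is a NumPy array of shape `(n_freq, n_time)`, uses a squared frequency-jump penalty, and runs one greedy pass in each direction before keeping the better-scoring path. The function names (`greedy_pass`, `ridge_score`, `extract_ridge`) and the `penalty` parameter are hypothetical, chosen only for illustration.

```python
import numpy as np


def greedy_pass(energy, penalty, start_col, step):
    """Trace one ridge greedily, moving through time in the direction `step`."""
    n_freq, n_time = energy.shape
    bins = np.arange(n_freq)
    ridge = np.empty(n_time, dtype=int)
    # Seed the ridge at the strongest bin of the starting column.
    ridge[start_col] = int(np.argmax(energy[:, start_col]))
    col = start_col + step
    while 0 <= col < n_time:
        prev = ridge[col - step]
        # Reward energy, penalize large jumps away from the previous bin.
        score = energy[:, col] - penalty * (bins - prev) ** 2
        ridge[col] = int(np.argmax(score))
        col += step
    return ridge


def ridge_score(energy, ridge, penalty):
    """Total energy along the ridge minus the accumulated jump penalty."""
    cols = np.arange(energy.shape[1])
    jumps = np.diff(ridge)
    return energy[ridge, cols].sum() - penalty * np.sum(jumps ** 2)


def extract_ridge(energy, penalty=1.0):
    """Run a forward and a backward greedy pass and keep the better path."""
    n_time = energy.shape[1]
    forward = greedy_pass(energy, penalty, 0, +1)
    backward = greedy_pass(energy, penalty, n_time - 1, -1)
    return max((forward, backward), key=lambda r: ridge_score(energy, r, penalty))


# Synthetic example (assumed data): a slowly rising ridge plus one distant spike.
true_ridge = [2, 2, 3, 3, 4, 4]
energy = np.zeros((8, 6))
energy[true_ridge, np.arange(6)] = 10.0
energy[7, 3] = 11.0  # louder, but far from the ridge in frequency

ridge = extract_ridge(energy, penalty=1.0)
print(ridge.tolist())  # [2, 2, 3, 3, 4, 4]
```

With the penalty set to zero, the same call would jump to the isolated spike at bin 7, which is the behavior the penalty term is meant to suppress. In a pipeline like the one described, the extracted ridge (or a mask built from it) would presumably replace or augment the raw spectrogram as input to the network; how the project does this is not stated here.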