This repo provides step by step process from sctatch to fine tune facebook's wav2vec2-large model using transformers
-
Updated
Nov 7, 2024 - Jupyter Notebook
This repo provides step by step process from sctatch to fine tune facebook's wav2vec2-large model using transformers
Deployed Facebook's wav2vec2-large-90h model to transcribe 4,076 .mp3 files
Noise Cancellation Transcription Model Using Wav2Vec2
A fine-tuned Wav2Vec2-based Automatic Speech Recognition (ASR) system with data augmentation, efficient training, and transcription capabilities. Supports local and Mozilla Common Voice datasets, with evaluation via Word Error Rate (WER). 🚀
A fine-tuned Wav2Vec2-based Automatic Speech Recognition (ASR) system with data augmentation, efficient training, and transcription capabilities. Supports local and Mozilla Common Voice datasets, with evaluation via Word Error Rate (WER). 🚀
Add a description, image, and links to the wav2vec2-large-960h topic page so that developers can more easily learn about it.
To associate your repository with the wav2vec2-large-960h topic, visit your repo's landing page and select "manage topics."