Whisper fine-tuning for ASR

In this Repo, you can easily fine-tune different variations of the Whisper model to your specific multilingual data based on a simple manifest.

prepare_data.py :::: to prepare ".csv" files for train and test
train.py :::: train and save the fine-tuned Whisper model
decode.py :::: decode the test or any evaluation ".wav" file
whisper_transcribe_WER.py ::: another (easier) method for utilizing the Whisper model in transcription.

*** You can use different versions of the openai Whisper model.

Auxilary files

The required packages are listed in the "requirements.txt" file and you can easily install all of them using: pip install -r requirements.txt

It would be better to make a new Python environment using python3 -m venv myenv , after that, activate the venv using source myenv/bin/activate and then install the packages.

To run on the servers by Slurm, you can use the slurm_run.sh file.

The "files_test.csv" and "files_train.csv" help us understand better the required files for testing and training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper fine-tuning for ASR

Auxilary files

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
decode.py		decode.py
files_test.csv		files_test.csv
files_train.csv		files_train.csv
prepare_data.py		prepare_data.py
requirements.txt		requirements.txt
slurm_run.sh		slurm_run.sh
train.py		train.py
whisper_transcribe_WER.py		whisper_transcribe_WER.py

areffarhadi/Whisper_fine_tuning_ASR

Folders and files

Latest commit

History

Repository files navigation

Whisper fine-tuning for ASR

Auxilary files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages