Audio Sample Generator

This app generates new audio samples using a small Stable Diffusion model (LoRA) trained on spectrograms of existing samples.

Steps:

Convert existing audio samples to spectrograms.
Train a small Stable Diffusion model (LoRA) on spectrograms.
Generate new audio samples with a specified prompt.
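
To make the first step concrete, here is a minimal, dependency-free sketch of turning audio samples into a magnitude spectrogram with a short-time Fourier transform. It is an illustration only: the function names, the frame and hop sizes, and the naive DFT are assumptions, and the app itself may extract spectrograms differently (for example with an FFT library and a mel scale).

```python
import cmath
import math


def hann(n: int) -> list[float]:
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def magnitude_spectrogram(samples: list[float], frame: int = 64, hop: int = 32) -> list[list[float]]:
    """Return one list of frame // 2 + 1 magnitudes per analysis frame."""
    window = hann(frame)
    columns = []
    for start in range(0, len(samples) - frame + 1, hop):
        chunk = [s * w for s, w in zip(samples[start:start + frame], window)]
        column = []
        for k in range(frame // 2 + 1):
            # Naive DFT of bin k; fine for a sketch, an FFT is used in practice.
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / frame) for n, x in enumerate(chunk))
            column.append(abs(acc))
        columns.append(column)
    return columns


# A 1 kHz sine at an 8 kHz sample rate falls in bin 1000 / (8000 / 64) = 8.
rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / rate) for n in range(512)]
spec = magnitude_spectrogram(tone)
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
print(len(spec), len(spec[0]), peak_bin)
```

Each column of the result can be drawn as one pixel column of an image, which is what makes an image model such as Stable Diffusion applicable to audio.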

Prerequisites

Python

How to install

git clone --recurse-submodules git@github.com:Danand/audio-sample-generator.git
cd audio-sample-generator
chmod +x run.sh

How to launch

./run.sh

How to use

Work through the pages in the sidebar in order.

Advanced settings are omitted here for brevity.

Extract Spectrograms

Open audio files.
Click the Extract button.
Review the spectrograms extracted from the audio files.
Proceed to the next page.

Prepare Dataset

Specify for each spectrogram:
- Subject
- Caption (comma-separated keywords)
- Optional: Weight
Click the Save button.
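
As a rough picture of what one dataset entry holds, the sketch below models the three fields from this page and normalizes the comma-separated caption. The class name, the field names, the default weight of 1.0, and the "subject first" caption layout are all assumptions made for illustration; they are not the app's actual schema or file format.

```python
from dataclasses import dataclass


@dataclass
class DatasetEntry:
    """Hypothetical record for one spectrogram; not the app's actual schema."""

    image_name: str
    subject: str
    caption: str          # comma-separated keywords, as typed on the page
    weight: float = 1.0   # assumed default when Weight is left empty

    def keywords(self) -> list[str]:
        """Split the caption, trim whitespace, drop empties and duplicates."""
        seen = []
        for raw in self.caption.split(","):
            word = raw.strip()
            if word and word not in seen:
                seen.append(word)
        return seen

    def training_caption(self) -> str:
        """Assumed layout: subject first, then the cleaned keywords."""
        return ", ".join([self.subject.strip(), *self.keywords()])


entry = DatasetEntry("kick_01.png", "kick", "punchy,  short, , punchy, 808")
print(entry.training_caption())  # kick, punchy, short, 808
```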

Train LoRA

Click the Train button.

Generate Audio with Stable Diffusion

Type in the Prompt.
Specify the Amount of audio samples to generate.
Click Generate.
Listen and save the generated samples if desired.
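
If you want to post-process or re-save samples outside the app, the sketch below writes mono float samples as a 16-bit PCM WAV file using only the Python standard library. The helper name `save_wav`, the file name, and the 44100 Hz rate are assumptions; the app's own export format and sample rate may differ.

```python
import math
import os
import struct
import tempfile
import wave


def save_wav(path: str, samples: list[float], rate: int = 44100) -> None:
    """Write mono float samples in [-1, 1] as 16-bit PCM."""
    frames = bytearray()
    for s in samples:
        clipped = max(-1.0, min(1.0, s))  # avoid integer overflow on loud samples
        frames += struct.pack("<h", int(round(clipped * 32767)))
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes = 16 bits per sample
        wav.setframerate(rate)
        wav.writeframes(bytes(frames))


# Usage: write a 0.1 s, 440 Hz test tone to a temporary directory.
rate = 44100
tone = [0.5 * math.sin(2 * math.pi * 440 * n / rate) for n in range(rate // 10)]
out_path = os.path.join(tempfile.mkdtemp(), "sample_000.wav")
save_wav(out_path, tone, rate)
```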

Extras

Batch Convert to Audio

This page batch-converts spectrograms to audio samples. You can also experiment with any images of the matching size, not only spectrograms.
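
To show why any image of the right size can be "played", here is a small sketch that treats pixel brightness as spectral magnitude and synthesizes audio from it. It uses a random-phase inverse DFT with overlap-add, chosen for brevity; this is not necessarily what the app does (phase reconstruction methods such as Griffin-Lim are a common alternative), and the function name, hop size, and row orientation are assumptions.

```python
import cmath
import math
import random


def image_to_audio(image: list[list[float]], hop: int = 8, seed: int = 0) -> list[float]:
    """Treat image[row][col] as the magnitude of frequency bin `row` at frame `col`.

    Row 0 is the lowest frequency here; image files are usually stored
    top-down, so flip the rows first if needed.
    """
    bins = len(image)
    frame = 2 * (bins - 1)
    cols = len(image[0])
    rng = random.Random(seed)
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame) for n in range(frame)]
    out = [0.0] * ((cols - 1) * hop + frame)
    for c in range(cols):
        # The image carries magnitudes only, so pick a random phase per bin.
        spectrum = [image[k][c] * cmath.exp(1j * rng.uniform(0, 2 * math.pi)) for k in range(bins)]
        for n in range(frame):
            # Inverse real DFT: DC and Nyquist bins count once, the rest twice.
            value = spectrum[0].real + (spectrum[-1] * cmath.exp(1j * math.pi * n)).real
            value += 2 * sum(
                (spectrum[k] * cmath.exp(2j * math.pi * k * n / frame)).real
                for k in range(1, bins - 1)
            )
            # Windowed overlap-add into the output buffer.
            out[c * hop + n] += window[n] * value / frame
    return out


# A tiny 9 x 4 "image" with one bright row, i.e. a steady tone in bin 3.
image = [[0.0] * 4 for _ in range(9)]
image[3] = [1.0, 1.0, 1.0, 1.0]
audio = image_to_audio(image)
print(len(audio))  # (4 - 1) * 8 + 16 = 40 samples
```

An all-black image yields silence, while photographs or drawings produce noise-like textures whose character follows the bright regions of the picture.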
