Tacotron 2 Audio Preprocessor

This repository contains a Python script (tacotron2_preprocessor.py) that preprocesses audio files for training a Tacotron 2 text-to-speech model. The script trims silence, normalizes the audio, and saves the processed files to a specified output folder. It works on .wav files and is intended to produce a clean, consistent dataset for Tacotron 2 training.
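
To make the two processing steps concrete, here is a minimal, dependency-free sketch of silence trimming and peak normalization on a list of float samples. It is an illustration only, not the script's implementation: the function names, the amplitude threshold of 0.01, and the choice of peak normalization are all assumptions. The actual script relies on Librosa and SoundFile, which offer comparable operations such as `librosa.effects.trim` and `librosa.util.normalize`.

```python
def trim_silence(samples, threshold=0.01):
    """Drop leading and trailing samples whose magnitude is below threshold.

    The threshold value is a hypothetical default chosen for illustration.
    """
    start = 0
    end = len(samples)
    # Advance past quiet samples at the beginning.
    while start < end and abs(samples[start]) < threshold:
        start += 1
    # Step back past quiet samples at the end.
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


def peak_normalize(samples, target_peak=1.0):
    """Scale samples so the largest magnitude equals target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Empty or all-zero input: nothing to scale.
        return list(samples)
    scale = target_peak / peak
    return [s * scale for s in samples]


def preprocess(samples, threshold=0.01, target_peak=1.0):
    """Trim silence first, then normalize what remains."""
    return peak_normalize(trim_silence(samples, threshold), target_peak)
```

Trimming before normalizing keeps the scale factor tied to the retained audio. With simple peak scaling the order does not change the result when the loudest sample survives trimming, but it does avoid doing work on samples that are discarded.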

Requirements

Python 3.6 or higher
Librosa
SoundFile

Installation

Clone this repository to your local machine.

git clone https://github.com/yourusername/tacotron2-audio-preprocessor.git

Install the required libraries:

pip install librosa soundfile
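
If you want to confirm the environment before running the script, a small check like the following may help. It is a sketch, not part of the repository: the helper name `missing_requirements` is hypothetical, and it only tests that the modules `librosa` and `soundfile` can be located and that the interpreter meets the Python 3.6 minimum listed above.

```python
import importlib.util
import sys

# Import names of the libraries listed under Requirements.
REQUIRED_MODULES = ("librosa", "soundfile")
MIN_PYTHON = (3, 6)


def missing_requirements(modules=REQUIRED_MODULES, min_python=MIN_PYTHON):
    """Return a list describing unmet requirements (empty if all are met)."""
    problems = []
    if sys.version_info < min_python:
        problems.append("python>=%d.%d" % min_python)
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(name)
    return problems


if __name__ == "__main__":
    missing = missing_requirements()
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All requirements found.")
```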

Usage

Update the input_path and output_path variables in the tacotron2_preprocessor.py script to point to your input folder containing the .wav files and the desired output folder for the processed files.

input_path = "path\\to\\your\\input_folder"
output_path = "path\\to\\your\\output_folder"

Run the tacotron2_preprocessor.py script:

python tacotron2_preprocessor.py
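
Before a full run, it can be useful to preview which files would be read and where the results would land. The sketch below is an assumption about how the input and output folders relate, not a description of the script's internals: the helper `plan_outputs` is hypothetical, it matches the .wav extension case-insensitively, and it assumes each processed file keeps its original name.

```python
from pathlib import Path


def plan_outputs(input_path, output_path):
    """Pair each .wav file in input_path with a same-named file in output_path.

    Nothing is read or written here; the function only builds the mapping.
    """
    input_dir = Path(input_path)
    output_dir = Path(output_path)
    wav_files = sorted(
        p for p in input_dir.iterdir()
        if p.is_file() and p.suffix.lower() == ".wav"
    )
    return [(src, output_dir / src.name) for src in wav_files]
```

For example, `for src, dst in plan_outputs(input_path, output_path): print(src, "->", dst)` lists the planned conversions without touching any audio.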


YTR76/Tacotron-2-Audio-Preprocessor
