References
References
Here you can find preprocessing code.
Audio
Audio processing functions.
Audio processor
A class used to process audio signals and convert them into different representations.
Compute_YIN
Normalize Text
This class normalize the characters in the input text and normalize the input text with the nemo_text_processing
.
Preprocess LibriTTS
Preprocessing PreprocessLibriTTS
audio and text data for use with a TacotronSTFT
model.
TacotronSTFT
TacotronSTFT
module that computes mel-spectrograms from a batch of waves.
wav2vec aligner
The Wav2VecAligner model is designed for aligning audio data with text data. This class handles the training and validation of the Wav2VecAligner model.
TokenizerIPA
The tokenizer of IPA tokens with punctuation