Skip to content

References

References

Here you can find preprocessing code.

Audio

Audio processing functions.

Audio processor

A class used to process audio signals and convert them into different representations.

Compute_YIN

Original implementation

Normalize Text

This class normalize the characters in the input text and normalize the input text with the nemo_text_processing.

Preprocess LibriTTS

Preprocessing PreprocessLibriTTS audio and text data for use with a TacotronSTFT model.

TacotronSTFT

TacotronSTFT module that computes mel-spectrograms from a batch of waves.

wav2vec aligner

The Wav2VecAligner model is designed for aligning audio data with text data. This class handles the training and validation of the Wav2VecAligner model.

TokenizerIPA

The tokenizer of IPA tokens with punctuation