Style Token Layer (STL)
STL
Bases: Module
Style Token Layer (STL). This layer encapsulates different speaking styles in a set of token embeddings.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
model_config | AcousticModelConfigType | An object containing the model's configuration parameters. | required |
Attributes:
Name | Type | Description |
---|---|---|
embed | Parameter | The style token embedding tensor. |
attention | StyleEmbedAttention | The attention module used to compute a weighted sum of embeddings. |
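To make the "weighted sum of embeddings" concrete, here is a minimal, framework-free sketch of a single query attending over a small bank of style tokens. It is an illustration under stated assumptions, not the module's implementation: the function names `softmax` and `attend_style_tokens`, the scaled dot-product scoring, and the toy values are all hypothetical, and the sketch omits any learned projections or multi-head splitting that `StyleEmbedAttention` may apply.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_style_tokens(query, tokens):
    """Return (weights, embedding) for one query over a bank of tokens.

    Assumption: scaled dot-product scoring with no learned projections.
    """
    dim = len(query)
    # One similarity score per style token.
    scores = [
        sum(q * t for q, t in zip(query, token)) / math.sqrt(dim)
        for token in tokens
    ]
    # Normalise the scores so the weights are positive and sum to 1.
    weights = softmax(scores)
    # The style embedding is the weight-averaged token bank.
    embedding = [
        sum(w * token[i] for w, token in zip(weights, tokens))
        for i in range(len(tokens[0]))
    ]
    return weights, embedding


# Toy bank of three 2-dimensional style tokens (hypothetical values).
tokens = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
query = [2.0, 0.0]
weights, style = attend_style_tokens(query, tokens)
```

In this toy setup the token most aligned with the query receives the largest weight, so the resulting style embedding leans toward that token while still blending in the others.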
Source code in models/tts/delightful_tts/reference_encoder/STL.py
forward(x)
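Forward pass of the Style Token Layer.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
x | torch.Tensor | The input tensor. | required |
Returns:
Type | Description |
---|---|
torch.Tensor | The emotion-embedded tensor obtained after applying the attention mechanism. |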