ConvTransposed

Bases: Module

ConvTransposed applies a BSConv1d 1D convolution, transposing the last two dimensions of the input tensor before the convolution and transposing them back afterwards. This can be useful in architectures where tensors are processed with their dimensions in a different order.

The ConvTransposed class performs a BSConv operation after transposing the input tensor's dimensions. Specifically, it swaps the channels and width dimensions of the tensor, applies the convolution, and then swaps the dimensions back to their original order. Whether this is useful depends on the surrounding architecture; typically it is used when an operation, or a sequence of operations, expects a different arrangement of dimensions than the one the input arrives in.
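
As a usage illustration, the sketch below feeds a channels-last tensor through the layer. The import path and the (batch, width, channels) input layout are assumptions inferred from the source file path and the transposes in forward, not something stated on this page.

import torch

# Assumed import path, derived from the source file location shown below.
from models.tts.delightful_tts.conv_blocks.conv_transposed import ConvTransposed

# A batch of 8 sequences, 100 frames long, with 80 features per frame,
# laid out channels-last as (N, W, C).
x = torch.randn(8, 100, 80)

conv = ConvTransposed(in_channels=80, out_channels=256, kernel_size=3, padding=1)

y = conv(x)
# Assuming BSConv1d preserves the width for kernel_size=3 and padding=1,
# the output is expected to have shape (8, 100, 256).
print(y.shape)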

Parameters:

    in_channels (int, required): Number of channels in the input
    out_channels (int, required): Number of channels produced by the convolution
    kernel_size (int, default 1): Size of the kernel used in convolution
    padding (int, default 0): Zero-padding added around the input tensor along the width direction

Attributes:

    conv (BSConv1d): BSConv1d module to apply convolution.
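
BSConv1d itself is not documented on this page. As a rough mental model only (an assumption about the layer, not the repository's actual implementation), a blueprint-separable 1D convolution is commonly built as a pointwise convolution followed by a depthwise convolution:

import torch
from torch import nn

class BSConv1dSketch(nn.Module):
    """Hypothetical stand-in for BSConv1d: a pointwise conv followed by a depthwise conv."""

    def __init__(self, in_channels: int, out_channels: int, kernel_size: int, padding: int):
        super().__init__()
        # Pointwise (1x1) convolution mixes information across channels.
        self.pointwise = nn.Conv1d(in_channels, out_channels, kernel_size=1)
        # Depthwise convolution filters each channel independently along the width axis.
        self.depthwise = nn.Conv1d(
            out_channels,
            out_channels,
            kernel_size=kernel_size,
            padding=padding,
            groups=out_channels,
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Expects channels-first input of shape (N, C, W), like nn.Conv1d.
        return self.depthwise(self.pointwise(x))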

Source code in models/tts/delightful_tts/conv_blocks/conv_transposed.py
class ConvTransposed(Module):
    r"""`ConvTransposed` applies a 1D convolution operation, with the main difference that it transposes the
    last two dimensions of the input tensor before and after applying the `BSConv1d` convolution operation.
    This can be useful in certain architectures where the tensor dimensions are processed in a different order.

    The `ConvTransposed` class performs a `BSConv` operation after transposing the input tensor dimensions. Specifically, it swaps the channels and width dimensions of a tensor, applies the convolution, and then swaps the dimensions back to their original order. The intuition behind swapping dimensions can depend on the specific use case in the larger architecture; typically, it's used when the operation or sequence of operations expected a different arrangement of dimensions.

    Args:
        in_channels (int): Number of channels in the input
        out_channels (int): Number of channels produced by the convolution
        kernel_size (int): Size of the kernel used in convolution
        padding (int): Zero-padding added around the input tensor along the width direction

    Attributes:
        conv (BSConv1d): `BSConv1d` module to apply convolution.
    """

    def __init__(
        self,
        in_channels: int,
        out_channels: int,
        kernel_size: int = 1,
        padding: int = 0,
    ):
        super().__init__()

        # Define BSConv1d convolutional layer
        self.conv = BSConv1d(
            in_channels,
            out_channels,
            kernel_size=kernel_size,
            padding=padding,
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        """Forward propagation method for the ConvTransposed layer.

        Args:
            x (torch.Tensor): input tensor

        Returns:
            x (torch.Tensor): output tensor after application of ConvTransposed
        """
        # Transpose the last two dimensions (dimensions 1 and 2). The tensor now has shape (N, C, W), as expected by the 1D convolution
        x = x.contiguous().transpose(1, 2)

        # Apply BSConv1d convolution.
        x = self.conv(x)

        # Transpose the last two dimensions back to their original order, giving shape (N, W, C) again,
        # and return the final output tensor
        return x.contiguous().transpose(1, 2)

forward(x)

Forward propagation method for the ConvTransposed layer.

Parameters:

    x (Tensor, required): input tensor

Returns:

    x (Tensor): output tensor after application of ConvTransposed

Source code in models/tts/delightful_tts/conv_blocks/conv_transposed.py
def forward(self, x: torch.Tensor) -> torch.Tensor:
    """Forward propagation method for the ConvTransposed layer.

    Args:
        x (torch.Tensor): input tensor

    Returns:
        x (torch.Tensor): output tensor after application of ConvTransposed
    """
    # Transpose the last two dimensions (dimensions 1 and 2). The tensor now has shape (N, C, W), as expected by the 1D convolution
    x = x.contiguous().transpose(1, 2)

    # Apply BSConv1d convolution.
    x = self.conv(x)

    # Transpose the last two dimensions back to their original order, giving shape (N, W, C) again,
    # and return the final output tensor
    return x.contiguous().transpose(1, 2)