
Implement SeparableConv2D in Pytorch

Published on 2020-12-05 05:47:40

Main objective

PyTorch equivalent for SeparableConv2D with padding = 'same':

from tensorflow.keras.layers import SeparableConv2D
x = SeparableConv2D(64, (1, 16), use_bias = False, padding = 'same')(x)

What is the PyTorch equivalent for SeparableConv2D?

This source says:

If groups = nInputPlane and kernel=(K, 1) (and it is preceded by a Conv2d layer with groups=1 and kernel=(1, K)), then it is separable.

While this source says:

Its core idea is to break down a complete convolution operation into a two-step calculation, Depthwise Convolution and Pointwise Convolution.
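As an aside (not in the original question), a minimal sketch of that two-step decomposition in PyTorch, comparing parameter counts with a full convolution; the channel sizes are arbitrary examples:

import torch.nn as nn

cin, cout, k = 32, 64, (3, 3)  # arbitrary example sizes

full = nn.Conv2d(cin, cout, k, bias=False)                   # 64*32*3*3 = 18432 weights
depthwise = nn.Conv2d(cin, cin, k, groups=cin, bias=False)   # 32*1*3*3  =   288 weights
pointwise = nn.Conv2d(cin, cout, kernel_size=1, bias=False)  # 64*32*1*1 =  2048 weights

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(full), count(depthwise) + count(pointwise))      # 18432 2336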

This is my attempt:

class SeparableConv2d(nn.Module):
    def __init__(self, in_channels, out_channels, depth, kernel_size, bias=False):
        super(SeparableConv2d, self).__init__()
        # depthwise: spatial convolution applied per group of input channels
        # (out_channels*depth must be divisible by in_channels when groups=in_channels)
        self.depthwise = nn.Conv2d(in_channels, out_channels*depth, kernel_size=kernel_size, groups=in_channels, bias=bias)
        # pointwise: 1x1 convolution that mixes the channels
        self.pointwise = nn.Conv2d(out_channels*depth, out_channels, kernel_size=1, bias=bias)

    def forward(self, x):
        out = self.depthwise(x)
        out = self.pointwise(out)
        return out

Is this correct? Is this equivalent to tensorflow.keras.layers.SeparableConv2D?

What about padding = 'same'?

How do I ensure that my input and output sizes are the same while doing this?

My attempt:

x = F.pad(x, (8, 7, 0, 0))

Because the kernel size is (1,16), I added left and right padding, 8 and 7 respectively. Is this the right way (and best way) to achieve padding = 'same'? How can I place this inside my SeparableConv2d class, and calculate on the fly given the input data dimension size?
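One way to do that (a sketch, not from the original post; SeparableConv2dPadded is a hypothetical name, and it assumes stride 1 and dilation 1) is to derive the padding from the kernel size inside forward:

import torch.nn as nn
import torch.nn.functional as F

class SeparableConv2dPadded(nn.Module):
    # Sketch only: 'same'-style padding computed from kernel_size, assuming stride 1.
    def __init__(self, in_channels, out_channels, kernel_size, bias=False):
        super().__init__()
        self.kernel_size = kernel_size
        self.depthwise = nn.Conv2d(in_channels, in_channels, kernel_size=kernel_size,
                                   groups=in_channels, bias=bias)
        self.pointwise = nn.Conv2d(in_channels, out_channels, kernel_size=1, bias=bias)

    def forward(self, x):
        kh, kw = self.kernel_size
        pad_h, pad_w = kh - 1, kw - 1          # total padding needed per dimension
        # split the total, putting the extra column/row on the right/bottom
        x = F.pad(x, (pad_w // 2, pad_w - pad_w // 2,
                      pad_h // 2, pad_h - pad_h // 2))
        return self.pointwise(self.depthwise(x))

For a (1, 16) kernel this pads 7 columns on the left and 8 on the right, which I believe matches the Keras convention for even kernels; your (8, 7) split produces the same output size either way.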

All together

class SeparableConv2d(nn.Module):
    def __init__(self, in_channels, out_channels, depth, kernel_size, bias=False):
        super(SeparableConv2d, self).__init__()
        self.depthwise = nn.Conv2d(in_channels, out_channels*depth, kernel_size=kernel_size, groups=in_channels, bias=bias)
        self.pointwise = nn.Conv2d(out_channels*depth, out_channels, kernel_size=1, bias=bias)

    def forward(self, x):
        out = self.depthwise(x)
        out = self.pointwise(out)
        return out


class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.separable_conv = SeparableConv2d(
            in_channels=32, 
            out_channels=64, 
            depth=1, 
            kernel_size=(1,16)
        )
        
    def forward(self, x):
        x = F.pad(x, (8, 7, 0, 0))  # pad width only: 8 columns left, 7 right
        x = self.separable_conv(x)
        return x

Any problem with these codes?
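Not part of the original post, but a quick shape check (using the Net class above and arbitrary input sizes) suggests the manual (8, 7) padding does keep the spatial size for a (1, 16) kernel:

import torch

net = Net()
x = torch.randn(2, 32, 8, 128)   # (batch, channels, height, width), arbitrary sizes
print(net(x).shape)              # expected: torch.Size([2, 64, 8, 128])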

Questioner: Jingles

Answer by Poe Dator (2020-12-07 16:52:52)

The linked definitions generally agree; the best one is in the article.

  • "Depthwise" (not a very intuitive name since depth is not involved) - is a series of regular 2d convolutions, just applied to layers of the data separately. - "Pointwise" is same as Conv2d with 1x1 kernel.

I suggest a few corrections to your SeparableConv2d class:

  • There is no need for the depth parameter; it is the same as out_channels.
  • I set padding to 1 to ensure the same output size with kernel=(3,3). If the kernel size is different, adjust the padding accordingly, using the same principles as with a regular Conv2d. Your example class Net() is no longer needed, since the padding is done inside SeparableConv2d.

This is the updated code; it should be similar to the tf.keras.layers.SeparableConv2D implementation:

class SeparableConv2d(nn.Module):
    def __init__(self, in_channels, out_channels, kernel_size, bias=False):
        super(SeparableConv2d, self).__init__()
        self.depthwise = nn.Conv2d(in_channels, in_channels, kernel_size=kernel_size,
                                   groups=in_channels, bias=bias, padding=1)
        self.pointwise = nn.Conv2d(in_channels, out_channels,
                                   kernel_size=1, bias=bias)

    def forward(self, x):
        out = self.depthwise(x)
        out = self.pointwise(out)
        return out
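For the original (1, 16) kernel, a variation on the class above (a sketch assuming PyTorch 1.9 or later, where nn.Conv2d accepts padding='same' for stride-1 convolutions; SeparableConv2dSame is a hypothetical name) would be:

import torch
import torch.nn as nn

class SeparableConv2dSame(nn.Module):
    # Assumes PyTorch >= 1.9, where padding='same' is accepted for stride-1 convolutions.
    def __init__(self, in_channels, out_channels, kernel_size, bias=False):
        super().__init__()
        self.depthwise = nn.Conv2d(in_channels, in_channels, kernel_size=kernel_size,
                                   groups=in_channels, bias=bias, padding='same')
        self.pointwise = nn.Conv2d(in_channels, out_channels, kernel_size=1, bias=bias)

    def forward(self, x):
        out = self.depthwise(x)
        out = self.pointwise(out)
        return out

conv = SeparableConv2dSame(32, 64, kernel_size=(1, 16))
print(conv(torch.randn(2, 32, 8, 128)).shape)   # torch.Size([2, 64, 8, 128])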