Applies a 1D power-average pooling over an input signal composed of
several input planes. If the sum of all inputs to the power of `p` is
zero, the gradient is set to zero as well.
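
For reference, power-average pooling over one window is usually written as the p-norm of the values in that window. The formula below is a sketch of that definition, where $X$ denotes the values in one window (a symbol introduced here for illustration) and $p$ corresponds to `norm_type`:

$$
f(X) = \left( \sum_{x \in X} x^{p} \right)^{1/p}
$$

With $p = 1$ this reduces to the plain sum of the window, and as $p \to \infty$ it approaches the maximum of the window for non-negative inputs.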

## Arguments

- input
the input tensor

- norm_type
if inf, one gets max pooling; if 1, one gets sum pooling (proportional to average pooling)

- kernel_size
a single int, the size of the window

- stride
a single int, the stride of the window. Default value is kernel_size

- ceil_mode
when True, will use ceil instead of floor to compute the output shape
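
To make the arguments concrete, here is a minimal pure-Python sketch of power-average pooling over a single plane. It is not the library's implementation: the function name `lp_pool1d_sketch` is hypothetical, inputs are assumed to be non-negative, and in `ceil_mode` a final partial window is assumed to be simply truncated, which may differ from what the library does at the boundary.

```python
import math


def lp_pool1d_sketch(signal, norm_type, kernel_size, stride=None, ceil_mode=False):
    """Illustrative power-average pooling over one plane (a list of floats)."""
    if stride is None:
        stride = kernel_size  # documented default: stride equals kernel_size

    # Number of windows: floor by default, ceil when ceil_mode is set.
    span = (len(signal) - kernel_size) / stride
    n_out = (math.ceil(span) if ceil_mode else math.floor(span)) + 1

    # Assumption: in ceil mode, a window must still start inside the input.
    if ceil_mode and (n_out - 1) * stride >= len(signal):
        n_out -= 1

    out = []
    for i in range(n_out):
        start = i * stride
        window = signal[start:start + kernel_size]  # may be shorter at the end
        if math.isinf(norm_type):
            # Limit p -> inf: max pooling (for non-negative inputs).
            out.append(max(window))
        else:
            # (sum of x^p) ^ (1/p)
            total = sum(x ** norm_type for x in window)
            out.append(total ** (1.0 / norm_type))
    return out


plane = [1.0, 2.0, 3.0, 4.0]
sum_pooled = lp_pool1d_sketch(plane, norm_type=1, kernel_size=2)
l2_pooled = lp_pool1d_sketch(plane, norm_type=2, kernel_size=2)
max_pooled = lp_pool1d_sketch(plane, norm_type=math.inf, kernel_size=2)
```

In this sketch, `norm_type=1` yields the window sums, `norm_type=math.inf` yields the window maxima, and `ceil_mode=True` adds one extra output whenever the input length leaves a partial window at the end.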