TransposeConv wrong shape?
The TFLite code doesn't like the shape of the Transpose Convolution nodes in my TinyNN-converted network. It consistently expects the bias to match the size of the first dimension of the weights, but in my converted network it matches the size of the last dimension instead. (The error is raised on the first TransposeConv, but there are a number of similarly-formed ones in the network which will presumably hit the same error.)
The network runs fine before conversion, and the TFLite file looks very sensible in Netron, but the TFLite runtime doesn't like it...
It gives an error of:
Unexpected failure when preparing tensor allocations: tensorflow/lite/kernels/transpose_conv.cc:289 NumElements(bias) != SizeOfDimension(weights, 0) (3 != 1) Node number 194 (TRANSPOSE_CONV) failed to prepare.
The node in question has (according to Netron):
input: 1x4x4x3, output shape tensor: <4>, weights: <1x2x2x3>, bias: <3>, output: 1x8x8x3
so the bias size (3) evidently doesn't match the first weights dimension (1), which breaks TFLite's rule. Is this a conversion error, or how should I cope with it?
Incidentally, the biases in all the Transpose Convolutions are entirely zeros, and the weights are a suspiciously simple arrangement of 1s and 0s. Since the bias is optional for this TFLite operator, is there a way to not include it?
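For concreteness, the check TFLite applies at prepare time boils down to something like this sketch (shapes taken from the Netron dump above):

```python
import numpy as np

# Shapes reported by Netron for the failing TRANSPOSE_CONV node
weights = np.zeros((1, 2, 2, 3), dtype=np.float32)  # <1x2x2x3>
bias = np.zeros((3,), dtype=np.float32)             # <3>, all zeros

# TFLite's prepare-time check: NumElements(bias) == SizeOfDimension(weights, 0).
# TFLite expects the output channels in dim 0 of the weights, so the bias
# length must equal weights.shape[0] -- but here the channels sit in dim 3.
print(bias.size == weights.shape[0])  # False -> the "3 != 1" in the error
```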
@BmanClark Could you please tell me which version of TensorFlow (Lite) you are testing against?
It is indeed a conversion error. Could you please upload a TorchScript model and the shape of the inputs so that we can reproduce this issue?
TFLite is 2.8.0. The line number is off by a few from the latest, but the code seems pretty much the same in transpose_conv.cc. I'll work on getting a TorchScript version of the model now; currently it's PyTorch code plus state-dict loading.
The error will need fixing, but as a thought: if the bias is all zeros and optional, wouldn't it be more efficient to not include it in the TFLite model?
Hmm, I've created the TorchScript version, but it's (just!) too big to attach, so I'm still working out how I can share it with you. The input shape is 1x512x512x4 (float32).
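In case it helps reproduce things, the tracing step I used was roughly the following (the tiny model here is just a stand-in for the real network, and since PyTorch takes NCHW, the 1x512x512x4 TFLite input corresponds to a 1x4x512x512 tensor):

```python
import torch
import torch.nn as nn

# Stand-in for the real network (placeholder)
model = nn.Sequential(nn.Conv2d(4, 4, kernel_size=3, padding=1))
model.eval()

# NCHW dummy input matching the 1x512x512x4 (NHWC) TFLite input
dummy_input = torch.randn(1, 4, 512, 512)

# Trace the model and save the TorchScript file for sharing
traced = torch.jit.trace(model, dummy_input)
traced.save("model_traced.pt")
```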
As for a workaround, yeah, it should be easy. But I'm more interested in how it happened.
Google Drive or OneDrive, either will do if size is an issue. Or you may try out whether the model can be traced using our code tracer (usage can be found in examples/tracer), so that you don't need to upload the weights.
I've put it on my Google Drive and shared it with you.
Thanks. I will take a look tomorrow.
Tinkering further, I can get around the problem by specifying group_conv_rewrite=True. This splits each of the Transpose Convolutions into 512? parallel ones and concatenates them again. It runs, but it doesn't look like it's as efficient as it could be... Gives Netron a headache too 😄
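For anyone else hitting this, the flag goes through TinyNN's converter roughly like so (the model and paths are placeholders, not my actual network):

```python
import torch
import torch.nn as nn
from tinynn.converter import TFLiteConverter

# Placeholder model containing a grouped transpose convolution
model = nn.Sequential(
    nn.ConvTranspose2d(4, 4, kernel_size=2, stride=2, groups=4)
).eval()
dummy_input = torch.randn(1, 4, 512, 512)

# group_conv_rewrite=True rewrites each grouped (de)conv into parallel
# per-group ops whose outputs are concatenated back together
converter = TFLiteConverter(
    model, dummy_input, tflite_path="model.tflite", group_conv_rewrite=True
)
converter.convert()
```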
@BmanClark Hi, I've put up a fix to eliminate the zero bias tensors for the DeConv ops: https://github.com/alibaba/TinyNeuralNetwork/pull/263. But I'm not sure whether group deconv is supported in TFLite; if it isn't, this fix may be useless.
Update: it looks like group deconvolution is not supported, at least in the XNNPACK delegate. https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/xnnpack/xnnpack_delegate.cc#L6213
So that rewrite may be the only way to use grouped deconvolutions in TFLite. You may want to create a new issue in the TFLite repo.
I'm not actually looking to target XNNPACK ultimately (although I would like it as a reference), but I've created an issue: https://github.com/tensorflow/tensorflow/issues/62181. Thanks for the bias fix; I'll check whether that's enough for ArmNN to run it, or if I need to get them to fix something too...
Actually, I didn't find any changes enabling group deconvolution in the general optimized kernels either, so it's possible that it isn't supported by the TFLite interpreter at all. As for ArmNN, the story may be different: they could support that case, since TFLite is only a model-representation format for them.
ArmNN doesn't currently support group deconvolution either. They might look at adding it for me, but I might not need them to, as the network creator has kindly changed the group deconvolution to an Upsample for me! I love the OSS community! Anyway, I'm away, so I won't get to try it until next week, but I'll give you an update once I've had a chance to get further. Thanks for your help.
Update: the network happily converts now that its group deconvolutions (a.k.a. Transpose Convolutions) are replaced with Upsamples. Thank you. I've had a request for more information on the TensorFlow issue I raised (specifically on how I used TinyNN to convert), so I'm providing that.
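For completeness, the swap amounts to roughly the following (channel counts are illustrative; given that the deconv weights were all 1s and 0s with zero bias, it was presumably just doing nearest-neighbour upsampling anyway):

```python
import torch.nn as nn

# Before: a grouped stride-2 transpose convolution, which TFLite can't prepare
# deconv = nn.ConvTranspose2d(3, 3, kernel_size=2, stride=2, groups=3)

# After: a plain upsample that doubles the spatial size the same way and
# converts to a TFLite resize op instead of TRANSPOSE_CONV
upsample = nn.Upsample(scale_factor=2, mode="nearest")
```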
@BmanClark I have commented on that issue. Glad you solved it the other way.