I am trying to create an encoder-decoder-model, which encodes an 10x10 list and should decode it to an 3x8x8 array/list. Which loss function should I choose to achieve this? I know that the shapes of the input and output are very random and I'm not quite sure how to even fit both into one encoder-decoder-model. The 3x8x8 output however is mandatory and the 10x10 shape is the difference between two nested lists.
From what I have researched so far, the loss functions need (somewhat of) the same shapes for prediction and target. Now I don't know which one to take, to fit my awkward shape requirements.