Let us assume a 2x2 image, 3 channels (one per possible label), and a batch of 2. The predicted scores (logits) are then given below, with each 2x2 array arranged in a single row.

For N = 1 (of batch 2):
[ 9 2 5 8 ]    Each row is a class
[ 7 8 4 6 ]    Each column holds the scores of every class for one pixel
[ 8 4 6 7 ]

For N = 2 (of batch 2):
[ 7 3 0 3 ]
[ 4 6 ...
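
To make the layout concrete, here is a minimal sketch in plain Python that reads the N = 1 logits above column by column. It assumes classes are indexed 0 to 2 from the top row down and that the 2x2 image was flattened row by row; neither is stated in the text. The names `logits_n1`, `softmax`, and `predicted` are made up for illustration. The second sample is left out because its values are cut off above.

```python
import math

# Logits for the first sample (N = 1), copied from the text above.
# Rows are classes, columns are the 4 pixels of the flattened 2x2 image.
logits_n1 = [
    [9, 2, 5, 8],  # class 0 (0-based indexing is an assumption)
    [7, 8, 4, 6],  # class 1
    [8, 4, 6, 7],  # class 2
]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Transpose so that each entry holds the 3 class scores of one pixel.
per_pixel_scores = [list(col) for col in zip(*logits_n1)]

# Class probabilities and highest-scoring class for each pixel.
per_pixel_probs = [softmax(scores) for scores in per_pixel_scores]
predicted = [scores.index(max(scores)) for scores in per_pixel_scores]

# Undo the flattening (assumed row-major): back to a 2x2 map of labels.
predicted_2x2 = [predicted[0:2], predicted[2:4]]

print(predicted)      # [0, 1, 2, 0]
print(predicted_2x2)  # [[0, 1], [2, 0]]
```

Each column is handled independently, so the softmax is taken over the 3 class scores of a single pixel and never across pixels.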