Does my model have to reconstruct images in full resolution or can it be cropped?
The decoder has to produce PNG images where each image has the same resolution as the corresponding image in the validation or test set.
How is PSNR calculated?
We compute a single MSE value by averaging across all RGB channels of all pixels of the whole dataset, and from that calculate a PSNR value.
The evaluation server gives "ERROR: Missing image IMG_20170114_210112.png". What am I doing wrong?
The error means that the decoder failed and did not produce all required files. Make sure that the names of the files match the target files' names exactly.
In which directory should the decoder save images?
The decoder can save images in the current working directory
. or in any arbitrary subfolder
Do we need to encode the Y, U, and V channels of the P-frame data separately?
You may encode the data however you want. For example, you may encode the Y, U, and V channels of one frame into a single file. However, your decoder needs to output separate PNGs for each channel.
Does the model count towards the target bit-rate?
For the P-frame track, the model and data size is combined to calculate a total size (
model_size + 100 * data_size) which has to be below a threshold. For the low-rate track, the model size is not considered and only the size of the encoded data matters.
Is the input to the P-frame decoder a compressed frame or an uncompressed frame?
The decoders of the P-frame challenge will be provided with an uncompressed frame as input.