How to train CNN with an image in the input and an image in the output?

24 visualizaciones (últimos 30 días)
I have a task to train CNN with an image as input and an image as output. I have tried to do it at the beginning with Matlab tutorial, but matlab has no image as output, but a vector.
any idea how i can do this tutorial again but with image as output?
For example you could insert input as image for number one and the output is also for number one but rotated or deformed.
Please don't send me other people's question, because i have seen all answers about it.
  6 comentarios
Osama Tabbakh
Osama Tabbakh el 15 de Ag. de 2020
Hello Ritesh,
Unfortunately I haven't done it in Python yet. But I wish you all the best with it, good luck.
Florent Busnoult
Florent Busnoult el 25 de Ag. de 2020
Editada: Florent Busnoult el 25 de Ag. de 2020
There is an example in the 5G toolbox that uses a CNN network with a picture as an input and a picture as an output.
The variable used to store the "labels" or (ouput images) is the following:
trainLabels(:,:,:,i) = est_grid;
It's a 4-D double variable.
And you can look at the different output images used to trian the network by using the command below:
>> imagesc(abs(trainLabels(:,:,:,2))); % output image number 2
>> imagesc(abs(trainLabels(:,:,:,3))); % output image number 3
>> imagesc(abs(trainLabels(:,:,:,25))); % output image number 25
and so on.

Iniciar sesión para comentar.

Respuesta aceptada

Vasilis Giannoglou
Vasilis Giannoglou el 16 de Sept. de 2020
I was also struggling with having an image as an output and I found the solution. Skip the fullyConnectedLayer(n) part, because it causes the network to have an output of either a vector or just a number. However, make sure not to include any Pooling layers if you want your output to be of the same size as your input, because they cause downsampling. Finally, keep in mind that the number of filters of your final conv network is the same number as the 3rd dimension of your output image. For example,
convolution2dLayer(3,1,'Padding','same') gives an output image of (M,N,1), where M,N are the rows and columns of the initial image. The next code runs to me.
layers = [
imageInputLayer([64 64 1]) % My initial image is 64x64
convolution2dLayer(3,8,'Padding','same')
batchNormalizationLayer
reluLayer
% averagePooling2dLayer(2,'Stride',1) % Don't want this
convolution2dLayer(3,16,'Padding','same')
batchNormalizationLayer
reluLayer
% averagePooling2dLayer(2,'Stride',1) % Don't want this
convolution2dLayer(3,32,'Padding','same')
batchNormalizationLayer
reluLayer
convolution2dLayer(3,1,'Padding','same') % Because my initial image 3rd dimension is one.
batchNormalizationLayer
reluLayer
dropoutLayer(0.2)
% fullyConnectedLayer(10) % Don't want this
regressionLayer];
  2 comentarios
mary john
mary john el 20 de Mayo de 2022
I tried to do this and gets an error with the regression layer. It states that the ''output size does not match response size", when I use the deep network designer.
Can you please help me to solve. Thanks in advance

Iniciar sesión para comentar.

Más respuestas (1)

Johanna Pingel
Johanna Pingel el 29 de Abr. de 2019
  3 comentarios
supriya Naik
supriya Naik el 28 de Ag. de 2020
Did you get the solution sir?? Actually I am facing same problem now.

Iniciar sesión para comentar.

Categorías

Más información sobre Image Data Workflows en Help Center y File Exchange.

Productos


Versión

R2018b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by