Multi-input imageDatastore

I am trying to train a network with N inputs to perform binary classification. Each image is multispectral (more than the 3 RGB channels). I read a comment from MathWorks staff that creates data, stores it in a datastore, and trains a network with two inputs: image and vector data.
I have also looked at the example Image Classification using CNN with Multi Input, which uses two inputs to classify digits.
Neither of these examples uses imageDatastore. I was wondering if we could use combine to merge the imageDatastore for each data sample. I read in Datastores for Deep Learning that the datastore must be a combined or transformed datastore that returns a cell array with (numInputs + 1) columns containing the predictors and the responses, where numInputs is the number of network inputs.
My question is the following: can I have multiple imageDatastore objects and combine them? If so, how do I store the label column? I tried the code below and got an error.
imds1 = imageDatastore(...);
imds2 = imageDatastore(...);
labels = ???
datastore = combine(imds1, imds2, labels);
Usually, we assign the labels through imds1.Labels. I also want to know whether, once combined, data augmentation can still be applied to the images, and whether the data can be randomly split into training and validation sets.
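For reference, this is roughly the shape I am after (sketch only, with the paths elided as above; note that arrayDatastore is only available from R2020b onward, so it may not apply here):

```matlab
% Sketch: wrap the label array in its own datastore, then combine.
imds1 = imageDatastore(...);              % first input images
imds2 = imageDatastore(...);              % second input images
labelDS = arrayDatastore(imds1.Labels);   % one label per image, same order
dsTrain = combine(imds1, imds2, labelDS); % read() -> 1x3 cell: {img1, img2, label}
```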

Answers (1)

Mohammad Sami on 5 Sep 2020 (edited 5 Sep 2020)


You can save the labels as a text file and then create a tabularTextDatastore to read them back.
Something like this could work.
read_size = 4; % ReadSize must be the same for all stores, otherwise concatenation will fail
digitDatasetPath = fullfile(matlabroot,'toolbox','nnet', ...
'nndemos','nndatasets','DigitDataset');
imds = imageDatastore(digitDatasetPath, ...
'IncludeSubfolders',true, ...
'LabelSource','foldernames');
imds.ReadSize = read_size;
labels = imds.Labels; % your labels here
% the order of your images and label must be the same
writematrix(labels,'labels.txt');
% '%C' reads the label column as categorical
labelStore = tabularTextDatastore('labels.txt','TextscanFormats','%C',"ReadVariableNames",false);
labelStore.ReadSize = read_size;
labelStoreCell = transform(labelStore,@setcat_and_table_to_cell);
finalStore = combine(imds,labelStoreCell);
% test read
read(finalStore)
You will also need to correct the categorical array categories during transformation.
function [dataout] = setcat_and_table_to_cell(datain)
validcats = string(0:9); % define valid labels for categorical array
datain.(1) = setcats(datain.(1),validcats);
dataout = table2cell(datain);
end

9 comments

OJ27 on 7 Sep 2020
Thanks, this works. In terms of splitting and data augmentation, how would you go about it? Splitting and augmenting first and then combining? splitEachLabel and augmentedImageDatastore only accept imageDatastore objects as input.
Mohammad Sami on 8 Sep 2020
Yes, I think you will have to split before you create your label datastore, and you will likely have to augment before you combine.
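A rough sketch of that order of operations (variable names are illustrative; assumes imds already has its Labels set):

```matlab
% Split FIRST, then build a label datastore per split so the
% image order and label order stay aligned.
[imdsTrain, imdsVal] = splitEachLabel(imds, 0.8, 'randomized');
writematrix(imdsTrain.Labels, 'trainlabels.txt');
writematrix(imdsVal.Labels, 'vallabels.txt');
% ...then create a tabularTextDatastore for each file, transform it,
% and combine it with the corresponding image datastore, as in the
% answer above.
```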
OJ27 on 8 Sep 2020
I split the imageDatastore before the combine operation, but after all that work I got this message:
Error using trainNetwork (line 170)
Multi-input networks do not support validation.
Caused by:
Error using trainNetwork>iValidateNoMISOAndValidationData (line 581)
Multi-input networks do not support validation.
I guess I will have to code the network in Python.
OJ27 on 10 Sep 2020
Actually, the ReadSize property is the number of image files to read in a call to the read function; thus, when read_size = 4, four images are loaded.
Also, your solution combines one imageDatastore with the labels. I need to combine multiple imageDatastores. An example that comes to mind would be to use Tiny ImageNet (64 x 64 x 3) and a grayscale version of the same images:
imds1 -> Tiny ImageNet (64 x 64 x 3)
imds2 -> grayscale version (64 x 64 x 1)
label -> 1
{64×64×3 uint8} {64×64 uint8} {[1]}
The error now is:
Error using trainNetwork (line 170)
Dimensions of arrays being concatenated are not consistent.
Error in small_multi_input_network (line 93)
[net, info] = trainNetwork(train_multi,lgraph,options);
Caused by:
Error using cat
Dimensions of arrays being concatenated are not consistent.
Mohammad Sami on 10 Sep 2020 (edited 10 Sep 2020)
That is why I set the ReadSize to be the same for all underlying image datastores and the label datastore; otherwise it complains about the concatenated arrays being inconsistent. You could also try setting it to 1.
The solution illustrates one way to combine the datastores. You can simply add more image datastores and it should still work the same way, as long as the labels are the same:
finalStore = combine(imds,imds2,imdsn,labelStoreCell);
OJ27 on 10 Sep 2020
It doesn't work with read_size = 1. Perhaps it is my CNN? My second imageDatastore is grayscale, but here I am adding imdsTrain as the second datastore in the combine just as an example.
% Read tiny images train
imdsTrain = imageDatastore('E:/tiny-imagenet-200/train/', ...
'IncludeSubfolders',true,'FileExtensions','.jpeg');
labels = split(imdsTrain.Files,'E:\tiny-imagenet-200\train\');
labels = split(labels(:,2),'\images');
labels = labels(:,1);
labels = categorical(labels);
imdsTrain.Labels = labels;
read_size = 1;
imdsTrain.ReadSize = read_size;
% Combine
trainlabels = imdsTrain.Labels; % your labels here
writematrix(trainlabels,'trainlabels.txt');
labelStore = tabularTextDatastore('trainlabels.txt','TextscanFormats','%C',"ReadVariableNames",false);
labelStore.ReadSize = read_size;
labelStoreCell = transform(labelStore,@setcat_and_table_to_cell);
train_multi = combine(imdsTrain,imdsTrain,labelStoreCell);
train_multi.read
% layerGraph
layers1 = createLayer('_1',3)
layers2 = createLayer('_2',3)
concat = depthConcatenationLayer(2,'Name','concat_1');
layers1 = [layers1; concat; convolution2dLayer([3 3],64,"Name","conv_6","Padding","same");...
fullyConnectedLayer(400,"Name","fc_3");...
fullyConnectedLayer(200,"Name","fc_4");...
softmaxLayer('Name','softmax');...
classificationLayer('Name','classOutput')]
plot(layerGraph(layers1))
lgraph = layerGraph(layers1);
lgraph = addLayers(lgraph,layers2)
lgraph = connectLayers(lgraph,'relu_5_2','concat_1/in2');
plot(lgraph)
%%
options = trainingOptions('adam', ...
'InitialLearnRate',0.005, ...
'LearnRateSchedule','piecewise',...
'MaxEpochs',100, ...
'MiniBatchSize',1024, ...
'Verbose',1, ...
'Plots','training-progress',...
'Shuffle','never');
[net, info] = trainNetwork(train_multi,lgraph,options);
%%
function layers=createLayer(s, channels)
layers = [
imageInputLayer([64 64 channels],"Name",strcat("imageinput",s))
convolution2dLayer([3 3],8,"Name",strcat("conv_1",s),"Padding","same")
batchNormalizationLayer("Name",strcat("batchnorm_1",s))
reluLayer("Name",strcat("relu_1",s))
maxPooling2dLayer([2 2],"Name",strcat("maxpool_1",s),"Stride",[2 2])
convolution2dLayer([3 3],16,"Name",strcat("conv_2",s),"Padding","same")
batchNormalizationLayer("Name",strcat("batchnorm_2",s))
reluLayer("Name",strcat("relu_2",s))
maxPooling2dLayer([2 2],"Name",strcat("maxpool_2",s),"Stride",[2 2])
convolution2dLayer([3 3],32,"Name",strcat("conv_3",s),"Padding","same")
batchNormalizationLayer("Name",strcat("batchnorm_3",s))
reluLayer("Name",strcat("relu_3",s))
maxPooling2dLayer([2 2],"Name",strcat("maxpool_3",s),"Stride",[2 2])
convolution2dLayer([3 3],64,"Name",strcat("conv_4",s),"Padding","same")
batchNormalizationLayer("Name",strcat("batchnorm_4",s))
reluLayer("Name",strcat("relu_4",s))
maxPooling2dLayer([2 2],"Name",strcat("maxpool_4",s),"Stride",[2 2])
convolution2dLayer([3 3],64,"Name",strcat("conv_5",s),"Padding","same")
batchNormalizationLayer("Name",strcat("batchnorm_5",s))
reluLayer("Name",strcat("relu_5",s))]
end
Mohammad Sami on 10 Sep 2020
I see you are using the same data store twice in the combine. This may be causing the issue. If you want to use the same images, perhaps make a second instance of the data store.
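For example (a sketch, assuming the variables from the code above): imageDatastore supports copy, so a second independent instance over the same files can be made without rebuilding it from scratch:

```matlab
imdsTrain2 = copy(imdsTrain); % independent instance reading the same files
train_multi = combine(imdsTrain, imdsTrain2, labelStoreCell);
```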
Girish Tiwari on 20 Oct 2020 (edited 20 Oct 2020)
Thanks M. Sami. Works like a charm.
However, I am getting an error while training the network with the datastore: "The output size (7) of the last layer does not match the number of classes (10)".
I have verified that there are only 7 labels and that the fully connected layer also has 7 outputs.
(See my code at: https://www.mathworks.com/matlabcentral/answers/619943-invalid-training-data-the-output-size-7-of-the-last-layer-does-not-match-the-number-of-classes-1)
Mohammad Sami on 21 Oct 2020
It's because we set 10 categories in setcat_and_table_to_cell. You should change the variable validcats to the appropriate 7 categories for your case.



Version: R2020a
Asked: 31 Aug 2020
Last commented: 21 Oct 2020
