Data format for multiple categories in R-CNN
Mostrar comentarios más antiguos
I am trying to train an R-CNN to identify objects of different categories in an image. However, there is one part in the official documentation that I do not understand. It is clear that one needs to increase the number of nodes in the last fully-connected layer to take the categories into account, but I cannot understand the explanation for how the training data table should be structured. It says;
'Labeled ground truth images, specified as a table with two or more columns. The first column must contain path and file names to images that are either grayscale or true color (RGB). The remaining columns must contain bounding boxes related to the corresponding image. Each column represents a single object class, such as a car, dog, flower, or stop sign.'
So each column after the first column corresponds to a category. But what if not every category is present in an image? Should the content in those columns just be an empty cell, or a [0 0 0 0] bounding box? Also, how should one define the bounding box if there are more than one instance of a category in the same image?
Respuesta aceptada
Más respuestas (0)
Categorías
Más información sobre Semantic Segmentation en Centro de ayuda y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!