Semantic Segmentation

Label ground truth and perform semantic segmentation using pretrained AI models, train custom networks like U-Net with transfer learning

Semantic segmentation tools in Computer Vision Toolbox™ enable you to perform pixel-level classification of images using both pretrained AI models and custom deep learning networks. You can start by creating labeled ground truth using the Image Labeler and Video Labeler apps, which support interactive and AI-assisted pixel-level annotation in images and videos with class labels. For more information, see Label Pixels for Semantic Segmentation.

The toolbox provides pretrained semantic segmentation models such as BiSeNet V2. You can use these models directly for inference, or adapt them to specific applications. You can also train custom segmentation networks using transfer learning with architectures like U-Net, 3D U-Net, and DeepLab v3+. For more information, see Get Started with Semantic Segmentation Using Deep Learning.

To prepare training data, the toolbox offers utilities for loading and managing data sets along with organizing and analyzing label distributions. The toolbox also supports data augmentation and preprocessing. For more information, see Training Data for Object Detection and Semantic Segmentation.

After you generate predictions using pretrained or custom models, you can evaluate segmentation predictions against the ground truth, and compute evaluation metrics like the contour matching score, Sørensen-Dice similarity, Jaccard similarity, and confusion matrix. For more information, see evaluateSemanticSegmentation.

Input image of a seascape, then a series of cubes representing a deep learning network, and a semantically segmented output image of the input image.

Apps

Image Labeler	Label images for computer vision applications
Video Labeler	Label video for computer vision applications

Functions

expand all

Pretrained Semantic Segmentation AI Models

`bisenetv2`	Create BiSeNet v2 convolutional neural network for semantic segmentation (Since R2025a)
`semanticseg`	Semantic image segmentation using deep learning

Train Semantic Segmentation Networks Using Transfer Learning

`bisenetv2`	Create BiSeNet v2 convolutional neural network for semantic segmentation (Since R2025a)
`unet`	Create U-Net convolutional neural network for semantic segmentation (Since R2024a)
`unet3d`	Create 3-D U-Net convolutional neural network for semantic segmentation of volumetric images (Since R2024a)
`deeplabv3plus`	Create DeepLab v3+ convolutional neural network for semantic image segmentation (Since R2024a)
`focalCrossEntropy`	Compute focal cross-entropy loss
`generalizedDice`	Generalized Sørensen-Dice similarity coefficient for image segmentation

Load Training Data

`combine`	Combine data from multiple datastores
`countEachLabel`	Count occurrence of pixel or box labels
`groundTruth`	Ground truth label data
`imageDatastore`	Datastore for image data
`pixelLabelDatastore`	Datastore for pixel label data
`pixelLabelTrainingData`	Create training data for semantic segmentation from ground truth

Augment and Preprocess Training Data

`balancePixelLabels`	Balance pixel labels by oversampling block locations in large images
`imwarp`	Apply geometric transformation to image
`imcrop`	Crop image
`imresize`	Resize image
`transform`	Transform datastore
`randomAffine2d`	Create randomized 2-D affine transformation
`randomWindow2d`	Randomly select rectangular region in image
`centerCropWindow2d`	Create rectangular center cropping window

Evaluate Semantic Segmentation Results

`evaluateSemanticSegmentation`	Evaluate semantic segmentation data set against ground truth
`bfscore`	Contour matching score for image segmentation
`dice`	Sørensen-Dice similarity coefficient for image segmentation
`jaccard`	Jaccard similarity coefficient for image segmentation
`segmentationConfusionMatrix`	Confusion matrix of multi-class pixel-level image segmentation
`semanticSegmentationMetrics`	Semantic segmentation quality metrics

Visualize Segmentation

`labeloverlay`	Overlay label matrix regions on 2-D image
`volshow`	Display volume
`insertObjectMask`	Insert masks in image or video stream

Topics

Get Started

Get Started with Semantic Segmentation Using Deep Learning
Segment objects by class using deep learning networks such as U-Net and DeepLab v3+.
Choose Function to Visualize Detected Objects
Compare visualization functions.

Create Ground Truth for Semantic Segmentation

Label Pixels for Semantic Segmentation
Label pixels for training a semantic segmentation network by using a labeling app.
How Labeler Apps Store Exported Pixel Labels
Learn how the labeling apps store pixel label data.

Prepare Training Data for Semantic Segmentation

Training Data for Object Detection and Semantic Segmentation
Create training data for object detection or semantic segmentation using the Image Labeler or Video Labeler.
Datastores for Deep Learning (Deep Learning Toolbox)
Learn how to use datastores in deep learning applications.
Get Started with Image Preprocessing and Augmentation for Deep Learning
Preprocess data for deep learning applications with deterministic operations such as resizing, or augment training data with randomized operations such as random cropping.