For example:
A = gpuArray.ones(100, 'single');
B = gpuArray.ones(50, 'single');
A(1:10,1:10) = conv2(A(1:10,1:10), B, 'same');
Error using gpuArray/gather
An unexpected error occurred during CUDA execution. The CUDA error was:
CUDA_ERROR_LAUNCH_FAILED

Error in dispInternal>iTransferPortionDense (line 36)
    data = gather( subsref( x, s ) );

Error in parallel.internal.shared.buildDisplayHelper>iFirstNNumericDisplayHelper (line 72)
    maybeTruncatedValue = transferDenseFcn( x, rangeStruct );

Error in parallel.internal.shared.buildDisplayHelper>iBuildDisplayHelper (line 33)
    dh = iFirstNNumericDisplayHelper( ...

Error in parallel.internal.shared.buildDisplayHelper (line 24)
    dh = iBuildDisplayHelper( x, transferDenseFcn, transferSparseFcn, xClassName, xName, N );

Error in dispInternal (line 13)
    dh = parallel.internal.shared.buildDisplayHelper( ...

Error in gpuArray/display (line 21)
    dh = dispInternal( obj, thisClassName, objName );