How do I match nested parenthesis (brackets, or braces) with dynamic regular expressions?

49 visualizaciones (últimos 30 días)
One can read all over the web how it is impossible to use regular expressions to match nexted parenthesis. However MATLAB has this cool feature called 'dynamic regular expressions' that allow one to insert some MATLAB code to do all kinds of special 'gymnastics'. Is there a way to use this feature to count instances of parenthesis and, in turn, find their matches? Consider the following string:
g = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
or
g = 'asdf (( (dwer ) e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
or
g = 'asdf ( dwer e: )asd fg ( qwe 4 dfy5 57) q34 dqa5';
Specifically, my need is only to match the first left parenthesis with its partner but one would think the more general solution of matching all sets of parenthesis is feasible with dynamic regular expressions. If anyone can help with this, it would be much appreciated.

Respuesta aceptada

Stephen23
Stephen23 el 1 de Abr. de 2020
Editada: Stephen23 el 1 de Abr. de 2020
This matches the outer-most matched pair of parentheses:
>> str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5';
>> fun = @(s)sprintf('.{%d}',find(cumsum((s==')' )-(s=='('))>0,1,'first'));
>> out = regexp(str,'\((??@fun($''))','match')
out =
'(( dwer e: ( asdedsdskek))::)' '( qwe 4 dfy5 57)'
  2 comentarios
Dan
Dan el 1 de Abr. de 2020
Wow ... spending your covid-19 quarentine perusing old MATLAB questions? I appreciate the effort.

Iniciar sesión para comentar.

Más respuestas (2)

Walter Roberson
Walter Roberson el 17 de Mzo. de 2014
It might be possible, but it will not be easy.
The regular expressions supported by MATLAB are very similar to the regular expressions supported by Perl.
Here is one way to use Perl just to count to see if parens are matched:
In pattern matching in Perl in which you are trying to balance pairs, see
and the (?PARNO) construct described at
The (?PARNO) and recurse constructs are not supported by MATLAB.
You just might be able to use the dynamic expressions to invoke a function that names itself inside of a dynamic expression, thus achieving recursion.
Warning: you will spend a lot of time getting it right. It would be much easier to write some code that did the analysis then to try to use regular expressions for it.
  1 comentario
Dan
Dan el 18 de Mzo. de 2014
Thanks, Walter ... I ended up pulling in the string and writing a little MATLAB subroutine to do the task so I'm not motivated to research your references. Hopefully someone else can use the information in the future. Dan

Iniciar sesión para comentar.


Daniel Renjewski
Daniel Renjewski el 15 de Mzo. de 2023
Editada: Daniel Renjewski el 15 de Mzo. de 2023
I have got a similar problem as I wanted to identify fractions in the string of an equation to replace it with proper latex code. The following function gives you the position of all pairs of open and closing brackets with their respective position in the string, assuming there are indeed only pairs.
str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5'
str = 'asdf (( dwer e: ( asdedsdskek))::)asd fg ( qwe 4 dfy5 57) q34 dqa5'
br = detect_brackets(str)
br = 4×2
17 30 7 31 6 34 42 57
for idx = 1:size(br,1)
display(str(br(idx,1):br(idx,2)))
end
( asdedsdskek) ( dwer e: ( asdedsdskek)) (( dwer e: ( asdedsdskek))::) ( qwe 4 dfy5 57)
function [oc] = detect_brackets(str)
oc = [];
% find all opening and closing brackets in the string
op=strfind(str,'(');
cl=strfind(str,')');
% search for pairs until all are identified
while ~isempty(op | cl)
% find opening bracket for first closing bracket
idx = find(op < cl(1),1,'last');
% append this pair to function output
oc = [oc;op(idx) cl(1)];
% remove found opening bracket from vector
op(idx) = [];
% remove found closing bracket from vector
cl(1) = [];
end
end

Productos

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by