Info

La pregunta está cerrada. Vuélvala a abrir para editarla o responderla.

Regular expressions, words in a pattern

2 visualizaciones (últimos 30 días)
bububu
bububu el 20 de Nov. de 2013
Cerrada: MATLAB Answer Bot el 20 de Ag. de 2021
I am trying to generate a pattern including the following items:
do not run
does not run
has not run
have not run
and here comes the problem...
have not recently run
or
do not go out or run
so, what I am using is the following: '((do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*)' trying to capture with '(\w+)\s' the fact that other words could be in between, but it does not work with words in between. The text is a string... Thanks in advance

Respuestas (3)

Simon
Simon el 20 de Nov. de 2013
Hi!
"\w" does not match spaces!
  3 comentarios
Simon
Simon el 20 de Nov. de 2013
It does in my test:
str = 'have not recently run';
pattern = '((do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*)';
[a,b] = regexp(str, pattern, 'start', 'tokens');
Do you have more than one space in front of or behind "recently"? Then "\s?" doesn't match.
You may also remove the outer parentheses around your pattern to get all matches like:
str = 'have not recently run';
pattern = '(do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*';
[a,b] = regexp(str, pattern, 'start', 'tokens');
b{:}
bububu
bububu el 20 de Nov. de 2013
THANKS A LOT! IT WAS USEFUL!

Image Analyst
Image Analyst el 20 de Nov. de 2013
I'm very confused by what you want to do - I have no idea if you want to generate patterns from combinations of those phrases, or if you want to extract words or phrases in that list from larger sentences. Anyway (whatever you want to do), I think that either ismember () or allwords may be useful to you.

bububu
bububu el 20 de Nov. de 2013
I am looking for a certain pattern within a text. The pattern includes: i) do not run, ii) does not run, iii) has not run, iv) have not run or v) have not recently run. However, when I have more than one word between NOT and RUN, as in vi) do not go out or run, my pattern: pattern =' ((do(es)|ha(s|ve)|did|are)\s?no(t)\s?((\w+)\s|\s)?run*)'; does not find vi) do not go out or run with regexp. I would like to have not only characters between NOT and RUN, but also words... such that it can also find "do NOT go out or RUN"

La pregunta está cerrada.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by