Extract only text between quotes of a string

49 visualizaciones (últimos 30 días)
Kirk woellert
Kirk woellert el 1 de Mzo. de 2012
Respondida: Pierre Harouimi el 8 de Mzo. de 2022
Folks, could use an assist. New to REGEXP but persistent. Desire only the literal text between quotes for this string:
imt = "e";
[subchunk] = regexp(textline,'\".*?\"','match','ignorecase');
imt = subchunk;
When I return the argument from my readtext file function, and print using disp(imt), I get this:
'"e"'
When I only want:
e
I assume the single quotes are associated with the disp() function?

Respuesta aceptada

Walter Roberson
Walter Roberson el 1 de Mzo. de 2012
Editada: John Kelly el 26 de Feb. de 2015
subchunk = regexp(textline, '(?<=")[^"]+(?=")', 'match');
imt = subchunk{1};
You could also consider just using a basic strfind() for '"', removing everything up to the first match and everything from the second match on.
  8 comentarios
Walter Roberson
Walter Roberson el 2 de Mzo. de 2012
(?<=") is always "look behind" (from where you are), so
(?<=")[^"]+(?<=") would mean to look behind for a double-quote, match a bunch of non-double-quote stuff, and then look behind from between the last non-double quote and the next character (or end of string) to see if the previous character was a double-quote. Which it could not be because it was only non-double-quotes in that pattern.
Look-behind from where you are, look ahead from where you are, different operators.
The output from regexp() is always a cell array, even when only one thing is being returned. You can return that cell array, and that might be appropriate in some cases, but be sure you do not try to switch() on the cell array itself: switch on the _content_ of the cell array. The
imt = subchunk{1};
strips away the cell array layer, leaving imt as a plain string (which _can_ be switch()'d on.)
Kirk woellert
Kirk woellert el 2 de Mzo. de 2012
Awesome. Thanks for your help.

Iniciar sesión para comentar.

Más respuestas (1)

Pierre Harouimi
Pierre Harouimi el 8 de Mzo. de 2022
From R2020b, you can use pattern, much easier than the complicated regexp function.
For your pb, you the extractBetween function seems the best one:
subchunk = extractBetween("'e'", "'", "'")

Categorías

Más información sobre C Shared Library Integration en Help Center y File Exchange.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by