reading data after a string

1 visualización (últimos 30 días)
Sergio
Sergio el 27 de Oct. de 2013
Editada: Cedric el 27 de Oct. de 2013
Hi,
I would like to read the data below the words 'Link Results'. What i would really like to know is how to get the line number in the text file that I'm trying to read that has 'Link Results'. After I get that I know how to read the data. so my question would be how to find that number.
Thanks!
  3 comentarios
Sergio
Sergio el 27 de Oct. de 2013
Here is a piece of the text I would like to read
. . . 877 0.01 0.01 0.01 72.01 0.01 1047 0.01 0.01 0.01 72.01 0.01
************
Link Results
************
<<< Link 1301 >>>
-------------------------------------------------------------
Flow Velocity Depth Percent
Date Time CFS ft/sec feet Full
-------------------------------------------------------------
SEP-29-2010 00:15:00 0.000 0.000 0.000 0.0
SEP-29-2010 00:30:00 0.000 0.000 0.000 0.0
SEP-29-2010 00:45:00 0.000 0.000 0.000 0.0
.
.
.
I figured I could just get the line index where it reads 'Link Results' (which is 7063 in this case but changes with the other text files I need to run through) and add 8 to get to my data
Thanks!
Cedric
Cedric el 27 de Oct. de 2013
And how does the table end? Is it the end of file or is there some other content afterwards?

Iniciar sesión para comentar.

Respuesta aceptada

Cedric
Cedric el 27 de Oct. de 2013
Editada: Cedric el 27 de Oct. de 2013
Here is one way, assuming that the block of data is the last content that you have in the file. If not, I can update the answer. As Walter is pointing out, we have to read the file from the beginning (here we read the full content) and match pattern(s). This solution does that using a a regular expression, which is quite concise, but you could implement a more basic loop which scans each line.
content = fileread( 'myFile.txt' ) ;
blocks = regexp( content, 'Link Results.*?Full.*?\-\s(.*)', 'tokens' ) ;
block = blocks{1}{1} ;
data = textscan( block, '%s %s %f %f %f %f' ) ;
After running this, you get..
>> data
data =
{3x1 cell} {3x1 cell} [3x1 double] [3x1 double] [3x1 double] [3x1 double]
>> data{1}
ans =
'SEP-29-2010'
'SEP-29-2010'
'SEP-29-2010'
>> data{2}
ans =
'00:15:00'
'00:30:00'
'00:45:00'
>> data{3}
ans =
0
0
0
etc. Then you can post-process this cell array to get e.g. a cell array of time stamps, and a numeric array of data..
timeStamps = arrayfun( @(r) [data{1}{r}, ' ', data{2}{r}], ...
1:numel(data{1}), 'UniformOutput', false ).' ;
dataNum = [data{3:end}] ;
which leads to:
>> timeStamps
timeStamps =
'SEP-29-2010 00:15:00'
'SEP-29-2010 00:30:00'
'SEP-29-2010 00:45:00'
>> dataNum
dataNum =
0 0 0 0
0 0 0 0
0 0 0 0
  4 comentarios
Sergio
Sergio el 27 de Oct. de 2013
This would be the end of the text file **** OCT-01-2010 23:30:00 15.833 1.067 0.819 1.7 OCT-01-2010 23:45:00 15.496 1.061 0.810 1.7 OCT-02-2010 00:00:00 15.175 1.055 0.801 1.7
Analysis begun on: Sun Oct 27 16:18:20 2013
Analysis ended on: Sun Oct 27 16:18:37 2013
Total elapsed time: 00:00:17
Cedric
Cedric el 27 de Oct. de 2013
Editada: Cedric el 27 de Oct. de 2013
Ok, just update the pattern in the regexp call as follows:
blocks = regexp( content, 'Link Results.*?Full.*?\-\s(.*?)\s*Ana', ...
'tokens' ) ;

Iniciar sesión para comentar.

Más respuestas (1)

Walter Roberson
Walter Roberson el 27 de Oct. de 2013
There are no functions built into MATLAB, or to any of the operating systems that current MATLAB run on, that can tell you which line number of a file that you are positioned to. None of the operating systems supported have any inherent concept of "line number".
Therefore if you want to know which line number something is on, you need to start at the beginning of the file, read line by line, counting each as you go, until you find the pattern you are looking for.

Categorías

Más información sobre Standard File Formats en Help Center y File Exchange.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by