webread is not getting all text on a website

Will Kinsman on 28 Feb 2016
Answered: Walter Roberson on 28 Feb 2016
Hi all,
I am trying to build a program to get the plain text from a website. The issue I am encountering is that webread does not seem to collect all of the text on the page (specifically the table; see the URL in the code below). As I see it, I have two options:
  1. query a third-party HTML-to-plain-text website that can do a better job
  2. determine whether there is a workaround that catches more text than the webread method I am using now
Here is my code:
html = webread('https://finance.yahoo.com/q/bs?s=MXWL');
txt = regexprep(html,'<script.*?/script>','');
saveTXT(txt,'htmlplaintext');
Help is greatly appreciated; I love you guys!
Will
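
The second option in the question can be sketched as follows. This is only a minimal illustration, not a confirmed fix: it assumes the table markup is actually present in the HTML that webread returns, which will not be the case if the page fills the table in with JavaScript after loading. The sample string, its contents, and the variable names are hypothetical stand-ins so the snippet runs without a network connection.

```matlab
% Hypothetical sample standing in for the text returned by webread.
html = ['<html><head><style>td {color: red;}</style>' ...
        '<script>var x = 1;</script></head><body>' ...
        '<table><tr><td>Total Assets</td><td>100</td></tr></table>' ...
        '</body></html>'];

% Remove script and style blocks together with their contents.
txt = regexprep(html, '<script.*?</script>', '', 'ignorecase');
txt = regexprep(txt,  '<style.*?</style>',   '', 'ignorecase');

% Replace every remaining tag with a space so adjacent cells stay separated.
txt = regexprep(txt, '<[^>]*>', ' ');

% Collapse runs of whitespace and trim both ends.
txt = strtrim(regexprep(txt, '\s+', ' '));

disp(txt)   % displays: Total Assets 100
```

To try this on a live page, the sample string could be replaced with `html = webread(url, weboptions('ContentType', 'text'));`, which asks webread to return the response as raw text. If the table is still missing from `html` at that point, no amount of tag stripping will recover it.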

Answers (1)

Walter Roberson on 28 Feb 2016
