Read text from PDF, Microsoft Word, HTML, and plain text files
str = extractFileText(filename,Name,Value) specifies additional options using one or more name-value pair arguments.
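As a minimal sketch of the name-value syntax, the snippet below writes a small plain text file and reads it back. The temporary file and its contents are made up for illustration, and the 'Encoding' option is assumed to be among the supported name-value pairs; check the argument list for your release before relying on it.

```matlab
% Create a small UTF-8 text file in a temporary location (hypothetical sample data).
filename = [tempname '.txt'];
fid = fopen(filename, 'w', 'n', 'UTF-8');
fprintf(fid, 'The quick brown fox.');
fclose(fid);

% Read the file back, passing the encoding as a name-value pair.
% 'Encoding' is assumed here to be a supported option.
str = extractFileText(filename, 'Encoding', 'UTF-8');

% Remove the temporary file; str keeps the extracted text.
delete(filename);
```

The returned str can then be passed on to functions such as tokenizedDocument for further processing.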
To read text directly from HTML code, use extractHTMLText.
extractHTMLText | readPDFFormData | tokenizedDocument | writeTextDocument