Read Document
Input
file
The file port.
Output
output
The output port.
Parameters
File
Name of the file to read the data from.
Extract text only
If checked, structural information like xml or html tags will be ignored and discarded.
Use file extension as type
If checked, the type of the files will be determined by their extensions. Unknown extensions will be treated as text files.
Content type
The content type of the input texts
Encoding
The encoding used for reading or writing files.