Extract Text From Image

Synopsis

This operator extracts text from a single image using the Tesseract OCR library.

Description

This operator creates a new ExampleSet that contains the text extracted from each provided image together with the location of that image.

The operator uses version 4.1.1 of the Tesseract OCR library. It identifies the text in the image using a pre-trained LSTM model. Currently, the operator ships only with an English language model.
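
For readers who want to reproduce a comparable extraction outside the operator, the following is a minimal Python sketch that calls the tesseract command-line tool with the LSTM engine and the English model. It assumes a Tesseract 4.x binary is installed and available on the PATH. The helper names build_tesseract_command and extract_text are illustrative only and are not part of the operator; how the operator invokes Tesseract internally may differ.

```python
import subprocess


def build_tesseract_command(image_path, language="eng"):
    """Build a tesseract CLI call that prints the recognized text to stdout.

    "--oem 1" selects the LSTM engine and "-l" selects the language model.
    """
    return ["tesseract", str(image_path), "stdout", "-l", language, "--oem", "1"]


def extract_text(image_path, language="eng"):
    """Run tesseract on a single image and return the recognized text.

    Assumption: the tesseract binary is installed and on the PATH.
    Raises subprocess.CalledProcessError if tesseract exits with an error.
    """
    result = subprocess.run(
        build_tesseract_command(image_path, language),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()
```

Calling extract_text with the path of a local image would return the recognized text as a string, provided the English model (eng) is available to the installed Tesseract.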

Input

ExampleSet

This input port expects an ExampleSet containing image locations. The "single image location" parameter is shown only when this port is not connected.

Output

ExampleSet (Data Table)

A new ExampleSet containing the extracted text and the location of each image it was extracted from.
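
The shape of this output can be pictured as one row per image location. The sketch below is only an illustration in Python, not the operator's implementation: the function name build_result_table and the column names "location" and "text" are assumptions, and the OCR step is passed in as a callable so that any engine can be substituted.

```python
def build_result_table(locations, ocr):
    """Return one row per image location, pairing it with its extracted text.

    locations: iterable of image paths.
    ocr: callable that takes a path and returns the recognized text.
    """
    rows = []
    for location in locations:
        rows.append({"location": location, "text": ocr(location)})
    return rows


def fake_ocr(path):
    """Stand-in OCR function used here so the example runs without Tesseract."""
    return "text from " + path


table = build_result_table(["invoice.png", "receipt.png"], fake_ocr)
for row in table:
    print(row["location"], "->", row["text"])
```

In practice the ocr argument would be a function that actually runs an OCR engine on the file at the given location.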

Parameters

Single image location

The location of the image from which the text should be extracted.