Paper Mode in VisionScript Programs

Under the hood, Paper Mode is powered by three lines of VisionScript:

Load["image.jpg"]
GetText[]
Say[]

This program:

Loads an image from a file;
Reads the text in the image, and;
Returns the text as plain text.

Behind the scenes, GetText[] calls the Google Cloud OCR API to read the text in the image. This API was chosen due to its superior performance over various well-known open source OCR libraries that were tested.

Then, error correction is applied to try and fix any errors present.

The text is returned to the Notebook and turned into a VisionScript program in Interactive Mode.

Paper Mode in VisionScript Programs

See Also