Automate - Run Native OCR on Linux

The material in this document is for informational purposes only. This guide assumes that the most recent version of Rampiva Automate is in use unless otherwise noted in the prerequisites. The products it describes are subject to change without prior notice, due to the manufacturer’s continuous development program. Rampiva makes no representations or warranties with respect to this document or with respect to the products described herein. Rampiva shall not be liable for any damages, losses, costs or expenses, direct, indirect or incidental, consequential or special, arising out of, or related to the use of this material or the products described herein.

Introduction

This document outlines the configuration required to run the Rampiva Native OCR on a Linux installation.

Prerequisites 

  • Rampiva Automate 7 or later

  • Ubuntu 20.04 LTS or later


Instructions

A. Install Tesseract OCR

1

Install the Tesseract OCR package.

For a different Linux distribution, run the equivalent comments to install the tesseract-ocr package version 5 or later.

sudo apt update sudo apt install tesseract-ocr
2

Confirm the folder in which Tesseract OCR was installed.

Sample output:

/usr/bin/tesseract

which tesseract

B. Configure the Workflow

1

Edit the Workflow.

2

Select the Configure Native OCR operation.

3

Check the option OCR engine binaries folder and enter the path where the tesseract binary was installed (ex: /usr/bin).

4

Save and test the Workflow.