Soft blue gradient with fluid, wave-like abstract shapes.

Localization

India

Number of employees

11-50

Creation date

1985

Description

Primary activity: Tesseract OCR is an open-source optical character recognition (OCR) engine — originally developed by Hewlett-Packard and maintained by Google since 2006 — dedicated to extracting text from images.

Products / services: distributed OCR software/library (open-source project, public repository on GitHub) usable as a local engine or integrated into APIs and applications to convert images/scans into usable text.

Target market: developers and companies across all sectors requiring OCR capabilities (banks and financial services, IT/software, public services/government, consulting, etc.). Examples of cited users: ING, HSBC, Bajaj Finserv, Scalable Capital, Evalueserve.