OCR SDK Technology

LEADTOOLS provides fast and highly accurate Optical Character Recognition SDK technology for .NET (C# & VB), C/C++, iOS, macOS, Linux, Java, and web developers. Leverage the high-level LEADTOOLS OCR toolkit to rapidly develop robust, scalable, and high-performance recognition and document processing applications that extract text from scanned documents and convert images to text-searchable formats such as PDF, PDF/A, DOC, DOCX, XML, and XPS.

The advanced OCR SDK technology in LEADTOOLS is multi-faceted: it can be used as a standalone feature or as the driving force behind more advanced technologies such as forms recognition, check recognition, and document conversion. Used on its own, it lets programmers convert an image to a text-searchable document in as few as three lines of code.

With extensive support for more than forty character sets, programmers can expand their customer base by providing the same solution for many languages, including English, Spanish, French, German, Japanese, Chinese, and Arabic.

Overview of LEADTOOLS OCR SDK Technology

Speed and Reliability

  • Fast, accurate, and reliable optical character recognition for use in any application and environment
    • Large volume document batch processing
    • Single and multi-page documents
    • Tablet and mobile devices
    • Web and Cloud applications
  • Utilise multiple cores for unparalleled performance

Versatility

  • Fully featured SDK
    • High-level classes for one-shot recognition from image to final document
    • Low-level functionality for full customisation
  • Supports multiple text recognition engines
  • Fully configurable recognition engine
    • Narrow down possible results with character and numeric filters
    • Multiple voting techniques for enhanced accuracy
    • Trace and progress callbacks
    • Enable/disable font characteristics for fast text only recognition
  • Comprehensive results reporting
    • Character location, size and baseline
    • Character attributes (end of word, end of line, end of paragraph, etc.)
    • Font properties (monospace, proportional, serif, sans-serif, bold, italic, underline, strikethrough)
    • Confidence values
    • Obtain the recognised words directly for a zone or page without saving to an external document
  • Integrates seamlessly with LEADTOOLS Forms Recognition and Processing
  • Implement large-scale distributed OCR applications using LEADTOOLS Distributed Computing SDK
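
The per-character results listed above (confidence values and end-of-word attributes) lend themselves to simple post-processing. The following is a minimal sketch of one such use: flagging words for manual review when any character in them scores below a threshold. The `RecognizedChar` type, the `ConfidenceReview` helper, and the 0 to 100 confidence scale are hypothetical stand-ins chosen for illustration; they are not LEADTOOLS types, and real code would fill them from the engine's own character results.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical stand-in for one recognised character; not a LEADTOOLS type.
public struct RecognizedChar
{
    public char Code;        // the recognised character
    public int Confidence;   // assumed scale: 0 (no confidence) to 100 (certain)
    public bool EndOfWord;   // true when this character ends a word

    public RecognizedChar(char code, int confidence, bool endOfWord)
    {
        Code = code;
        Confidence = confidence;
        EndOfWord = endOfWord;
    }
}

public static class ConfidenceReview
{
    // Returns the words whose weakest character falls below minConfidence.
    public static List<string> LowConfidenceWords(
        IEnumerable<RecognizedChar> chars, int minConfidence)
    {
        var flagged = new List<string>();
        var word = new StringBuilder();
        int weakest = int.MaxValue;

        foreach (RecognizedChar c in chars)
        {
            word.Append(c.Code);
            weakest = Math.Min(weakest, c.Confidence);

            if (c.EndOfWord)
            {
                if (weakest < minConfidence)
                    flagged.Add(word.ToString());
                word.Clear();
                weakest = int.MaxValue;
            }
        }

        // Handle a trailing word that has no end-of-word marker.
        if (word.Length > 0 && weakest < minConfidence)
            flagged.Add(word.ToString());

        return flagged;
    }
}
```

For instance, given the characters of "cat" with confidences 90, 95, 92 and the characters of "d0g" with confidences 88, 40, 91, a threshold of 80 flags only "d0g".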

Cross-Platform

  • Native, full-featured Optical Character Recognition libraries for many platforms
    • Windows desktops, servers, and Phones
    • iOS and macOS
    • Android
    • Web Services
  • Native mobile libraries run on the device without need to call external services
  • Utilise camera preview on phones and tablets for real-time text recognition and processing
  • Specialised image processing and recognition for deformations unique to mobile devices
    • 3D Perspective Deskew
    • Keystone Correction
    • Fixed-focus camera optimisations
  • Programming interfaces closely resemble each other, making it easy to port an application to multiple platforms

Accuracy

  • Spell checking dictionary support
  • Automatically detect, segment, and recognise multiple languages on the same document
  • Full-page analysis and Zonal recognition
    • Automatic table area segmentation
    • Automatic OMR area segmentation
    • Automatic vertical text segmentation with its orientation angle (i.e., 0, 90, or 270 degrees)
    • Automatic segmentation of vertical text within horizontal text pages
  • Automatic document cleanup
    • Omni-directional noise removal
    • Undither text
    • Dot matrix correction
    • Option to remove lines from tables
  • Automatic document preprocessing
    • Deskew of scanned document
    • Detect and correct the orientation of the document (flipped or reversed) in multi-page or single-page modes
    • Remove borders
    • Split pages
  • Unique colour and bitonal image recognition for scanned documents and pictures can detect text regardless of foreground/background colours
  • Output searchable text document formats such as PDF, PDF/A, DOC, DOCX, XML, XPS, and more, maintaining the original look and feel
    • Text with detected font characteristics (font-family name, style, size, bold, italic, underline, strikeout, slope angle, etc.)
    • Location
    • Tables reconstruction
    • Layout
    • Graphics

OCR Languages

Recognise text from more than 40 languages and character sets.
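
As a rough sketch of how language selection might look in .NET, the snippet below lists the languages the started engine reports and then enables two of them together. It assumes the `Leadtools.Forms.Ocr` namespace that pairs with the two-argument `CreateEngine` call shown on this page (newer releases are understood to use `Leadtools.Ocr`), that a LEADTOOLS license has already been set, and that the `LanguageManager` members `GetSupportedLanguages` and `EnableLanguages` are available as described in the LEADTOOLS .NET documentation. The language codes "en" and "de" are illustrative.

```csharp
using System;
using Leadtools.Forms.Ocr; // assumption: newer releases use Leadtools.Ocr instead

class LanguageSketch
{
    static void Main()
    {
        // Assumes the LEADTOOLS license has already been set elsewhere.
        using (IOcrEngine ocrEngine =
            OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false))
        {
            ocrEngine.Startup(null, null, null, null);

            // List the languages this installation of the engine can recognise.
            foreach (string language in ocrEngine.LanguageManager.GetSupportedLanguages())
                Console.WriteLine(language);

            // Enable English and German together for mixed-language documents.
            ocrEngine.LanguageManager.EnableLanguages(new string[] { "en", "de" });

            ocrEngine.Shutdown();
        }
    }
}
```

Which languages are actually available depends on the engine and the language files installed, so checking the supported list before enabling a language is the safer order.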

LEADTOOLS OCR Engines

The LEADTOOLS OCR SDK contains the following recognition engines:

LEAD OCR Engine

Developed in-house with native libraries for Windows x86/x64, .NET (C# & VB), Linux, iOS, macOS, and Android.

Benefits of In-House Development:

  • New features and enhancements to speed and accuracy are continuously added
  • Customer feedback and feature requests are used to enhance and expand the engine with solutions to real-world applications

OmniPage OCR Engine

Licenses the Nuance OmniPage recognition engine and includes native libraries for Windows x86/x64 and .NET.

Interchangeable High-level Interface

LEADTOOLS features a high-level interface that abstracts the differences between recognition engines, so switching from one engine to another requires changing only a single line of code.

// Use the LEAD engine
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false);
// Or use the OmniPage engine instead (choose one of the two lines)
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.OmniPage, false);
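
Because the engine type is the only thing that changes, the choice can be pushed out to configuration. The sketch below shows one way this might be done. `EngineFactory` and its `engineName` parameter are hypothetical names introduced for illustration, and the `Leadtools.Forms.Ocr` namespace is an assumption that varies by LEADTOOLS version; only `OcrEngineManager.CreateEngine` and the two `OcrEngineType` values come from this page.

```csharp
using System;
using Leadtools.Forms.Ocr; // assumption: newer releases use Leadtools.Ocr instead

static class EngineFactory
{
    // Hypothetical helper: maps a configuration value to an engine type.
    public static IOcrEngine Create(string engineName)
    {
        OcrEngineType engineType =
            string.Equals(engineName, "OmniPage", StringComparison.OrdinalIgnoreCase)
                ? OcrEngineType.OmniPage
                : OcrEngineType.LEAD; // default to the in-house engine

        return OcrEngineManager.CreateEngine(engineType, false);
    }
}
```

A caller would then write `IOcrEngine ocrEngine = EngineFactory.Create("OmniPage");` and the rest of the recognition code would stay the same for either engine.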

TIFF to Searchable PDF in Three Lines of Code

The LEADTOOLS OCR SDK provides a high-level programming interface that allows developers to create complex recognition applications in record time. For example, with the AutoRecognizeManager, developers can convert any of 150+ image formats into a text-searchable format such as PDF or DOC in as few as three lines of code.

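// Create the LEAD engine and start it with default settings
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false);
ocrEngine.Startup(null, null, null, null);
// Recognise the TIFF and save it as a searchable PDF
ocrEngine.AutoRecognizeManager.Run(@"C:\in.tif", @"C:\out.pdf",
    DocumentFormat.Pdf, null, null);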
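
In a real application the same three calls would usually be wrapped with cleanup and error handling. The following is a minimal sketch of that, not the only way to structure it. It assumes the `Leadtools.Forms.Ocr` and `Leadtools.Forms.DocumentWriters` namespaces (newer releases are understood to use `Leadtools.Ocr` and `Leadtools.Document.Writer`), that a LEADTOOLS license has already been set, and that the input file exists at the path shown.

```csharp
using System;
using Leadtools.Forms.Ocr;             // assumption: varies by LEADTOOLS version
using Leadtools.Forms.DocumentWriters; // assumption: varies by LEADTOOLS version

class TiffToPdfSketch
{
    static void Main()
    {
        // Assumes the LEADTOOLS license has already been set elsewhere.
        // The using block disposes the engine even if conversion fails.
        using (IOcrEngine ocrEngine =
            OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false))
        {
            ocrEngine.Startup(null, null, null, null);
            try
            {
                // Recognise the TIFF and write a text-searchable PDF.
                ocrEngine.AutoRecognizeManager.Run(
                    @"C:\in.tif", @"C:\out.pdf", DocumentFormat.Pdf, null, null);
            }
            catch (Exception ex)
            {
                Console.Error.WriteLine("OCR conversion failed: " + ex.Message);
            }
            finally
            {
                // Release engine resources once recognition is finished.
                ocrEngine.Shutdown();
            }
        }
    }
}
```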

LEADTOOLS ICR Module – OmniPage Engine

The LEADTOOLS ICR Module – OmniPage Engine is an add-on product for integrating intelligent character recognition (ICR) and optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, scalable image-recognition solutions. LEADTOOLS ICR Module – OmniPage Engine includes the OmniPage Engine and seamlessly combines with LEADTOOLS SDKs in the Document and Medical product lines.

The OmniPage OCR engine is a 3rd-party engine packaged and licensed for use within LEADTOOLS. It is an interchangeable alternative to the in-house developed LEAD OCR engine. LEADTOOLS ICR Module – OmniPage Engine is the only LEADTOOLS product to provide ICR.

LEADTOOLS OCR Module – OmniPage Engine with Additional Languages

LEADTOOLS OCR Module – OmniPage Engine with Additional Languages is an add-on product for integrating Arabic, Chinese, Japanese, and Korean optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, scalable image-recognition solutions. LEADTOOLS OCR Module – OmniPage Engine with Additional Languages seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product families.

The OmniPage OCR engine is a 3rd-party OCR engine packaged and licensed for use within LEADTOOLS and is an interchangeable alternative to the in-house developed LEAD OCR engine. LEADTOOLS OCR Module – OmniPage Engine with Additional Languages extends the OmniPage OCR engine with support for Arabic and Asian language character sets, and is one of the OCR products in the Document Imaging family that supports Asian languages. The LEAD OCR engine, which is included with the LEADTOOLS Recognition Imaging and LEADTOOLS Document Imaging Suite products, also recognises Asian language character sets.