OCR using Tesseract

•Download as PPTX, PDF•

9 likes•5,173 views

This presentation explains the working of OCR engine for character recognition. It demonstrates the working of the Tesseract by Google.

Engineering

Real time OCR
using Tesseract
12BCE094
SHOBHIT CHITTORA

Brief History Of Tesseract
 Open Source OCR engine sponsored by Google since 2006.
 One of the most accurate open source OCR engines currently
available.
 Originally developed by HP between 1985-1994.
 Lot of it is written in C and C++.

Spaces between words are tricky
too
 Italics, digits, punctuation all create special-case font-dependent
spacing.
 Fully justified text in narrow columns can have vastly varying spacing
on different lines.

Outline Approximation
 Polygonal approximation is a double-edged sword.
 Noise and some pertinent information are both lost.

Why it’s called Tesseract?
 Elements of the polygonal approximation, clustered within a
character/font combination.
 x, y position, direction, and length (as a multiple of feature length)

Character Classifier (Features and
Matching)
 Static classifier uses outline fragments as features. Broken characters are
easily recognizable by a small->large matching process in classifier. (This is
slow.)
 Adaptive classifier uses the same technique!

Classifier as Histogram of Gradients
 Quantize character area.
 Compute gradients within.
 Histograms of gradients map to fixed dimension feature vector.

Character Segmentation
 Segmentation Graphs

Rating and Certainty
 Rating = Distance * Outline length
○ Total rating over a word (or line if you prefer) is normalized
○ Different length transcriptions are fairly comparable
 Certainty = -20 * Distance
○ Measures the absolute classification confidence
○ Surrogate for log probability and is used to decide what needs
more work.

Implementation using Tess-two( Tess
port for Android)
 The Tess-two library is an open source port of Tesseract engine for
Android.
 Only the most basic and popular functionalities are ported.
 Things such as deep neutral nets are not ported.
 A lot of tweaking is required to produce desired results.

Implementing Real Time OCR and
challenges
 Image processing on memory limited devices is difficult.
 Limited clock speeds to process huge matrices.
 Running the Camera Surface Holder in MainUI and preprocessing
and OCR on user threads.
 Maintaining huge Bitmaps for preprocessing and sending to multiple
threads.
 Avoiding Garbage Collection of important preprocessed data.

What's hot

CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...Editor IJMTER

Handwriting RecognitionBindu Karki

Hand Written Character Recognition Using Neural Networks Chiranjeevi Adi

Optical Character Recognition (OCR)Vidyut Singhania

Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...iosrjce

A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESijcsitcejournal

Optical character recognition (ocr) pptDeijee Kalita

optical character recognition systemVijay Apurva

Optical Character RecognitionDurjoy Saha

Ocr abstractPunya Prakash

Edge linking in image processingVARUN KUMAR

Optical character recognition IEEE Paper StudyEr. Ashish Pandey

Final Report on Optical Character Recognition Vidyut Singhania

TEXT-SPEECH PPT.pptxNsaroj kumar

Object detection with deep learningSushant Shrivastava

Optical Character Recognition (OCR) Systemiosrjce

Text reader [OCR]MisbahUddin52

Object detectionJksuryawanshi

Real Time Object TrackingVanya Valindria

3 d display methodsShami Al Rahad

What's hot (20)

CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...

Handwriting Recognition

Hand Written Character Recognition Using Neural Networks

Optical Character Recognition (OCR)

Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...

A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES

Optical character recognition (ocr) ppt

optical character recognition system

Optical Character Recognition

Ocr abstract

Edge linking in image processing

Optical character recognition IEEE Paper Study

Final Report on Optical Character Recognition

TEXT-SPEECH PPT.pptx

Object detection with deep learning

Optical Character Recognition (OCR) System

Text reader [OCR]

Object detection

Real Time Object Tracking

3 d display methods

Viewers also liked

Tamil OCR using Tesseract OCR Enginebalamurugan.k Kalibalamurugan

OCR using TesseractShobhit Chittora

Tesseract OCR Engine - OpenFest 2009Svetlin Nakov

Scalable OCR with NiFi and TesseractDataWorks Summit/Hadoop Summit

Text Detection and RecognitionBadruz Nasrin Basri

ocrSangram Keshari Senapati

Tasract OCRRaghu nath

BrailleOCR: An Open Source Document to Braille Converter Applicationpijush15

Tiny Google ProjectsOstap Andrusiv

2 architecture anddatastructuresSolin TEM

TCDL15 Beyond eMOPMatt Christy

reelyActive Brick & Mortar Retail SolutionreelyActive

ocr with N NMarwa Alkubaissy

As Ict (Ocr) G061 3.1.6 Application Software used for the Presentation & Comm...Christos Demetriou

Ocr colorVijay Krishna

OCRjacekb

Node wayEthan Zhang

Introduction to python for Beginners Sujith Kumar

Introduction to Python amiable_indian

My Final Year B.Tech Research ProjectEeshan Srivastava

Viewers also liked (20)

Tamil OCR using Tesseract OCR Engine

OCR using Tesseract

Tesseract OCR Engine - OpenFest 2009

Scalable OCR with NiFi and Tesseract

Text Detection and Recognition

ocr

Tasract OCR

BrailleOCR: An Open Source Document to Braille Converter Application

Tiny Google Projects

2 architecture anddatastructures

TCDL15 Beyond eMOP

reelyActive Brick & Mortar Retail Solution

ocr with N N

As Ict (Ocr) G061 3.1.6 Application Software used for the Presentation & Comm...

Ocr color

OCR

Node way

Introduction to python for Beginners

Introduction to Python

My Final Year B.Tech Research Project

Similar to OCR using Tesseract

Script Identification Using MATLABAnimesh Mishra

Rust presentation convergeconfKrishna Kumar Thokala

Texture features based text extraction from images using DWT and K-means clus...Divya Gera

License Plate RecognitionGilbert

Introduction to Tensor Flow-v1.pptxJanagi Raman S

"Source Code Abstracts Classification Using CNN", Vadim Markovtsev, Lead Soft...Dataconomy Media

Turtlebot Poster_Summer 2016Ye Sung (Rebecca) Kim

IRJET- Text Extraction from Text Based Image using AndroidIRJET Journal

Modern C++Richard Thomson

Design and Description of Feature Extraction Algorithm for Old English FontIRJET Journal

An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...iosrjce

Building scalable and language-independent Java services using Apache Thrift ...IndicThreads

Building scalable and language independent java services using apache thriftTalentica Software

Ocr 1Manoj Nanduri

TensorFlow for HPC?inside-BigData.com

Optimization of Incremental Queries CloudMDE2015József Makai

Scripting in InduSoft Web StudioAVEVA

Greg Hogan – To Petascale and Beyond- Apache Flink in the CloudsFlink Forward

Developing Actors in Azure with .netMarco Parenzan

Ary Mouse for Image ProcessingIJERA Editor

Similar to OCR using Tesseract (20)

Script Identification Using MATLAB

Rust presentation convergeconf

Texture features based text extraction from images using DWT and K-means clus...

License Plate Recognition

Introduction to Tensor Flow-v1.pptx

"Source Code Abstracts Classification Using CNN", Vadim Markovtsev, Lead Soft...

Turtlebot Poster_Summer 2016

IRJET- Text Extraction from Text Based Image using Android

Modern C++

Design and Description of Feature Extraction Algorithm for Old English Font

An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...

Building scalable and language-independent Java services using Apache Thrift ...

Building scalable and language independent java services using apache thrift

Ocr 1

TensorFlow for HPC?

Optimization of Incremental Queries CloudMDE2015

Scripting in InduSoft Web Studio

Greg Hogan – To Petascale and Beyond- Apache Flink in the Clouds

Developing Actors in Azure with .net

Ary Mouse for Image Processing

Recently uploaded

University management System project report..pdfKamal Acharya

Java Programming :Event Handling(Types of Events)simmis5

BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis

VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat

College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEslot gacor bisa pakai pulsa

result management system report for college projectTonystark477637

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth

KubeKraft presentation @CloudNativeHooghlysanyuktamishra911

Introduction to IEEE STANDARDS and its different types.pptxupamatechverse

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile

Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona

UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat

Recently uploaded (20)

University management System project report..pdf

Java Programming :Event Handling(Types of Events)

BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...

VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking

High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE

result management system report for college project

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...

KubeKraft presentation @CloudNativeHooghly

Introduction to IEEE STANDARDS and its different types.pptx

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record

Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts

Processing & Properties of Floor and Wall Tiles.pptx

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...

OCR using Tesseract

1. Real time OCR using Tesseract 12BCE094 SHOBHIT CHITTORA

2. Brief History Of Tesseract  Open Source OCR engine sponsored by Google since 2006.  One of the most accurate open source OCR engines currently available.  Originally developed by HP between 1985-1994.  Lot of it is written in C and C++.

3. TessOCR Architecture

4. Adaptive Thresholding is Essential

5. Baselines are rarely perfectly straight

6. Spaces between words are tricky too  Italics, digits, punctuation all create special-case font-dependent spacing.  Fully justified text in narrow columns can have vastly varying spacing on different lines.

7. Tesseract Word Recognizer

8. Outline Approximation  Polygonal approximation is a double-edged sword.  Noise and some pertinent information are both lost.

9. Why it’s called Tesseract?  Elements of the polygonal approximation, clustered within a character/font combination.  x, y position, direction, and length (as a multiple of feature length)

10. Character Classifier (Features and Matching)  Static classifier uses outline fragments as features. Broken characters are easily recognizable by a small->large matching process in classifier. (This is slow.)  Adaptive classifier uses the same technique!

11. Classifier as Histogram of Gradients  Quantize character area.  Compute gradients within.  Histograms of gradients map to fixed dimension feature vector.

12. Character Segmentation  Segmentation Graphs

13.

14. Rating and Certainty  Rating = Distance * Outline length ○ Total rating over a word (or line if you prefer) is normalized ○ Different length transcriptions are fairly comparable  Certainty = -20 * Distance ○ Measures the absolute classification confidence ○ Surrogate for log probability and is used to decide what needs more work.

15. Tesseract Training

16. Implementation using Tess-two( Tess port for Android)  The Tess-two library is an open source port of Tesseract engine for Android.  Only the most basic and popular functionalities are ported.  Things such as deep neutral nets are not ported.  A lot of tweaking is required to produce desired results.

17. DEMO

18. Implementing Real Time OCR and challenges  Image processing on memory limited devices is difficult.  Limited clock speeds to process huge matrices.  Running the Camera Surface Holder in MainUI and preprocessing and OCR on user threads.  Maintaining huge Bitmaps for preprocessing and sending to multiple threads.  Avoiding Garbage Collection of important preprocessed data.

19. Thank You

OCR using Tesseract

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to OCR using Tesseract

Similar to OCR using Tesseract (20)

Recently uploaded

Recently uploaded (20)

OCR using Tesseract