SlideShare a Scribd company logo
1 of 24
Optical Character
Recognition
( OCR )
Karan Panjwani
T.E – B , 68
Guided By :
Prof. Shalini Wankhade
Contents
 Definition
 Introduction To OCR
 Problem Overview
 Uses
 Types
 Steps in OCR
 Accuracy
 Software Implementation
 Pros and Cons
 Research
Optical Character Recognition (OCR) is the
mechanical or electronic conversion of images of
typewritten or printed text into machine-encoded
text.
Definition
Introduction to OCR
 1 2 3 4 5 6 7 8 9 0
Problem overview
 Humans are bound to make errors – some time or the other – especially
while performing mundane boring tasks like digitization or Security,
continuously.
 Many times we are unable to perceive certain digits due to various factors
– motion, lack digit clarity, illumination and so on.
 It is these problems which have lead us to delve into this topic.
USES
 It is widely used as a form of Data Entry from Printed
Paper data records, whether Passport Documents,
Invoices, Bank Statements, Business Card, Mail or Other
Documents.
 It is common method of Digitizing Printed Texts so that
it can be Electronically edited, searched, stored more
compactly, displayed on-line, and used in Machine
Processes such as Machine Translation, Text-to-Speech,
Key Data and Text Mining.
TYPES
1) Optical Character Recognition ( OCR ) -
 Targets typewritten text, one Glyph or Character at a
time.
2) Optical Word Recognition ( OWR ) -
 Targets typewritten text, one word at a time (for
languages that use a space as a word divider).
3) Intelligent Character Recognition ( ICR ) –
 Targets handwritten print script or cursive text one
glyph or character at a time, usually involving machine
learning.
TYPES( contd…)
4) Intelligent Word Recognition ( IWR ) -
 Targets handwritten print script or cursive text, one
word at a time.
 This is especially useful for languages where glyphs
are not separated in cursive script.
Steps in OCR
Steps in ocr
Pre - processing
• Deals with Improving
quality of the Image for
better recognition by the
system. OCR software often
"pre-processes" images to
improve the chances of
successful recognition.
 Techniques include:
• De-Skew
• Despeckle
• Binarization
• Line Removal
• Zoning
• Line and Word Detection
• Script Recognition
• Segmentation
• Normalize Aspect Ratio and
Scale
Character Recognition
 There are two basic types of core OCR algorithm, which
may produce a ranked list of candidate characters.
• Matrix matching involves comparing an image to a stored
glyph on a pixel-by-pixel basis; it is also known as “pattern
matching”. This relies on the input glyph being correctly
isolated from the rest of the image, and on the stored
glyph being in a similar font and at the same scale. This
technique works best with typewritten text and does not
work well when new fonts are encountered.
• Feature extraction decomposes glyphs into “features” like
lines, closed loops, line direction, and line intersections.
Feature Extraction serves two purposes; one is to extract
properties that can identify a character uniquely. Second is
to extract properties that can differentiate between similar
characters.
Post - processing
 OCR accuracy can be increased if the output is
constrained by a lexicon – a list of words that are
allowed to occur in a document. This might be, for
example, all the words in the English language, or a
more technical lexicon for a specific field. This
technique can be problematic if the document contains
words not in the lexicon, like proper nouns. Tesseract
uses its dictionary to influence the character
segmentation step, for improved accuracy.
Accuracy
 Recognition of Latin-script, typewritten text
is still not 100% accurate even where clear
imaging is available. One study based on
recognition of 19th- and early 20th-century
newspaper pages concluded that character-
by-character OCR accuracy for commercial
OCR software varied from 81% to 99%; total
accuracy can be achieved by human review or
Data Dictionary Authentication.
 Other areas—including recognition of hand
printing, cursive handwriting, and printed
text in other scripts are still the subject of
active research.
Accuracy(contd..)
 Accuracy rates can be measured in several ways, and
how they are measured can greatly affect the reported
accuracy rate.
 For example, if word context (basically a lexicon of
words) is not used to correct software finding non-
existent words, a character error rate of 1% (99%
accuracy) may result in an error rate of 5% (95%
accuracy) or worse if the measurement is based on
whether each whole word was recognized with no
incorrect letters.
Use of Freeocr software
Pros and Cons
 OCR reduces time for processing for processing data
from large number of forms.
 If done manually, may lead to human error and takes up
much of the time.
 Recognition of cursive text is an active area of research,
with recognition rates even lower than that of hand-
printed text.
 Higher rates of recognition of general cursive script will
likely not be possible without the use of contextual or
grammatical information.
Research
 Recognition of cursive text is an active area
of research, with recognition rates even lower
than that of hand-printed text.
 Higher rates of recognition of general cursive
script will likely not be possible without the
use of contextual or grammatical information.
 For example, recognizing entire words from a
dictionary is easier than trying to parse
individual characters from script.
Conclusion
• OCR technology provides fast, automated
data capture which can save considerable
time and labour costs of organisations.
• The system has its advantages such as
Automation of mundane tasks, Less Time
Complexity, Very Small Database and High
Adaptability to untrained inputs with only
a small number of features to calculate.
References
 INTERNET :
 www.google.co.in
 www.slideshare.net
 http://www.ijsrp.org/research_paper_may2012/ijsrp-
may-2012-68.pdf
 en.wikipedia.org/wiki/Optical_character_recognition
 BOOKS’ :
 Character Recognition Systems by Mohamed Cheriet,
Nawwaf, Cheng-lin, Ching Y
THANK YOU

More Related Content

What's hot

Handwriting Recognition
Handwriting RecognitionHandwriting Recognition
Handwriting RecognitionBindu Karki
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition systemVijay Apurva
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionDurjoy Saha
 
Handwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHandwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHarshana Madusanka Jayamaha
 
Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionNaiyan Noor
 
Presentation on OCR
Presentation on OCRPresentation on OCR
Presentation on OCRxsconfused
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition pptSantosh Kumar
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Chiranjeevi Adi
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyEr. Ashish Pandey
 
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...iosrjce
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESijcsitcejournal
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionRahul Mallik
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingRishikese MR
 
Character Recognition using Machine Learning
Character Recognition using Machine LearningCharacter Recognition using Machine Learning
Character Recognition using Machine LearningRitwikSaurabh1
 
Handwritten digit recognition using image processing
Handwritten digit recognition using image processing Handwritten digit recognition using image processing
Handwritten digit recognition using image processing anita maharjan
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image ProcessingSahil Biswas
 

What's hot (20)

Handwriting Recognition
Handwriting RecognitionHandwriting Recognition
Handwriting Recognition
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Text reader [OCR]
Text reader [OCR]Text reader [OCR]
Text reader [OCR]
 
Handwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHandwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural network
 
Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer Version
 
Presentation on OCR
Presentation on OCRPresentation on OCR
Presentation on OCR
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper Study
 
Sign language recognizer
Sign language recognizerSign language recognizer
Sign language recognizer
 
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
 
ocr
ocrocr
ocr
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Character Recognition using Machine Learning
Character Recognition using Machine LearningCharacter Recognition using Machine Learning
Character Recognition using Machine Learning
 
Biometrics ppt
Biometrics pptBiometrics ppt
Biometrics ppt
 
Handwritten digit recognition using image processing
Handwritten digit recognition using image processing Handwritten digit recognition using image processing
Handwritten digit recognition using image processing
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 

Viewers also liked

Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR RecognitionBharat Kalia
 
Detecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformDetecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformPooja G N
 
Text extraction From Digital image
Text extraction From Digital imageText extraction From Digital image
Text extraction From Digital imageKaushik Godhani
 
How to Access Utility Data
How to Access Utility DataHow to Access Utility Data
How to Access Utility DataUrjanet
 
SPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilitySPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilityUrjanet
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. UrjanetUrjanet
 
The Credit Score Present and Future
The Credit Score Present and FutureThe Credit Score Present and Future
The Credit Score Present and FutureUrjanet
 
SPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionSPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionUrjanet
 
Spark 2017 Key Takeaways
Spark 2017 Key TakeawaysSpark 2017 Key Takeaways
Spark 2017 Key TakeawaysUrjanet
 
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...Urjanet
 
SPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationSPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationUrjanet
 

Viewers also liked (12)

Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
 
Detecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width TransformDetecting text from natural images with Stroke Width Transform
Detecting text from natural images with Stroke Width Transform
 
Text extraction From Digital image
Text extraction From Digital imageText extraction From Digital image
Text extraction From Digital image
 
How to Access Utility Data
How to Access Utility DataHow to Access Utility Data
How to Access Utility Data
 
SPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & SustainabilitySPARK15: Architecting The Future of Energy & Sustainability
SPARK15: Architecting The Future of Energy & Sustainability
 
OCR vs. Urjanet
OCR vs. UrjanetOCR vs. Urjanet
OCR vs. Urjanet
 
The Credit Score Present and Future
The Credit Score Present and FutureThe Credit Score Present and Future
The Credit Score Present and Future
 
SPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product VisionSPARK16 Presentation: Urjanet Product Vision
SPARK16 Presentation: Urjanet Product Vision
 
Spark 2017 Key Takeaways
Spark 2017 Key TakeawaysSpark 2017 Key Takeaways
Spark 2017 Key Takeaways
 
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
SPARK16 Presentation: Measuring for Results: Data and the Changing Energy Lan...
 
SPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through GamificationSPARK15: Simplifying Sustainability Through Gamification
SPARK15: Simplifying Sustainability Through Gamification
 

Similar to Optical Character Recognition( OCR )

Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
OCR 's Functions
OCR 's FunctionsOCR 's Functions
OCR 's Functionsprithvi764
 
OCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy EnhancementOCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy EnhancementRahul Taneja
 
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...E42 (Light Information Systems Pvt Ltd)
 
Online Hand Written Character Recognition
Online Hand Written Character RecognitionOnline Hand Written Character Recognition
Online Hand Written Character RecognitionIOSR Journals
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?ARC Document Solutions
 
Automatic handwriting recognition
Automatic handwriting recognitionAutomatic handwriting recognition
Automatic handwriting recognitionBIJIT GHOSH
 
OCR training dataset (1).docx
OCR training dataset (1).docxOCR training dataset (1).docx
OCR training dataset (1).docxShalini104884
 
IRJET- A Novel Approach – Automatic paper evaluation system
IRJET-  	  A Novel Approach – Automatic paper evaluation systemIRJET-  	  A Novel Approach – Automatic paper evaluation system
IRJET- A Novel Approach – Automatic paper evaluation systemIRJET Journal
 

Similar to Optical Character Recognition( OCR ) (20)

Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
CRC Final Report
CRC Final ReportCRC Final Report
CRC Final Report
 
O45018291
O45018291O45018291
O45018291
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
D017222226
D017222226D017222226
D017222226
 
OCR 's Functions
OCR 's FunctionsOCR 's Functions
OCR 's Functions
 
OCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy EnhancementOCR Algorithm Accuracy Enhancement
OCR Algorithm Accuracy Enhancement
 
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
How Intelligent Character Recognition (ICR) is Overcoming OCR Limitations in ...
 
Online Hand Written Character Recognition
Online Hand Written Character RecognitionOnline Hand Written Character Recognition
Online Hand Written Character Recognition
 
Bj35343348
Bj35343348Bj35343348
Bj35343348
 
50120130406005
5012013040600550120130406005
50120130406005
 
Ocr abstract
Ocr abstractOcr abstract
Ocr abstract
 
Telugu letters dataset and parallel deep convolutional neural network with a...
Telugu letters dataset and parallel deep convolutional neural  network with a...Telugu letters dataset and parallel deep convolutional neural  network with a...
Telugu letters dataset and parallel deep convolutional neural network with a...
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?
 
Automatic handwriting recognition
Automatic handwriting recognitionAutomatic handwriting recognition
Automatic handwriting recognition
 
OCR training dataset (1).docx
OCR training dataset (1).docxOCR training dataset (1).docx
OCR training dataset (1).docx
 
OCR, optical character reader
OCR, optical character readerOCR, optical character reader
OCR, optical character reader
 
IRJET- A Novel Approach – Automatic paper evaluation system
IRJET-  	  A Novel Approach – Automatic paper evaluation systemIRJET-  	  A Novel Approach – Automatic paper evaluation system
IRJET- A Novel Approach – Automatic paper evaluation system
 
Hardware to Software
Hardware to SoftwareHardware to Software
Hardware to Software
 

Recently uploaded

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 

Recently uploaded (20)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 

Optical Character Recognition( OCR )

  • 1. Optical Character Recognition ( OCR ) Karan Panjwani T.E – B , 68 Guided By : Prof. Shalini Wankhade
  • 2. Contents  Definition  Introduction To OCR  Problem Overview  Uses  Types  Steps in OCR  Accuracy  Software Implementation  Pros and Cons  Research
  • 3. Optical Character Recognition (OCR) is the mechanical or electronic conversion of images of typewritten or printed text into machine-encoded text. Definition
  • 4. Introduction to OCR  1 2 3 4 5 6 7 8 9 0
  • 5. Problem overview  Humans are bound to make errors – some time or the other – especially while performing mundane boring tasks like digitization or Security, continuously.  Many times we are unable to perceive certain digits due to various factors – motion, lack digit clarity, illumination and so on.  It is these problems which have lead us to delve into this topic.
  • 6. USES  It is widely used as a form of Data Entry from Printed Paper data records, whether Passport Documents, Invoices, Bank Statements, Business Card, Mail or Other Documents.  It is common method of Digitizing Printed Texts so that it can be Electronically edited, searched, stored more compactly, displayed on-line, and used in Machine Processes such as Machine Translation, Text-to-Speech, Key Data and Text Mining.
  • 7. TYPES 1) Optical Character Recognition ( OCR ) -  Targets typewritten text, one Glyph or Character at a time. 2) Optical Word Recognition ( OWR ) -  Targets typewritten text, one word at a time (for languages that use a space as a word divider). 3) Intelligent Character Recognition ( ICR ) –  Targets handwritten print script or cursive text one glyph or character at a time, usually involving machine learning.
  • 8. TYPES( contd…) 4) Intelligent Word Recognition ( IWR ) -  Targets handwritten print script or cursive text, one word at a time.  This is especially useful for languages where glyphs are not separated in cursive script.
  • 11. Pre - processing • Deals with Improving quality of the Image for better recognition by the system. OCR software often "pre-processes" images to improve the chances of successful recognition.  Techniques include: • De-Skew • Despeckle • Binarization • Line Removal • Zoning • Line and Word Detection • Script Recognition • Segmentation • Normalize Aspect Ratio and Scale
  • 12. Character Recognition  There are two basic types of core OCR algorithm, which may produce a ranked list of candidate characters. • Matrix matching involves comparing an image to a stored glyph on a pixel-by-pixel basis; it is also known as “pattern matching”. This relies on the input glyph being correctly isolated from the rest of the image, and on the stored glyph being in a similar font and at the same scale. This technique works best with typewritten text and does not work well when new fonts are encountered. • Feature extraction decomposes glyphs into “features” like lines, closed loops, line direction, and line intersections. Feature Extraction serves two purposes; one is to extract properties that can identify a character uniquely. Second is to extract properties that can differentiate between similar characters.
  • 13.
  • 14. Post - processing  OCR accuracy can be increased if the output is constrained by a lexicon – a list of words that are allowed to occur in a document. This might be, for example, all the words in the English language, or a more technical lexicon for a specific field. This technique can be problematic if the document contains words not in the lexicon, like proper nouns. Tesseract uses its dictionary to influence the character segmentation step, for improved accuracy.
  • 15. Accuracy  Recognition of Latin-script, typewritten text is still not 100% accurate even where clear imaging is available. One study based on recognition of 19th- and early 20th-century newspaper pages concluded that character- by-character OCR accuracy for commercial OCR software varied from 81% to 99%; total accuracy can be achieved by human review or Data Dictionary Authentication.  Other areas—including recognition of hand printing, cursive handwriting, and printed text in other scripts are still the subject of active research.
  • 16. Accuracy(contd..)  Accuracy rates can be measured in several ways, and how they are measured can greatly affect the reported accuracy rate.  For example, if word context (basically a lexicon of words) is not used to correct software finding non- existent words, a character error rate of 1% (99% accuracy) may result in an error rate of 5% (95% accuracy) or worse if the measurement is based on whether each whole word was recognized with no incorrect letters.
  • 17. Use of Freeocr software
  • 18.
  • 19.
  • 20. Pros and Cons  OCR reduces time for processing for processing data from large number of forms.  If done manually, may lead to human error and takes up much of the time.  Recognition of cursive text is an active area of research, with recognition rates even lower than that of hand- printed text.  Higher rates of recognition of general cursive script will likely not be possible without the use of contextual or grammatical information.
  • 21. Research  Recognition of cursive text is an active area of research, with recognition rates even lower than that of hand-printed text.  Higher rates of recognition of general cursive script will likely not be possible without the use of contextual or grammatical information.  For example, recognizing entire words from a dictionary is easier than trying to parse individual characters from script.
  • 22. Conclusion • OCR technology provides fast, automated data capture which can save considerable time and labour costs of organisations. • The system has its advantages such as Automation of mundane tasks, Less Time Complexity, Very Small Database and High Adaptability to untrained inputs with only a small number of features to calculate.
  • 23. References  INTERNET :  www.google.co.in  www.slideshare.net  http://www.ijsrp.org/research_paper_may2012/ijsrp- may-2012-68.pdf  en.wikipedia.org/wiki/Optical_character_recognition  BOOKS’ :  Character Recognition Systems by Mohamed Cheriet, Nawwaf, Cheng-lin, Ching Y