SlideShare a Scribd company logo
1 of 45
Introduction
       to
Computer Vision

ashishkhare@jkinstitute.org
goals of field of vision
• understand how animals represent and
  process information carried by light, by
  – measuring and modeling visual performance
    in humans and other animals
  – finding ways to build artificial visual systems
  – characterizing neural mechanisms that
• implement visual systems
  – apply this understanding to obtain medical,
    technological advances
processing of images in humans
• as a first approximation, rods and cones
  (sensory cells in the retina) represent image as
  large 2D array of light intensities
  – about 126 million sensory cells!
• this image representation is processed by brain
  enabling complex cognitive functions
  – recognize a familiar face or scene
  – disambiguate overlapping objects
  – read sloppy handwriting
• how does the brain do all of this? how might
  image processing be partitioned into subtasks?
image processing tasks of brain
• possible tasks:
  – extraction of contour (e.g. sharp light intensity
    changes in the image)
  – extraction of motion
  – identification of object parts
• still unclear: how are these integrated to
  enable us to extract meaning from what
  we see?
psychophysical experiments
• used to test hypotheses about how the
  brain/mind processes optical information
Examples
• Inattentional Blindness
• Change Blindness
• Figure-Ground Segregation
Inattentional Blindness
Mack & Rock (1998)
• Definition: the failure to see consciously, caused by
  lack of attention
• We can miss perceiving very obvious changes if we
  are not attending. Subjects do not consciously
  perceive features of the visual scene that they do
  not attend to.
• Subjects were engaged in tasks that demanded a
  high degree of attention, such as looking at a cross
  and trying to determine which arm is longer.
Trials 1
Trials 2
Trials 3
Inattention Trial
Recognition Test
Inattentional Blindness
• 25% of subjects failed to see the square when it was
  presented in the parafovea (2° from fixation).
• But 65% failed to see it when it was at fixation!
• What is missed?
   – Sad or Neutral face
   – A word (priming for the word is present)
• What is not missed?
   – Name
   – Smiling face
Change Blindness
• Change Blindness is the phenomena were
  we fail to perceive large changes, in our
  surroundings as well as in experimental
  conditions.
• Change could be in existence, properties,
  semantic identity and spatial layout.
• Attention is required to perceive change,
  and in the absence of localized motion
  signals, attention is directed by high level
  of interest (Rensink et al, 1997).
Flicker paradigm


• Basically, alternate an
  original image A with a
  modified image A’, with brief
  blank fields placed between
  successive images
Change Blindness
• “Visual perception of
  change in an object
  occurs only when that
  object is given focused
  attention”
• “In the absence of
  such attention, the
  contents of visual
  memory are simply
  overwritten by
  subsequent stimuli,
  and so cannot be used
  to make comparisons”
CB – Another Example
Why CB?
• Change blindness could be due to –
  – Poor representation of pre- and post change
    scene or
  – Pre change representation gets over-written
    by post change representation or
  – Capacity to retrieve and compare information
    is limited (Hollingworth, 2003).
• Color change detection with multi-element
  displays
Figure-Ground Segregation
• Discovered by Edgar
  Rubin (Fig.1, 1921).
• Only one side of the
  contour is seen as figure.
• Has shape, appears
  closer.
• Background appears
                               Fig.1 (Faces/vase display)
  behind the figure and has
  no shape.
Background
• Configural cues:
            – symmetry
            – convexity
            – area        ………….. Gestalt psychologists


• Lower region
•      lower region of a display will be seen as figure than the upper
  region.           (Vecera et al. 2002)

• Top-bottom polarity
•       Stimuli having wide base and narrow top were perceived as
  figures than the ones which had narrow base and wide top.
•                                 (Hulleman and Humphreys, 2004)
• Higher cognitive processes
                 » Object memory (Mary Peterson et al. 1991)
                 » Attention ( Vecera et al. 2004)
Motivation
• Palmer and his colleagues conducted a study in which
  they used temporal frequency (flicker) and manipulated
  edge synchrony with the two regions (left and right).
• They concluded from their studies that edge plays the
  key role in determining figure-ground segregation. In
  their study they found that the region with which the
  edge synchronizes will be seen as figure irrespective of
  whether the region is flickering or not.
• Wong and Weisstein (1987) demonstrated that spatial
  and temporal frequencies play a major role in assigning
  figural status to a region.
Reference Books
• Fundamentals of Digital Image Processing:
  Anil K. Jain
• Digital Image Processing: Gonzalez & Woods
• The Image Processing Handbook: J.C. Russ
• Digital Image Processing: B. Jahne

• Image Processing, Analysis and Machine
  Vision: M. Sonka, V. Hlavac, R. Boyle
• Computer Vision Handbook: B. Jahne
• Computer Vision: M. Brady

More Related Content

Viewers also liked

Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restorationAshish Kumar
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image ProcessingSahil Biswas
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentationHaitham Ahmed
 
Digital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsDigital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsMostafa G. M. Mostafa
 
Digitized images and
Digitized images andDigitized images and
Digitized images andAshish Kumar
 
Digital Image Processing: Image Enhancement in the Spatial Domain
Digital Image Processing: Image Enhancement in the Spatial DomainDigital Image Processing: Image Enhancement in the Spatial Domain
Digital Image Processing: Image Enhancement in the Spatial DomainMostafa G. M. Mostafa
 
Digital Image Processing: Image Enhancement in the Frequency Domain
Digital Image Processing: Image Enhancement in the Frequency DomainDigital Image Processing: Image Enhancement in the Frequency Domain
Digital Image Processing: Image Enhancement in the Frequency DomainMostafa G. M. Mostafa
 
Fuzzy c-means clustering for image segmentation
Fuzzy c-means  clustering for image segmentationFuzzy c-means  clustering for image segmentation
Fuzzy c-means clustering for image segmentationDharmesh Patel
 
Enhancement in frequency domain
Enhancement in frequency domainEnhancement in frequency domain
Enhancement in frequency domainAshish Kumar
 
Digital Image Processing: Image Segmentation
Digital Image Processing: Image SegmentationDigital Image Processing: Image Segmentation
Digital Image Processing: Image SegmentationMostafa G. M. Mostafa
 
Enhancement in spatial domain
Enhancement in spatial domainEnhancement in spatial domain
Enhancement in spatial domainAshish Kumar
 
Computer architecture
Computer architecture Computer architecture
Computer architecture Ashish Kumar
 
Image pre processing
Image pre processingImage pre processing
Image pre processingAshish Kumar
 

Viewers also liked (20)

Image pre processing-restoration
Image pre processing-restorationImage pre processing-restoration
Image pre processing-restoration
 
Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 
color
colorcolor
color
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
Digital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image FundamentalsDigital Image Processing: Digital Image Fundamentals
Digital Image Processing: Digital Image Fundamentals
 
Digitized images and
Digitized images andDigitized images and
Digitized images and
 
Image segmentation
Image segmentation Image segmentation
Image segmentation
 
Digital Image Processing: Image Enhancement in the Spatial Domain
Digital Image Processing: Image Enhancement in the Spatial DomainDigital Image Processing: Image Enhancement in the Spatial Domain
Digital Image Processing: Image Enhancement in the Spatial Domain
 
Digital Image Processing: Image Enhancement in the Frequency Domain
Digital Image Processing: Image Enhancement in the Frequency DomainDigital Image Processing: Image Enhancement in the Frequency Domain
Digital Image Processing: Image Enhancement in the Frequency Domain
 
Fuzzy c-means clustering for image segmentation
Fuzzy c-means  clustering for image segmentationFuzzy c-means  clustering for image segmentation
Fuzzy c-means clustering for image segmentation
 
Enhancement in frequency domain
Enhancement in frequency domainEnhancement in frequency domain
Enhancement in frequency domain
 
Image and Perception
Image and PerceptionImage and Perception
Image and Perception
 
Dip Image Segmentation
Dip Image SegmentationDip Image Segmentation
Dip Image Segmentation
 
Segmentation
SegmentationSegmentation
Segmentation
 
Digital Image Processing: Image Segmentation
Digital Image Processing: Image SegmentationDigital Image Processing: Image Segmentation
Digital Image Processing: Image Segmentation
 
Enhancement in spatial domain
Enhancement in spatial domainEnhancement in spatial domain
Enhancement in spatial domain
 
Computer architecture
Computer architecture Computer architecture
Computer architecture
 
Image pre processing
Image pre processingImage pre processing
Image pre processing
 
IMAGE SEGMENTATION.
IMAGE SEGMENTATION.IMAGE SEGMENTATION.
IMAGE SEGMENTATION.
 

Similar to Introduction vision

Chapter 6 pwrpt 1 1
Chapter 6  pwrpt 1 1Chapter 6  pwrpt 1 1
Chapter 6 pwrpt 1 1jamiesamel
 
Ch 5 Object Recognition.pptx
Ch 5 Object Recognition.pptxCh 5 Object Recognition.pptx
Ch 5 Object Recognition.pptxLarry195181
 
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1zukun
 
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...DevGAMM Conference
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015Jia-Bin Huang
 
Imagery and Cognitive Maps.pptx
Imagery and Cognitive Maps.pptxImagery and Cognitive Maps.pptx
Imagery and Cognitive Maps.pptxSritha Sandon
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1zukun
 
Visual perception concepts
Visual perception conceptsVisual perception concepts
Visual perception conceptsLopamudra Ghosh
 
Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualizationJan Aerts
 
Chapter 9 pwrpt
Chapter 9 pwrptChapter 9 pwrpt
Chapter 9 pwrptjamiesamel
 
Visual Composition Slideshow - Anna Rasmussen
Visual Composition Slideshow - Anna RasmussenVisual Composition Slideshow - Anna Rasmussen
Visual Composition Slideshow - Anna Rasmussenbluedevils
 
Perciving the world
Perciving the worldPerciving the world
Perciving the worldAmmar Faruki
 
17 june 29 Attention and consciousness @ University of Sydney
17 june 29 Attention and consciousness @ University of Sydney17 june 29 Attention and consciousness @ University of Sydney
17 june 29 Attention and consciousness @ University of SydneyNao (Naotsugu) Tsuchiya
 
DIGITAL IMAGE FUNDAS.ppt
DIGITAL IMAGE FUNDAS.pptDIGITAL IMAGE FUNDAS.ppt
DIGITAL IMAGE FUNDAS.pptHari M
 

Similar to Introduction vision (20)

Chapter 6 pwrpt 1 1
Chapter 6  pwrpt 1 1Chapter 6  pwrpt 1 1
Chapter 6 pwrpt 1 1
 
Ch 5 Object Recognition.pptx
Ch 5 Object Recognition.pptxCh 5 Object Recognition.pptx
Ch 5 Object Recognition.pptx
 
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
MIT6.870 Grounding Object Recognition and Scene Understanding: lecture 1
 
Depth & space
Depth & spaceDepth & space
Depth & space
 
Ventral stream
Ventral streamVentral stream
Ventral stream
 
Promising avenues for interdisciplinary research in vision
Promising avenues for interdisciplinary research in visionPromising avenues for interdisciplinary research in vision
Promising avenues for interdisciplinary research in vision
 
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...
Understanding Human Player: Attention, Perception and Motivation / Sergei Sav...
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Imagery and Cognitive Maps.pptx
Imagery and Cognitive Maps.pptxImagery and Cognitive Maps.pptx
Imagery and Cognitive Maps.pptx
 
CP wk 3
CP wk 3CP wk 3
CP wk 3
 
Psykofysik
PsykofysikPsykofysik
Psykofysik
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1
 
Visual perception concepts
Visual perception conceptsVisual perception concepts
Visual perception concepts
 
digitalimagefundamentals.ppt
digitalimagefundamentals.pptdigitalimagefundamentals.ppt
digitalimagefundamentals.ppt
 
Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualization
 
Chapter 9 pwrpt
Chapter 9 pwrptChapter 9 pwrpt
Chapter 9 pwrpt
 
Visual Composition Slideshow - Anna Rasmussen
Visual Composition Slideshow - Anna RasmussenVisual Composition Slideshow - Anna Rasmussen
Visual Composition Slideshow - Anna Rasmussen
 
Perciving the world
Perciving the worldPerciving the world
Perciving the world
 
17 june 29 Attention and consciousness @ University of Sydney
17 june 29 Attention and consciousness @ University of Sydney17 june 29 Attention and consciousness @ University of Sydney
17 june 29 Attention and consciousness @ University of Sydney
 
DIGITAL IMAGE FUNDAS.ppt
DIGITAL IMAGE FUNDAS.pptDIGITAL IMAGE FUNDAS.ppt
DIGITAL IMAGE FUNDAS.ppt
 

More from Ashish Kumar

Image pre processing - local processing
Image pre processing - local processingImage pre processing - local processing
Image pre processing - local processingAshish Kumar
 
process management
 process management process management
process managementAshish Kumar
 
resource management
  resource management  resource management
resource managementAshish Kumar
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memoryAshish Kumar
 
remote procedure calls
  remote procedure calls  remote procedure calls
remote procedure callsAshish Kumar
 
Introduction image processing
Introduction image processingIntroduction image processing
Introduction image processingAshish Kumar
 
video compression techique
video compression techiquevideo compression techique
video compression techiqueAshish Kumar
 
online game over cryptography
online game over cryptographyonline game over cryptography
online game over cryptographyAshish Kumar
 

More from Ashish Kumar (16)

Lecture2 color
Lecture2 colorLecture2 color
Lecture2 color
 
Image pre processing - local processing
Image pre processing - local processingImage pre processing - local processing
Image pre processing - local processing
 
Data structures
Data structuresData structures
Data structures
 
Lecture2 light
Lecture2 lightLecture2 light
Lecture2 light
 
process management
 process management process management
process management
 
resource management
  resource management  resource management
resource management
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
 
message passing
 message passing message passing
message passing
 
remote procedure calls
  remote procedure calls  remote procedure calls
remote procedure calls
 
Introduction image processing
Introduction image processingIntroduction image processing
Introduction image processing
 
system design
system designsystem design
system design
 
video compression techique
video compression techiquevideo compression techique
video compression techique
 
Compression
CompressionCompression
Compression
 
1.animation
1.animation1.animation
1.animation
 
online game over cryptography
online game over cryptographyonline game over cryptography
online game over cryptography
 
MHEG
MHEGMHEG
MHEG
 

Introduction vision

  • 1. Introduction to Computer Vision ashishkhare@jkinstitute.org
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. goals of field of vision • understand how animals represent and process information carried by light, by – measuring and modeling visual performance in humans and other animals – finding ways to build artificial visual systems – characterizing neural mechanisms that • implement visual systems – apply this understanding to obtain medical, technological advances
  • 25. processing of images in humans • as a first approximation, rods and cones (sensory cells in the retina) represent image as large 2D array of light intensities – about 126 million sensory cells! • this image representation is processed by brain enabling complex cognitive functions – recognize a familiar face or scene – disambiguate overlapping objects – read sloppy handwriting • how does the brain do all of this? how might image processing be partitioned into subtasks?
  • 26.
  • 27. image processing tasks of brain • possible tasks: – extraction of contour (e.g. sharp light intensity changes in the image) – extraction of motion – identification of object parts • still unclear: how are these integrated to enable us to extract meaning from what we see?
  • 28. psychophysical experiments • used to test hypotheses about how the brain/mind processes optical information
  • 29. Examples • Inattentional Blindness • Change Blindness • Figure-Ground Segregation
  • 30. Inattentional Blindness Mack & Rock (1998) • Definition: the failure to see consciously, caused by lack of attention • We can miss perceiving very obvious changes if we are not attending. Subjects do not consciously perceive features of the visual scene that they do not attend to. • Subjects were engaged in tasks that demanded a high degree of attention, such as looking at a cross and trying to determine which arm is longer.
  • 36. Inattentional Blindness • 25% of subjects failed to see the square when it was presented in the parafovea (2° from fixation). • But 65% failed to see it when it was at fixation! • What is missed? – Sad or Neutral face – A word (priming for the word is present) • What is not missed? – Name – Smiling face
  • 37. Change Blindness • Change Blindness is the phenomena were we fail to perceive large changes, in our surroundings as well as in experimental conditions. • Change could be in existence, properties, semantic identity and spatial layout. • Attention is required to perceive change, and in the absence of localized motion signals, attention is directed by high level of interest (Rensink et al, 1997).
  • 38. Flicker paradigm • Basically, alternate an original image A with a modified image A’, with brief blank fields placed between successive images
  • 39. Change Blindness • “Visual perception of change in an object occurs only when that object is given focused attention” • “In the absence of such attention, the contents of visual memory are simply overwritten by subsequent stimuli, and so cannot be used to make comparisons”
  • 40. CB – Another Example
  • 41. Why CB? • Change blindness could be due to – – Poor representation of pre- and post change scene or – Pre change representation gets over-written by post change representation or – Capacity to retrieve and compare information is limited (Hollingworth, 2003). • Color change detection with multi-element displays
  • 42. Figure-Ground Segregation • Discovered by Edgar Rubin (Fig.1, 1921). • Only one side of the contour is seen as figure. • Has shape, appears closer. • Background appears Fig.1 (Faces/vase display) behind the figure and has no shape.
  • 43. Background • Configural cues: – symmetry – convexity – area ………….. Gestalt psychologists • Lower region • lower region of a display will be seen as figure than the upper region. (Vecera et al. 2002) • Top-bottom polarity • Stimuli having wide base and narrow top were perceived as figures than the ones which had narrow base and wide top. • (Hulleman and Humphreys, 2004) • Higher cognitive processes » Object memory (Mary Peterson et al. 1991) » Attention ( Vecera et al. 2004)
  • 44. Motivation • Palmer and his colleagues conducted a study in which they used temporal frequency (flicker) and manipulated edge synchrony with the two regions (left and right). • They concluded from their studies that edge plays the key role in determining figure-ground segregation. In their study they found that the region with which the edge synchronizes will be seen as figure irrespective of whether the region is flickering or not. • Wong and Weisstein (1987) demonstrated that spatial and temporal frequencies play a major role in assigning figural status to a region.
  • 45. Reference Books • Fundamentals of Digital Image Processing: Anil K. Jain • Digital Image Processing: Gonzalez & Woods • The Image Processing Handbook: J.C. Russ • Digital Image Processing: B. Jahne • Image Processing, Analysis and Machine Vision: M. Sonka, V. Hlavac, R. Boyle • Computer Vision Handbook: B. Jahne • Computer Vision: M. Brady