SlideShare a Scribd company logo
1 of 20
Download to read offline
The 5-number summary,
     boxplots and outliers




9/4/2011                     Slide 1
The 5-number summary
•      An example 5-number summary:
                  5 12 14 17 21
•      5 is the minimum value in the set
•      12 is the first quartile
•      14 is the median
•      17 is the third quartile
•      21 is the maximum value
9/4/2011                        Slide 2
The 5-number summary
• Quartiles?
     – The 1st quartile is the point at which 25% of
       the data is below and 75% above.
     – The 2nd quartile (the MEDIAN) is the point at
       which 50% of the data is below and 50%
       above.
     – The 3rd quartile is the point at which 75% of
       the data is below and 25% above

9/4/2011                           Slide 3
The 5-number summary
•        Back to the example
                           5 12 14 17 21
•        So, if these are a scores on a 22 point quiz from a
         class…
     –     The lowest score in the class was 5 points
     –     25% of students earned 12 or fewer points (75% earned 12 or
           more)
     –     50% of students earned 14 or fewer points (50% earned 14 or
           more) – the median
     –     75% of students earned 17 or fewer points (25% earned 17 or
           more)
     –     The highest score in the class was 21 points

9/4/2011                                      Slide 4
The 5-number summary
• Finding the minimum and maximum is
  pretty simple
• Finding the median was discussed in
  lesson 1.6
• So to find the quartiles…




9/4/2011                  Slide 5
The 5-number summary
• Finding the quartiles
     – To find the 1st quartile, simply find the median
       of the lower half of the data (the lower half of
       the data does NOT include the median of the
       data).
     – To find the 3rd quartile, simply find the median
       of the upper half of the data (the upper half of
       the data does NOT include the median of the
       data).

9/4/2011                            Slide 6
The 5-number summary
                                                     2
• Example of finding
                                                     3    Median of lower
  the quartiles          Lower Half of Data          3    half is 1st
                                                          quartile = 3
                                                     5
                Median is (8+9)/2 or 8.5             8
                                                     9
                                                     9
                                                          Median of
                         Upper Half of Data          12   upper half is 3rd
                                                          quartile = 12
                                                     13
                                                     13



9/4/2011                                   Slide 7
The 5-number summary
• Find the 5-number         17.1        2.1
  summary of the            5.8         2.0
  following data (which
  are salaries (in          5.0         1.0
  millions) of an NBA       4.5         1.0
  team:
                            4.3         0.8
                            4.2         0.7
• Go to the next slide to
  check your work.          3.1         0.3
9/4/2011                      Slide 8
The 5-number summary
• Your answer should be:
                0.3 1.0 2.6 4.5 17.1

• Another measure of spread is found from the 5-number
  summary, the interquartile range (or IQR).
• The IQR is simply the 3rd quartile (Q3) minus the 1st
  quartile (Q1).
• So, IQR = Q3 – Q1
• This is simply a variation on the definition of range.



9/4/2011                             Slide 9
Outliers and extreme values
• The 17.1 million dollar salary is quite high.
• Is it an outlier among the data?

• Tukey’s rule: A data point is an outlier if it
  falls more than 1.5 IQR below the 1st
  quartile OR 1.5 IQR above the 3rd quartile.


9/4/2011                      Slide 10
Outliers and extreme values
• Recall the summary: 0.3 1.0 2.6 4.5 17.1
• The IQR = 4.5 – 1.0 = 3.5
• Check for the high outlier:
     –     Is 17.1 more than 1.5IQR above the 3rd quartile (4.5)?
     –     Is 17.1 > 1.5(3.5) + 4.5 ?
     –     Is 17.1 > 9.75 ?
     –     Yes, so the $17.1 million salary is an outlier on the
           team’s payroll.


9/4/2011                                   Slide 11
Boxplots
• The boxplot displays the 5-number
  summary: minimum, lower quartile (Q1),
  median upper quartile (Q3), maximum.
• It also shows the Inter-quartile range (IQR)
  and outliers.
• It also gives us information about the
  symmetry of the distribution.

9/4/2011                     Slide 12
Example Boxplot
                                                                   The largest
                                                                   observation
                                                                   (max) is
                                                                   approximately
                                                                   4.

The smallest
observation (min)
is approximately
0.


                    The first          The second             The third quartile
                    quartile (Q1) is   quartile (Q2) is       (Q3) is approximately
                    approximately      approximately          2.7.
  9/4/2011          1.                 1.6.        Slide 13
Example Boxplot (modified)
                    Outliers show up as
                    circles. In this case, it
                    is now the max.



                    This is the largest
                    observation that is
                    NOT an out outlier.
9/4/2011                  Slide 14
Boxplots
• Example
     – Verbal GMAT scores of 12 students: 10, 22, 24, 27,
       31, 33, 39, 40, 42, 43, 44, 45
     – The 5-number summary is:
                     10 25.5 36 42.5 45
     – Now the box-plot is constructed as follows:
           •   The line inside the box indicates the median.
           •   The left side of this box indicates the lower quartile (Q1).
           •   The right side of this box indicates the upper quartile (Q3).
           •   A straight line is then drawn from the lowest value of this
               distribution to the box (at Q1) and another straight line from
               the box (at Q3) to the highest value of this distribution.


9/4/2011                                            Slide 15
Boxplot




9/4/2011             Slide 16
Modified Boxplot
• Consider the 5-number summary of NBA
  salaries: 0.3 1.0 2.6 4.5 17.1
• The modified boxplot shows outliers
  separately.
• On the next slide, note how the outlier
  17.1 is plotted separately.
• The line only goes to the maximum (or
  minimum) point that is NOT an outlier.

9/4/2011                   Slide 17
Modified Boxplot




9/4/2011               Slide 18
Using Technology to Make
                   Boxplots
• Use StatCrunch
• On your calculator:
     – To create a boxplot:
           • Enter your list of data.
           • Now select STAT PLOT by pressing the 2nd key followed by
             the key labeled Y=.
           • Press the ENTER key, then select the option of ON, and select
             the boxplot type that shows outliers (as dots)
           • The Xlist should indicate the name of your list (L1 or other),
             and the Freq value should be set to 1.
           • Now press the ZOOM key and select option 9 (ZoomStat).
           • Press the ENTER key and the boxplot should be displayed.
           • You can use the TRACE key along with the arrow keys in order
             to read the values of the five number summary (Min, Q1, Med,
             Q3, Max).

9/4/2011                                         Slide 19
The 5-number summary, boxplots
           and outliers
• This concludes the presentation.




9/4/2011                    Slide 20

More Related Content

What's hot

Ace open ended rubric strategy
Ace open ended rubric strategyAce open ended rubric strategy
Ace open ended rubric strategylisa handline
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionMaria Theresa
 
Normal distribution slide share
Normal distribution slide shareNormal distribution slide share
Normal distribution slide shareKate FLR
 
Bcs 040 Descriptive Statistics
Bcs 040 Descriptive StatisticsBcs 040 Descriptive Statistics
Bcs 040 Descriptive StatisticsNarayan Thapa
 
11.4 Geometric Probability
11.4 Geometric Probability11.4 Geometric Probability
11.4 Geometric Probabilitysmiller5
 
Techniques of Integration ppt.ppt
Techniques of Integration ppt.pptTechniques of Integration ppt.ppt
Techniques of Integration ppt.pptJaysonFabela1
 
Simplex Method
Simplex MethodSimplex Method
Simplex MethodSachin MK
 
1-06 Even and Odd Functions Notes
1-06 Even and Odd Functions Notes1-06 Even and Odd Functions Notes
1-06 Even and Odd Functions Notesnechamkin
 
3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplotsLong Beach City College
 
Numerical analysis kuhn tucker eqn
Numerical analysis  kuhn tucker eqnNumerical analysis  kuhn tucker eqn
Numerical analysis kuhn tucker eqnSHAMJITH KM
 
Linear Algebra and Matrix
Linear Algebra and MatrixLinear Algebra and Matrix
Linear Algebra and Matrixitutor
 
Duality in Linear Programming Problem
Duality in Linear Programming ProblemDuality in Linear Programming Problem
Duality in Linear Programming ProblemRAVI PRASAD K.J.
 
Chebyshev's inequality
Chebyshev's inequalityChebyshev's inequality
Chebyshev's inequalityPradipPanda6
 

What's hot (20)

Ace open ended rubric strategy
Ace open ended rubric strategyAce open ended rubric strategy
Ace open ended rubric strategy
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Normal distribution slide share
Normal distribution slide shareNormal distribution slide share
Normal distribution slide share
 
Bcs 040 Descriptive Statistics
Bcs 040 Descriptive StatisticsBcs 040 Descriptive Statistics
Bcs 040 Descriptive Statistics
 
The integral
The integralThe integral
The integral
 
Two Proportions
Two Proportions  Two Proportions
Two Proportions
 
Concept of Duality
Concept of DualityConcept of Duality
Concept of Duality
 
Operations Research - The Dual Simplex Method
Operations Research - The Dual Simplex MethodOperations Research - The Dual Simplex Method
Operations Research - The Dual Simplex Method
 
11.4 Geometric Probability
11.4 Geometric Probability11.4 Geometric Probability
11.4 Geometric Probability
 
Techniques of Integration ppt.ppt
Techniques of Integration ppt.pptTechniques of Integration ppt.ppt
Techniques of Integration ppt.ppt
 
Simplex Method
Simplex MethodSimplex Method
Simplex Method
 
1-06 Even and Odd Functions Notes
1-06 Even and Odd Functions Notes1-06 Even and Odd Functions Notes
1-06 Even and Odd Functions Notes
 
Prime numbers
Prime numbersPrime numbers
Prime numbers
 
Quartile
QuartileQuartile
Quartile
 
3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots
 
Numerical analysis kuhn tucker eqn
Numerical analysis  kuhn tucker eqnNumerical analysis  kuhn tucker eqn
Numerical analysis kuhn tucker eqn
 
Linear Algebra and Matrix
Linear Algebra and MatrixLinear Algebra and Matrix
Linear Algebra and Matrix
 
Duality in Linear Programming Problem
Duality in Linear Programming ProblemDuality in Linear Programming Problem
Duality in Linear Programming Problem
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Chebyshev's inequality
Chebyshev's inequalityChebyshev's inequality
Chebyshev's inequality
 

Viewers also liked

Lesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number SummaryLesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number Summarychrismac47
 
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonFinding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonMoonie Kim
 
Finding the Mean from Dot Plot
Finding the Mean from Dot PlotFinding the Mean from Dot Plot
Finding the Mean from Dot PlotMoonie Kim
 
Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Moonie Kim
 
Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Moonie Kim
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power pointmanswag123
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile rangeKen Plummer
 
Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Moonie Kim
 
Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Moonie Kim
 
Further3 summarising univariate data
Further3  summarising univariate dataFurther3  summarising univariate data
Further3 summarising univariate datakmcmullen
 
Further4 box plots, 5 number summary and outliers
Further4  box plots, 5 number summary and outliersFurther4  box plots, 5 number summary and outliers
Further4 box plots, 5 number summary and outlierskmcmullen
 
Finding the Mean Introduction
Finding the Mean IntroductionFinding the Mean Introduction
Finding the Mean IntroductionMoonie Kim
 
Managing Investment in Employees Strategically
Managing Investment in Employees StrategicallyManaging Investment in Employees Strategically
Managing Investment in Employees StrategicallyPat Wright
 
Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent CBIZ, Inc.
 
Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plotsValPatton
 
Stem and leaf plot
Stem and leaf plotStem and leaf plot
Stem and leaf plotbbeiers
 

Viewers also liked (20)

Lesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number SummaryLesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number Summary
 
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonFinding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
 
Finding the Mean from Dot Plot
Finding the Mean from Dot PlotFinding the Mean from Dot Plot
Finding the Mean from Dot Plot
 
Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1
 
Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power point
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile range
 
Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1
 
Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2
 
HISTOGRAMS
HISTOGRAMSHISTOGRAMS
HISTOGRAMS
 
Further3 summarising univariate data
Further3  summarising univariate dataFurther3  summarising univariate data
Further3 summarising univariate data
 
Further4 box plots, 5 number summary and outliers
Further4  box plots, 5 number summary and outliersFurther4  box plots, 5 number summary and outliers
Further4 box plots, 5 number summary and outliers
 
Finding the Mean Introduction
Finding the Mean IntroductionFinding the Mean Introduction
Finding the Mean Introduction
 
Managing Investment in Employees Strategically
Managing Investment in Employees StrategicallyManaging Investment in Employees Strategically
Managing Investment in Employees Strategically
 
Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent
 
Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plots
 
Stem and leaf plot
Stem and leaf plotStem and leaf plot
Stem and leaf plot
 
Biostatics ppt
Biostatics pptBiostatics ppt
Biostatics ppt
 
Stats
StatsStats
Stats
 
Cost curve
Cost curveCost curve
Cost curve
 

Similar to 5-Number Summary, Boxplots and Outliers Analysis

Box and whisker plots with five number summary
Box and whisker plots with five number summaryBox and whisker plots with five number summary
Box and whisker plots with five number summaryLearnbay Datascience
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptSyedSaifUrRehman3
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxABCraftsman
 
Revisionf2
Revisionf2Revisionf2
Revisionf2wind12
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1TieeTiee
 
Rt graphical representation
Rt graphical representationRt graphical representation
Rt graphical representationRinchen
 
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxTSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxnanamonkton
 
Statistics Slides.pdf
Statistics Slides.pdfStatistics Slides.pdf
Statistics Slides.pdfYasirAli74993
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive MeasuresYesica Adicondro
 
ap_stat_1.3.ppt
ap_stat_1.3.pptap_stat_1.3.ppt
ap_stat_1.3.pptfghgjd
 
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxCHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxBeverlyAmoraSerada
 
De vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberDe vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberlenasour
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)CS, NcState
 
analytical representation of data
 analytical representation of data analytical representation of data
analytical representation of dataUnsa Shakir
 
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxmeasures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxRonnelLozano
 
De vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberDe vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberlenasour
 

Similar to 5-Number Summary, Boxplots and Outliers Analysis (20)

Bab 4.ppt
Bab 4.pptBab 4.ppt
Bab 4.ppt
 
Chap004
Chap004Chap004
Chap004
 
Chap004
Chap004Chap004
Chap004
 
Box and whisker plots with five number summary
Box and whisker plots with five number summaryBox and whisker plots with five number summary
Box and whisker plots with five number summary
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.ppt
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptx
 
Revisionf2
Revisionf2Revisionf2
Revisionf2
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1
 
Rt graphical representation
Rt graphical representationRt graphical representation
Rt graphical representation
 
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxTSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
 
Statistics Slides.pdf
Statistics Slides.pdfStatistics Slides.pdf
Statistics Slides.pdf
 
Chap004.ppt
Chap004.pptChap004.ppt
Chap004.ppt
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive Measures
 
ap_stat_1.3.ppt
ap_stat_1.3.pptap_stat_1.3.ppt
ap_stat_1.3.ppt
 
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxCHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
 
De vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberDe vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 november
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
analytical representation of data
 analytical representation of data analytical representation of data
analytical representation of data
 
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxmeasures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
 
De vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberDe vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 november
 

Recently uploaded

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 

Recently uploaded (20)

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 

5-Number Summary, Boxplots and Outliers Analysis

  • 1. The 5-number summary, boxplots and outliers 9/4/2011 Slide 1
  • 2. The 5-number summary • An example 5-number summary: 5 12 14 17 21 • 5 is the minimum value in the set • 12 is the first quartile • 14 is the median • 17 is the third quartile • 21 is the maximum value 9/4/2011 Slide 2
  • 3. The 5-number summary • Quartiles? – The 1st quartile is the point at which 25% of the data is below and 75% above. – The 2nd quartile (the MEDIAN) is the point at which 50% of the data is below and 50% above. – The 3rd quartile is the point at which 75% of the data is below and 25% above 9/4/2011 Slide 3
  • 4. The 5-number summary • Back to the example 5 12 14 17 21 • So, if these are a scores on a 22 point quiz from a class… – The lowest score in the class was 5 points – 25% of students earned 12 or fewer points (75% earned 12 or more) – 50% of students earned 14 or fewer points (50% earned 14 or more) – the median – 75% of students earned 17 or fewer points (25% earned 17 or more) – The highest score in the class was 21 points 9/4/2011 Slide 4
  • 5. The 5-number summary • Finding the minimum and maximum is pretty simple • Finding the median was discussed in lesson 1.6 • So to find the quartiles… 9/4/2011 Slide 5
  • 6. The 5-number summary • Finding the quartiles – To find the 1st quartile, simply find the median of the lower half of the data (the lower half of the data does NOT include the median of the data). – To find the 3rd quartile, simply find the median of the upper half of the data (the upper half of the data does NOT include the median of the data). 9/4/2011 Slide 6
  • 7. The 5-number summary 2 • Example of finding 3 Median of lower the quartiles Lower Half of Data 3 half is 1st quartile = 3 5 Median is (8+9)/2 or 8.5 8 9 9 Median of Upper Half of Data 12 upper half is 3rd quartile = 12 13 13 9/4/2011 Slide 7
  • 8. The 5-number summary • Find the 5-number 17.1 2.1 summary of the 5.8 2.0 following data (which are salaries (in 5.0 1.0 millions) of an NBA 4.5 1.0 team: 4.3 0.8 4.2 0.7 • Go to the next slide to check your work. 3.1 0.3 9/4/2011 Slide 8
  • 9. The 5-number summary • Your answer should be: 0.3 1.0 2.6 4.5 17.1 • Another measure of spread is found from the 5-number summary, the interquartile range (or IQR). • The IQR is simply the 3rd quartile (Q3) minus the 1st quartile (Q1). • So, IQR = Q3 – Q1 • This is simply a variation on the definition of range. 9/4/2011 Slide 9
  • 10. Outliers and extreme values • The 17.1 million dollar salary is quite high. • Is it an outlier among the data? • Tukey’s rule: A data point is an outlier if it falls more than 1.5 IQR below the 1st quartile OR 1.5 IQR above the 3rd quartile. 9/4/2011 Slide 10
  • 11. Outliers and extreme values • Recall the summary: 0.3 1.0 2.6 4.5 17.1 • The IQR = 4.5 – 1.0 = 3.5 • Check for the high outlier: – Is 17.1 more than 1.5IQR above the 3rd quartile (4.5)? – Is 17.1 > 1.5(3.5) + 4.5 ? – Is 17.1 > 9.75 ? – Yes, so the $17.1 million salary is an outlier on the team’s payroll. 9/4/2011 Slide 11
  • 12. Boxplots • The boxplot displays the 5-number summary: minimum, lower quartile (Q1), median upper quartile (Q3), maximum. • It also shows the Inter-quartile range (IQR) and outliers. • It also gives us information about the symmetry of the distribution. 9/4/2011 Slide 12
  • 13. Example Boxplot The largest observation (max) is approximately 4. The smallest observation (min) is approximately 0. The first The second The third quartile quartile (Q1) is quartile (Q2) is (Q3) is approximately approximately approximately 2.7. 9/4/2011 1. 1.6. Slide 13
  • 14. Example Boxplot (modified) Outliers show up as circles. In this case, it is now the max. This is the largest observation that is NOT an out outlier. 9/4/2011 Slide 14
  • 15. Boxplots • Example – Verbal GMAT scores of 12 students: 10, 22, 24, 27, 31, 33, 39, 40, 42, 43, 44, 45 – The 5-number summary is: 10 25.5 36 42.5 45 – Now the box-plot is constructed as follows: • The line inside the box indicates the median. • The left side of this box indicates the lower quartile (Q1). • The right side of this box indicates the upper quartile (Q3). • A straight line is then drawn from the lowest value of this distribution to the box (at Q1) and another straight line from the box (at Q3) to the highest value of this distribution. 9/4/2011 Slide 15
  • 16. Boxplot 9/4/2011 Slide 16
  • 17. Modified Boxplot • Consider the 5-number summary of NBA salaries: 0.3 1.0 2.6 4.5 17.1 • The modified boxplot shows outliers separately. • On the next slide, note how the outlier 17.1 is plotted separately. • The line only goes to the maximum (or minimum) point that is NOT an outlier. 9/4/2011 Slide 17
  • 19. Using Technology to Make Boxplots • Use StatCrunch • On your calculator: – To create a boxplot: • Enter your list of data. • Now select STAT PLOT by pressing the 2nd key followed by the key labeled Y=. • Press the ENTER key, then select the option of ON, and select the boxplot type that shows outliers (as dots) • The Xlist should indicate the name of your list (L1 or other), and the Freq value should be set to 1. • Now press the ZOOM key and select option 9 (ZoomStat). • Press the ENTER key and the boxplot should be displayed. • You can use the TRACE key along with the arrow keys in order to read the values of the five number summary (Min, Q1, Med, Q3, Max). 9/4/2011 Slide 19
  • 20. The 5-number summary, boxplots and outliers • This concludes the presentation. 9/4/2011 Slide 20