Definitions Typologies Good vs bad Tables Principles Before After Visual perception An example What to remember Référ
Data Visualization for
Data Science
Principles in action
Christophe Bontemps
Toulouse School of Economics, INRA
MY JOB
WHY AM I HERE ?
From Huff (1993)
BEFORE WE START
Let’s do a simple exercise (from Buja et al. (2009))
THE “VISUAL PERCEPTION” OF A GRAPHIC
(source : Buja et al. (2009))
“VISUAL PERCEPTION” AS A STATISTICAL TEST
“The human eye acts as a broad feature detector and general
statistical test”. Buja et al. (2009)
Test : H0 : {There is "nothing"} = {No relation}
H1 : {There is "something"} = {There is some relation
(correlation, linearity, heterogeneity, groups...)}
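One way to turn this visual test into a procedure is a "lineup" in the spirit of Buja et al. (2009): the plot of the observed data is hidden among plots of data simulated under H0, and a viewer is asked to pick the odd one out. The sketch below only builds the data for such a lineup. The function name `make_lineup`, the choice of 20 panels, the use of a permutation of y to simulate "no relation", and the example data are all assumptions made for illustration, not something taken from the slides.

```python
import random


def make_lineup(x, y, n_panels=20, seed=0):
    """Build the data for a lineup of n_panels scatter plots.

    One panel holds the observed (x, y) pairs. Every other panel pairs x
    with a random permutation of y, which is one simple way to simulate
    H0 ("no relation") while keeping both marginal distributions.
    Returns (panels, true_position).
    """
    rng = random.Random(seed)
    true_position = rng.randrange(n_panels)
    panels = []
    for k in range(n_panels):
        if k == true_position:
            panels.append(list(zip(x, y)))
        else:
            shuffled = list(y)
            rng.shuffle(shuffled)
            panels.append(list(zip(x, shuffled)))
    return panels, true_position


# Hypothetical data: a linear relation plus Gaussian noise.
noise = random.Random(1)
x = [i / 10 for i in range(50)]
y = [2.0 * xi + noise.gauss(0, 1) for xi in x]

panels, true_position = make_lineup(x, y)

# If a viewer who has never seen the data singles out the true panel,
# H0 is rejected at level 1 / n_panels (here 1 / 20).
p_value_bound = 1 / len(panels)
```

Each panel can then be drawn with any plotting tool; only the position of the real data needs to stay hidden from the viewer.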
“VISUAL PERCEPTION” AS A COMPARISON
What do you see here ?
Difficult to see the maximum/minimum of each curve...
Idea shared by Gelman (2004) and Munzner (2014)
WHAT IS DATA VISUALIZATION ?
It is a representation, a function of the data
A statistic, too, is a function or a summary of the data
So, it is a sort of statistic
It can be descriptive or inferential
Two or multi-dimensional
Static or dynamic
Informative or not
Misleading or accurately representing the data
Beautiful or ugly....
WHAT IS DATA VISUALIZATION ?
For Tukey (1977) “The greatest value of a picture is when it
forces us to notice what we never expected to see”
Cleveland (1994) says that “graphical methods and
techniques are powerful tools for showing the structure of
data. The material is relevant for data analysis, when the
analyst wants to study data, and for data communication,
when the analyst wants to communicate data to others”
Bertin (2005) (translated in Bertin (1983)) defines it as a
"visual language" and, as such, with a semiology, i.e. with
a theory of the functions of signs and symbols.
Tufte (2001) : “Graphics are instruments for reasoning
about quantitative information. Often the most effective
way to describe, explore and summarize a set of numbers
- even a large set - is to look at pictures of those numbers”
SO WHAT ?
Data visualisation serves different purposes :
Exploratory data analysis
Statistical questioning of data patterns
Visual display of information for communication
Tool for interacting with data
2 TYPES OF GRAPHICS :
THOSE IMMEDIATE TO UNDERSTAND
FIGURE – Seen on HK-TV
2 TYPES OF GRAPHICS :
THOSE IMMEDIATE TO UNDERSTAND
FIGURE – Where do people run in Paris (N. Yau)
source :
http://flowingdata.com/2014/02/05/where-people-run/
2 TYPES OF GRAPHICS :
THOSE IMMEDIATE TO UNDERSTAND
FIGURE – Climate forecast uncertainty (S. Planton)
... AND THOSE NOT UNDERSTOOD IMMEDIATELY :
FIGURE – (Dynamic) Parallel Coordinates Plot comparing 5 indicators
for 3 countries (Sweden, Nigeria and Germany).
source :
http://ncva.itn.liu.se/education-geovisual-analytics/parallel-c
... AND THOSE NOT UNDERSTOOD IMMEDIATELY :
FIGURE – Pagerank Algorithm Reveals World’s All-Time Top Soccer
Team (MIT Review, March 2015)
... AND THOSE NOT UNDERSTOOD IMMEDIATELY :
FIGURE – How people spend their days (NYT).
“GOOD” OR “BAD” GRAPHICS ?
“There are no “good” or “bad” graphics (...), there are graphics
answering legitimate questions and graphics that do not answer
any question at all”
Bertin (1981)
FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
FIGURE – Charles Minard’s (1869) chart showing the number of men
in Napoleon’s 1812 Russian campaign army, their movements, as
well as the temperature they encountered on the return path.
FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
FIGURE – London Cholera Map - John Snow (1854)
FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
FIGURE – War Mortality - Florence Nightingale (1855) found that
zymotic diseases (blue) caused more deaths than wounds and injuries.
FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
Same data with “modern” visualisation tools. Gelman and
Unwin (2011)
FIGURE – War Mortality - Florence Nightingale (1855) redrawn by
Gelman and Unwin (2011).
FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
FIGURE – Visualizing 5 dimensions : Gapminder (Hans Rosling)
SO WHAT ARE THE RULES ?
Can you name some rules for a good (resp. bad) graphic ?
Your turn !
Axis and scale (starting at zero !) ?
Context ?
No multiple scales ?
Colors ?
YOUR TURN : WHAT’S WRONG WITH THIS GRAPHIC ?
BANANA SALES HAVE INCREASED !
FIGURE – from A. Dix example of interactive bar chart
WHAT’S WRONG WITH THIS GRAPHIC ?
FIGURE – Government spending "Skyrocketing". Tufte (2001) from
Playfair (1786).
SCALES ARE MISLEADING !
FIGURE – Government spending "Skyrocketing" (revisited). Tufte
(2001) from Playfair (1786).
WHAT’S WRONG WITH THIS GRAPHIC ? (HARDER)
FIGURE – Major Cause of Disability - 1975-2010 (J. Schwabish, 2014).
Do you remember a damn thing from this graph ?
(SMALL) MULTIPLE GRAPHS ARE OFTEN BETTER
FIGURE – Major Cause of Disability- 1975-2010 (J. Schwabish).
Cf. "brushing" (ex : for parallel Coordinates plots)
KEEP ALL YOUR AUDIENCE
Normal →
Color-blind →
WHICH MEANS THAT FOR 5 % OF MEN :
See also the ggplot option + scale_colour_colorblind()
DATA VISUALISATION IS USED FOR TWO MAIN
PURPOSES
Data exploration
Graphs as visual tests, comparisons (short time to build
and to read)
Data representation
Summaries, storytelling (long time to build, short time to
read)
The problem is that :
“Communicating implies simplification ;
data exploration implies exhaustiveness”
TABLES VS GRAPHICS ?
Several papers have discussed the issue : Gelman et al. (2002),
Gelman (2011) and Friendly and Kwan (2012).
Here, descriptive statistics of continuous variables.
TABLES VS GRAPHICS ?
Graph version of the table. From Gelman (2011)
GRAPHICS REVEAL DATA : ANSCOMBE (1973) QUARTET
We use here 4 pairs of variables : (X1, Y1), (X2, Y2),
(X3, Y3) and (X4, Y4). All four data sets have (almost) the same
descriptive statistics.

Xs   Mean   Std. Dev.   Ys   Mean   Std. Dev.   corr(Xi, Yi)   N
X1   9      3.32        Y1   7.5    2.03        0.8164         11
X2   9      3.32        Y2   7.5    2.03        0.8162         11
X3   9      3.32        Y3   7.5    2.03        0.8163         11
X4   9      3.32        Y4   7.5    2.03        0.8165         11
ANSCOMBE (1973) QUARTET
All four data sets are described by the same linear model
($Y_i = \alpha + \beta X_i + \varepsilon_i$), apparently revealing the same
relationships :

Dependent variable :   Y1          Y2          Y3          Y4
Regressed on :
Xi, i=1,...,4          0.500∗∗∗    0.500∗∗∗    0.500∗∗∗    0.500∗∗∗
Constant               3.000∗∗     3.001∗∗     3.002∗∗     3.002∗∗
R2                     0.667       0.666       0.666       0.667
Resid Std. Error       1.237       1.237       1.236       1.236
F Statistic            17.990∗∗∗   17.966∗∗∗   17.972∗∗∗   18.003∗∗∗

Note : Data from Anscombe (1973). ∗ p < 0.1 ; ∗∗ p < 0.05 ; ∗∗∗ p < 0.01
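The near-identical summaries can be checked numerically. The sketch below recomputes the means, standard deviations, correlations and least-squares coefficients for the four data sets using only the standard library. The data values are the quartet as commonly reproduced from Anscombe (1973); the helper name `summarize` and the dictionary layout are illustrative choices, not part of the slides.

```python
from math import sqrt

# Anscombe's quartet as commonly reproduced from Anscombe (1973).
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Descriptive statistics and simple least-squares fit of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "sd_x": sqrt(sxx / (n - 1)),   # sample standard deviation
        "mean_y": mean_y,
        "sd_y": sqrt(syy / (n - 1)),
        "corr": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


summaries = {k: summarize(x, y) for k, (x, y) in quartet.items()}
for k, s in summaries.items():
    print(k, {name: round(value, 3) for name, value in s.items()})
```

The four printed rows agree to about two decimal places, which is the point of the example: the numbers alone cannot distinguish the four data sets.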
ANSCOMBE (1973) QUARTET
A simple scatter plot (regression overlaid) shows something
very different.
[Figure: four scatter plots with the fitted line overlaid, one per data set: Regression of Y1 on X1, Y2 on X2, Y3 on X3 and Y4 on X4 (each with constant)]
ANSCOMBE (1973) QUARTET
NB : Plots of the residuals also show the same differences
[Figure: four Residual vs Fitted plots (residuals against fitted values), one per data set]
TABLES AND MATRICES
Data with many 0/1 variables (indicators for towns)
Bertin (1981)
AND IN MANY DIMENSIONS ?
TABLES AND MATRICES
From Munzner (2014)
REGRESSION TABLES ARE GRAPHICS !
(Mod. 1) (Mod. 2)
Special Special
i_under18 -0.0692∗ -0.119∗∗∗
(-2.25) (-3.57)
log_income 0.116∗∗∗ 0.102∗∗∗
(4.31) (3.51)
i_car 0.00131 -0.112∗
(0.03) (-2.00)
b08_locenv_water 0.0624∗∗∗ 0.0583∗∗
(4.99) (4.28)
i_can 0.710∗∗∗
(23.27)
Constant -1.467∗∗∗ -0.961∗∗
(-5.38) (-3.24)
Classical "visualisation" of regressions
REGRESSION TABLES ARE GRAPHICS !
(Mod. 1) (Mod. 2)
Special Special
i_under18 -0.0692 -0.119
(-2.25) (-3.57)
log_income 0.116 0.102
(4.31) (3.51)
i_car 0.00131 -0.112
(0.03) (-2.00)
b08_locenv_water 0.0624 0.0583
(4.99) (4.28)
i_can 0.710
(23.27)
Constant -1.467 -0.961
(-5.38) (-3.24)
Stars are used as preattentive visual variables !
REGRESSION AS A GRAPHIC
GOOD GRAPHICS ?
In the excellent Handbook of Data Visualization (Chen et al.
(2007)), we find some good questions :
What to Whom, How and Why ?
A graphic may be linked to three pieces of text : its caption, a
headline and an article it accompanies. Ideally, all three should
be consistent and complement each other.
Present or explore data ?
Different purpose, different requirements !
Choice of Graphical form ?
Choice depends on the type of data to be displayed (e.g.
univariate continuous data, bivariate categorical data, etc..) and
on what is to be shown.
Unique solution ?
There is not always a unique optimal choice and alternatives can
be equally good or good in different ways, emphasizing different
aspects of the same data.
EDWARD R. TUFTE’S RULES
In his seminal book, Tufte (2001) proposes some principles for
displaying quantitative information.
Data : Above all, show the data
Question : Induce the viewer to think about the substance
rather than about methodology or graphic design. Encourage the
eye to compare different pieces of data.
Data-ink ratio : Maximize the data-ink ratio. Erase all non-data
ink and erase redundant information.
Integrity : Avoid distorting what the data have to say
General to specific : Reveal the data at different levels of
detail (from broad picture to fine structure)
Context : Graphical display should be closely integrated with
the statistical and verbal descriptions of the data set.
PRACTICAL EXAMPLE : DATA-INK RATIO
Let’s start with a classical graph (R default - Boxplot)
[Figure: boxplots of Response (y axis, 98 to 112) by Group (x axis, g1 to g5)]
FIGURE – Distribution of a continuous variable on 5 groups
ERASE ALL NON DATA INK
[Figure: the same plot (Response by Group, groups 1 to 5) after removing non-data ink]
FIGURE – Distribution of a continuous variable on 5 groups
ERASE ALL REDUNDANT !
[Figure: the same plot (Response by Group, groups 1 to 5) after removing redundant elements]
FIGURE – Distribution of a continuous variable on 5 groups
GOING FURTHER...
[Figure: the same plot (Response by Group, groups 1 to 5), simplified further]
FIGURE – Distribution of a continuous variable on 5 groups
AND SHOW THE DATA...
[Figure: the same plot (Response by Group) with the median of each group printed: 101.0, 100.0, 101.0, 103.8 and 109.1 for groups 1 to 5]
FIGURE – Distribution of a continuous variable on 5 groups
HAVE WE LOST SOMETHING ?
[Figure: the default boxplot (Response by Group, g1 to g5) shown together with the minimal version and its printed medians: 101.0, 100.0, 101.0, 103.8 and 109.1]
FIGURE – Distribution of a continuous variable on 5 groups
Did you notice that group 1 and group 3 have the same median
(101.0) ? See the ggplot theme + theme_tufte()
INTEGRITY : THE LIE FACTOR
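\[
\text{Lie Factor} = \frac{\text{size of effect shown in graphic}}{\text{size of effect in data}} \tag{1}
\]
A Lie Factor different from 1 indicates a distortion ; Tufte (2001)
treats values above 1.05 or below 0.95 as substantial distortion.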
FIGURE – Fuel economy standards. (E. Tufte - from NY Times 1978)
FIGURE – Fuel economy standards (revisited)
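The "18 mpg" line measures 1.5 cm (in 1978) ; the "27.5 mpg"
line measures 13 cm (in 1985). The graphic shows an increase of
(13 − 1.5)/1.5 ≈ 767 %, the data an increase of (27.5 − 18)/18 ≈ 53 %.
−→ Lie factor ≈ 14.5 ! ! !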
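The arithmetic behind this figure can be written out as a short sketch. It takes the "size of effect" to be the relative change between the first and second value, which is how Tufte computes it for this example; the function names `relative_change` and `lie_factor` are illustrative, and the inputs are the line lengths and mpg values quoted above.

```python
def relative_change(first, second):
    """Relative change from first to second, e.g. 0.5 for a 50 % increase."""
    return (second - first) / first


def lie_factor(graphic_first, graphic_second, data_first, data_second):
    """Ratio of the effect shown in the graphic to the effect in the data."""
    shown = relative_change(graphic_first, graphic_second)
    actual = relative_change(data_first, data_second)
    return shown / actual


# Line lengths in cm (1978 and 1985) against the standards in mpg.
factor = lie_factor(1.5, 13.0, 18.0, 27.5)
print(round(factor, 1))  # prints 14.5
```

A faithful graphic would give a value close to 1, because the drawn lengths would then grow in the same proportion as the mpg figures.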
BERTIN’S APPROACH : A VISUAL LANGUAGE
If graphs are used to communicate, they are a form of language.
Any language has a grammar, “words” and logic. Let us study
the science that deals with signs or sign language : semiology.
TABLE – Bertin’s definition of 8 visual variables
Position (x, y)
Size
Value
Texture
Colour
Orientation
Shape
THESE VARIABLES SERVE DIFFERENT GOALS
Visual variable syntactics designate each visual variable as
suited or not to each level of measurement :
equivalence, differences, order, proportions.
Variable suited for :
Position (x, y) = O ∝
Size = O ∝
Value = O ∝
Texture = O
Colour =
Orientation =
Shape ≡
≡ : Equivalence, = : Differences, O : Order, ∝ : Proportions
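The table above can be read as a lookup structure. The sketch below transcribes it as printed on the slide into a Python dictionary (the level names follow the legend) ; the helper name `variables_suited_for` is illustrative.

```python
# Transcription of the table above: visual variable -> levels it is suited for.
SUITED_FOR = {
    "Position (x, y)": {"differences", "order", "proportions"},
    "Size": {"differences", "order", "proportions"},
    "Value": {"differences", "order", "proportions"},
    "Texture": {"differences", "order"},
    "Colour": {"differences"},
    "Orientation": {"differences"},
    "Shape": {"equivalence"},
}


def variables_suited_for(level):
    """List the visual variables marked as suited for `level`, in table order."""
    return [name for name, levels in SUITED_FOR.items() if level in levels]


print(variables_suited_for("proportions"))
print(variables_suited_for("order"))
```

Asking for "proportions" returns position, size and value only, which is the point of the two map examples that follow.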
EXAMPLE : SHAPE IS NOT SUITABLE FOR
PROPORTIONALITY
Price of land in the East of France Bertin (1970)
EXAMPLE : SIZE IS SUITABLE FOR PROPORTIONALITY
Price of land in the East of France Bertin (1970)
A NOTE ON COLORS
“Colors” are not suited for ordering !
Try putting the following hues in order from low to high.
A NOTE ON COLORS
These colors are easy to order from low to high.
Few (2008) provides meaningful solutions for choosing palettes
of colours, for example for heatmaps.
See also the ggplot theme theme_few()
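One common way to obtain colours that are easy to order is to keep a single hue and vary only the lightness. The sketch below illustrates that general idea with the Python standard library ; it is not Few's palette, and the function names and default values are assumptions made for the example.

```python
import colorsys


def sequential_palette(hue, n, light_from=0.9, light_to=0.3, saturation=0.6):
    """Return n hex colours of one hue, ordered from light (low) to dark (high)."""
    colours = []
    for i in range(n):
        t = i / (n - 1) if n > 1 else 0.0
        lightness = light_from + t * (light_to - light_from)
        r, g, b = colorsys.hls_to_rgb(hue, lightness, saturation)
        colours.append("#{:02x}{:02x}{:02x}".format(
            round(r * 255), round(g * 255), round(b * 255)))
    return colours


def lightness_of(hex_colour):
    """HLS lightness (0 = black, 1 = white) of a '#rrggbb' colour."""
    r, g, b = (int(hex_colour[i:i + 2], 16) / 255 for i in (1, 3, 5))
    return colorsys.rgb_to_hls(r, g, b)[1]


palette = sequential_palette(hue=0.6, n=5)  # five blues, light to dark
print(palette)
print([round(lightness_of(c), 2) for c in palette])
```

The lightness values decrease steadily along the palette, which gives the eye a natural order ; a set of different hues at similar lightness offers no such order.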
A NOTE ON COLORS (FINAL)
Colors are sometimes a graphic puzzle Tufte (2001).
Your eyes will go back and forth from the graph to the legend...
(source : http://viz.wtf/image/135265269618)
CONJUNCTION OF COLOURS AND PROPORTIONALITY
Productivity of Airlines
(Demo with googleVis)
FLASH QUIZ :
If 100% of the US prisoners are represented by the big
square...what is the percentage for each group ?
FIGURE – Ethnic composition of prisoners in jail in 2008 in the USA.
(Le Monde 5/12/2014)
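Why the quiz is hard : when a share is encoded by the area of a square, the side of the square grows only with the square root of the share. A minimal sketch in Python, with an illustrative function name and made-up shares (not the figures of the chart) :

```python
import math


def square_side_for_share(share, full_side=1.0):
    """Side of a square whose AREA is `share` of a square of side `full_side`."""
    if not 0.0 <= share <= 1.0:
        raise ValueError("share must be between 0 and 1")
    return full_side * math.sqrt(share)


# Made-up shares, for illustration only.
for share in (0.10, 0.25, 0.50):
    side = square_side_for_share(share)
    print(f"share {share:.0%} -> side {side:.0%} of the big square")
```

A group with a quarter of the total is drawn with a side half as long as the big square, so a reader who compares sides instead of areas overestimates small groups.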
NOT SO SIMPLE...
VERIFICATION
OR...
IT MATTERS BECAUSE MANY HIGH-DIMENSIONAL
VISUALISATIONS USE AREA...
Spinograms
A spinogram is area-proportional just like the histogram, but
allows a non-linear x-axis and thus can make all boxes of equal
height. Theus and Urbanek (2009)
MOSAIC PLOTS
Step 1 of the construction of a mosaic plot (Similar to spineplot
here). All surviving passengers are highlighted in all plots.
Theus and Urbanek (2009)
Step 2 of the construction of a mosaic plot. Conditioning on
Age. Theus and Urbanek (2009)
Step 3 of the construction of a mosaic plot. Conditioning on Age
and Gender. Theus and Urbanek (2009)
Final step of the construction of a mosaic plot. Explicit mention
of Survived as highlighted. Theus and Urbanek (2009)
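The construction steps above rely on one rule : split the unit square by the shares of the first variable, then split each slice by the conditional shares of the next one, so that every tile has an area proportional to its count. A minimal sketch for two variables in Python ; the function name and the counts are made up for illustration (these are not the Titanic counts).

```python
def mosaic_tiles(counts):
    """Map each (a, b) cell to a tile (x, y, width, height) in the unit square.

    Widths follow the marginal shares of `a`; heights follow the shares of
    `b` conditional on `a`, so each tile's area equals count / total.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    a_levels = []
    for a, _ in counts:
        if a not in a_levels:
            a_levels.append(a)
    tiles = {}
    x = 0.0
    for a in a_levels:
        column = {b: n for (aa, b), n in counts.items() if aa == a}
        column_total = sum(column.values())
        width = column_total / total
        y = 0.0
        for b, n in column.items():
            height = n / column_total if column_total else 0.0
            tiles[(a, b)] = (x, y, width, height)
            y += height
        x += width
    return tiles


# Hypothetical counts: (age group, survived) -> number of passengers.
counts = {("child", "yes"): 30, ("child", "no"): 20,
          ("adult", "yes"): 50, ("adult", "no"): 100}
for cell, (x, y, w, h) in mosaic_tiles(counts).items():
    print(cell, round(w, 3), round(h, 3), "area", round(w * h, 3))
```

Conditioning on a further variable repeats the same split inside each tile, alternating the direction of the cut.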
SCHWABISH (JEP, 2014) BEFORE-AFTER
FIGURE – An Unbalanced Chart - Original
FIGURE – An Unbalanced Chart - Revised
FIGURE – A Clutterplot Example - Original
FIGURE – A Clutterplot Example - Revised
“GOOD” OR “BAD” GRAPHICS ?
“There are no “good” or “bad” graphics (...), there are graphics
answering legitimate questions and graphics that do not answer
any question at all”
Bertin (1981)
It is easy to criticize ... but are there some rules ?
A NOTE ON PERCEPTION
A bird (Duck, Toucan ?) on the X axis, a rabbit on the Y axis !
Source : http://flowingdata.com/2014/06/25/duck-vs-rabbit-plot/
“PREATTENTIVE” VARIABLES
How many "3" in that sequence ? (from Ware (2012))
AND NOW...
Find the red dot !
TEST : FIND THE RED DOT !
HARDER : IS THERE A "STRANGER" ?
THAT WASN’T EASY
Preattentive concept, Treisman (1985) and Healey (2007)
Some visual elements or patterns are detected immediately
But there may be interference (between colour and form)
Very useful (detection, explanatory and presentation)
Helpful to highlight a message !
TOO MUCH VARIATION DOESN’T HELP
From Ware (2012)
MOST PREATTENTIVE VISUAL VARIABLES
From Ware (2012)
VISUAL PERCEPTION AND PIE CHARTS
https://twitter.com/freakonometrics/status/6127423301609512
VISUAL PERCEPTION AND LINES
When was the biggest negative (positive) difference ?
From Cairo (2012)
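The eye struggles to read the vertical gap between two curves, so one way to answer the question is to compute (and plot) the difference itself. A minimal sketch in Python with made-up series ; the function name and the values are illustrative and are not taken from Cairo (2012).

```python
def difference_extremes(years, series_a, series_b):
    """Return ((year, min diff), (year, max diff)) for series_a - series_b."""
    if not years or not (len(years) == len(series_a) == len(series_b)):
        raise ValueError("inputs must be non-empty and of equal length")
    diffs = [a - b for a, b in zip(series_a, series_b)]
    lowest = min(range(len(diffs)), key=diffs.__getitem__)
    highest = max(range(len(diffs)), key=diffs.__getitem__)
    return (years[lowest], diffs[lowest]), (years[highest], diffs[highest])


# Made-up values for two lines drawn on the same axes.
years = [2001, 2002, 2003, 2004, 2005]
line_a = [10, 14, 19, 21, 22]
line_b = [12, 13, 15, 20, 25]

biggest_negative, biggest_positive = difference_extremes(years, line_a, line_b)
print("biggest negative difference:", biggest_negative)
print("biggest positive difference:", biggest_positive)
```

Plotting the difference series as a single line puts the answer on a common scale instead of leaving it to a visual subtraction.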
THE CLEVELAND-MCGILL EFFECT
From Cleveland and McGill (1984)
WEBER’S LAW AND FRAMED BOXES
From Cleveland and McGill (1984)
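Weber's law says that a difference is noticed according to its size relative to the magnitudes compared. Framing two bars in boxes of equal height lets the reader compare the short unfilled parts instead of the long filled ones. A minimal numeric sketch in Python ; the bar lengths, the frame height and the function name are made up for illustration.

```python
def relative_difference(a, b):
    """Difference between two positive lengths, relative to the smaller one."""
    return abs(a - b) / min(a, b)


frame = 10.0               # common frame height (hypothetical)
bar_a, bar_b = 9.0, 9.2    # two nearly equal bars (hypothetical)

filled = relative_difference(bar_a, bar_b)                    # about 0.02
unfilled = relative_difference(frame - bar_a, frame - bar_b)  # about 0.25

print(f"filled parts differ by {filled:.1%}")
print(f"unfilled parts differ by {unfilled:.1%}")
```

The absolute gap is the same in both cases ; only the reference length changes, and with it the ease of detection.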
THE CLEVELAND-MCGILL SCALE
http://hcil2.cs.umd.edu/trs/99-20/99-20.html
PARTIAL CONCLUSION
Gordon and Finch (2015) give some nice principles
1. Show the data clearly
2. Use simplicity in design
3. Use good alignment on a common scale for quantities to be
compared
4. Keep visual encoding transparent
5. Use graphical forms consistent with those principles
We may add some others (use preattentive elements,
integrity, ...)
Do not forget the big picture
CASE STUDY : VISUALIZING THE WHOLE AND THE
DETAILS !
2588 dairy farmers over 11 years.
One variable is estimated : risk aversion (AR)
6 regions of study
Don’t know the results
https://xtophedataviz.shinyapps.io/ShinyParallel/
CASE STUDY : RISK AVERSION
Simple plot : Median value over time.
Simple plot : Median value with dispersion visualized.
Classical BoxPlot : There are changes over time.
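The three plots above all rest on the same yearly summaries : a median, and quartiles for the dispersion. A minimal sketch in Python with the standard library ; the values are made up (they are not the farm data), and the function name is illustrative.

```python
import statistics


def yearly_summary(values_by_year):
    """Return {year: (first quartile, median, third quartile)}."""
    summary = {}
    for year in sorted(values_by_year):
        values = values_by_year[year]
        q1, _, q3 = statistics.quantiles(values, n=4)
        summary[year] = (q1, statistics.median(values), q3)
    return summary


# Hypothetical risk-aversion estimates for a handful of farms.
risk_aversion = {
    2001: [0.8, 1.1, 1.3, 1.9, 2.4],
    2002: [0.7, 1.0, 1.6, 2.2, 3.0],
}
for year, (q1, median, q3) in yearly_summary(risk_aversion).items():
    print(year, "median", median, "quartiles", (q1, q3))
```

Drawing the median alone gives the first plot ; adding the band between the quartiles shows the dispersion, and the boxplot draws the same quantities per year.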
CASE STUDY : HOW TO VISUALIZE FARMS ?
Points over time : Too much overlapping
Points over time : Jitter helps !
Farms over time : Jitter helps !
Farms over time : Spaghetti plots !
Farms over time : Spaghetti plots with some Brushing !
Farms over time by region : Multiple Spaghetti plots !
Farms over time by region : Highlighting Spaghetti plots !
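Jitter, as used in the plots above, adds a small random offset to each point so that overlapping observations separate. A minimal sketch in Python, assuming uniform noise on the time axis only ; the function name, the width of 0.2 and the sample values are illustrative choices, not the settings of the case study.

```python
import random


def jitter(values, width=0.2, seed=None):
    """Return a copy of `values` with uniform noise in [-width, width] added."""
    rng = random.Random(seed)
    return [value + rng.uniform(-width, width) for value in values]


# Ten hypothetical farms observed the same year: all points share one x value.
years = [2001] * 10
jittered_years = jitter(years, width=0.2, seed=42)
print([round(x, 2) for x in jittered_years])
```

Only the plotting position is perturbed ; the response values stay untouched, and a width well below the spacing between years keeps each point attached to its year.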
WHAT TO REMEMBER
Data visualisation serves at least two main purposes
Data exploration
Graphs as visual tests, comparisons (short time to build
and to read)
Data representation
Summaries, storytelling (long time to build, short time to
read)
The problem is that :
“Communicating implies simplification,
data exploration implies exhaustivity”
For the viewer, “data visualisations” are implicitly or explicitly
comparisons or even tests (in the statistical sense)
Graphics should help questioning
They should provide elements to answer (data at least)
If the question implies comparison, they should truthfully
show the comparison
Many “data visualisations” are useless, meaningless or stupid !
Some are simply poor :
Graphs as visual tests, comparisons (short time to build
and to read)
Some are funny :
Many are ridiculous :
CHALLENGES : NETWORKS
Relationships between all the characters of Victor Hugo’s "Les
Misérables".
http://bl.ocks.org/mbostock/4062045_
NETWORKS : ADJACENCY MATRIX PLOT
An adjacency matrix, where each cell ij represents an edge from
vertex i to vertex j. Here, vertices represent characters in a
book, while edges represent co-occurrence in a chapter.
Here again, sorting is very useful !
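The matrix described above can be built directly from the list of characters present in each chapter, and then reordered. A minimal sketch in Python ; the chapters are made up for illustration (they are not the real co-occurrence data), the function names are hypothetical, and sorting by number of neighbours is only one possible ordering.

```python
from itertools import combinations


def cooccurrence_matrix(chapters):
    """Return (names, matrix) where matrix[i][j] counts shared chapters."""
    names = sorted({name for chapter in chapters for name in chapter})
    index = {name: i for i, name in enumerate(names)}
    matrix = [[0] * len(names) for _ in names]
    for chapter in chapters:
        for a, b in combinations(sorted(set(chapter)), 2):
            matrix[index[a]][index[b]] += 1
            matrix[index[b]][index[a]] += 1  # undirected edge: keep it symmetric
    return names, matrix


def sort_by_degree(names, matrix):
    """Reorder rows and columns so the most connected characters come first."""
    degree = [sum(1 for weight in row if weight > 0) for row in matrix]
    order = sorted(range(len(names)), key=lambda i: (-degree[i], names[i]))
    sorted_names = [names[i] for i in order]
    sorted_matrix = [[matrix[i][j] for j in order] for i in order]
    return sorted_names, sorted_matrix


# Hypothetical chapters, each listing the characters that appear in it.
chapters = [["Valjean", "Javert"],
            ["Valjean", "Cosette"],
            ["Valjean", "Cosette", "Marius"]]
names, matrix = sort_by_degree(*cooccurrence_matrix(chapters))
for name, row in zip(names, matrix):
    print(f"{name:8s}", row)
```

With rows and columns in the same sorted order, dense blocks of the matrix gather near the top left corner, which is what makes groups of characters visible.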
WHAT TO REMEMBER : THERE ARE RULES
Data visualisation is a visual language, so there are :
Elements of language
Rules of use (spelling)
Grammar
WHAT TO REMEMBER : A GOOD TECHNIQUE DOES
NOT PRECLUDE GOOD COMMON SENSE !
let’s...
KISS
Keep It Simple Stupid !
Keep It Statistical Stupid !
Keep It Statistical and Simple !
REFERENCES
Anscombe, F. J. (1973). Graphs in statistical analysis. The American
Statistician, 27(1):17–21.
Bertin, J. (1970). La graphique. Communications, 15(1):169–185.
Bertin, J. (1981). Théorie matricielle de la graphique. Communication et
langages, 48(1):62–74.
Bertin, J. (1983). Semiology of graphics, translation of Sémiologie graphique
(1967).
Bertin, J. (2005). Sémiologie graphique: Les diagrammes, les réseaux, les cartes. Les
Réimpressions des Éditions de l’École des Hautes Études en Sciences
Sociales. Éditions de l’École des Hautes Études en Sciences Sociales.
Buja, A., Cook, D., Hofmann, H., Lawrence, M., Lee, E.-K., Swayne, D. F., and
Wickham, H. (2009). Statistical inference for exploratory data analysis and
model diagnostics. Philosophical Transactions of the Royal Society of London
A: Mathematical, Physical and Engineering Sciences, 367(1906):4361–4383.
Cairo, A. (2012). The Functional Art: An introduction to information graphics and
visualization. Voices That Matter. Pearson Education.
Chen, C.-h., Härdle, W. K., and Unwin, A. (2007). Handbook of data
visualization. Springer Science & Business Media.
Cleveland, W. S. (1994). The Elements of Graphing Data. Hobart Press,
Summit, NJ, 2nd edition.
Cleveland, W. S. and McGill, R. (1984). Graphical perception: Theory,
experimentation, and application to the development of graphical
methods. Journal of the American Statistical Association, 79(387):531–554.
Few, S. (2008). Practical rules for using color in charts. Visual Business
Intelligence Newsletter, (11).
Friendly, M. and Kwan, E. (2012). Comment. Journal of Computational and
Graphical Statistics.
Gelman, A. (2004). Exploratory data analysis for complex models. Journal of
Computational and Graphical Statistics, 13(4).
Gelman, A. (2011). Why tables are really much better than graphs. Journal of
Computational and Graphical Statistics, 20(1):3–7.
Gelman, A., Pasarica, C., and Dodhia, R. (2002). Let’s practice what we
preach: turning tables into graphs. The American Statistician,
56(2):121–130.
Gelman, A. and Unwin, A. (2011). Visualization, graphics, and statistics.
Statistical Computing and graphics, 22(1):9–12.
Gordon, I. and Finch, S. (2015). Statistician heal thyself: Have we lost the
plot? Journal of Computational and Graphical Statistics, 24(4):1210–1229.
Healey, C. (2007). Perception in visualization.
Huff, D. (1993). How to Lie with Statistics. W. W. Norton & Company.
Munzner, T. (2014). Visualization Analysis and Design. AK Peters Visualization
Series. A K Peters/CRC Press, 1st edition.
Theus, M. and Urbanek, S. (2009). Interactive graphics for data analysis:
principles and examples. Series in computer science and data analysis. CRC
Press.
Treisman, A. (1985). Preattentive processing in vision. Computer Vision,
Graphics, and Image Processing, 31(2):156–177.
Tufte, E. R. (2001). The Visual Display of Quantitative Information. Graphics
Press, 2nd edition.
Tukey, J. W. (1977). Exploratory data analysis. Reading, Mass.
Ware, C. (2012). Information visualization: perception for design. Elsevier.


Evidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docxEvidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docx
turveycharlyn
 
Mba2216 week 07 08 measurement and data collection forms
Mba2216 week 07 08 measurement and data collection formsMba2216 week 07 08 measurement and data collection forms
Mba2216 week 07 08 measurement and data collection forms
Stephen Ong
 
Focus on what you learned that made an impression, what may have s.docx
Focus on what you learned that made an impression, what may have s.docxFocus on what you learned that made an impression, what may have s.docx
Focus on what you learned that made an impression, what may have s.docx
keugene1
 
Running head ONline analytical process1ONline analytical proce.docx
Running head ONline analytical process1ONline analytical proce.docxRunning head ONline analytical process1ONline analytical proce.docx
Running head ONline analytical process1ONline analytical proce.docx
toltonkendal
 
Research1ResearchStudent’s NameUniversity Affiliation.docx
Research1ResearchStudent’s NameUniversity Affiliation.docxResearch1ResearchStudent’s NameUniversity Affiliation.docx
Research1ResearchStudent’s NameUniversity Affiliation.docx
debishakespeare
 

Similar to Data Visualisation for Data Science (20)

A cognitive architecture-based modelling approach to understanding biases in ...
A cognitive architecture-based modelling approach to understanding biases in ...A cognitive architecture-based modelling approach to understanding biases in ...
A cognitive architecture-based modelling approach to understanding biases in ...
 
DECISIVe workshop introduction
DECISIVe workshop introductionDECISIVe workshop introduction
DECISIVe workshop introduction
 
Evidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docxEvidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docx
 
Theory building lecture-3
Theory building lecture-3Theory building lecture-3
Theory building lecture-3
 
Overview of Statistical Concepts
Overview of Statistical ConceptsOverview of Statistical Concepts
Overview of Statistical Concepts
 
The Dark Art: Is Music Recommendation Science a Science
The Dark Art: Is Music Recommendation Science a ScienceThe Dark Art: Is Music Recommendation Science a Science
The Dark Art: Is Music Recommendation Science a Science
 
Abdm4064 week 07 08 measurement part 1
Abdm4064 week 07 08 measurement part 1Abdm4064 week 07 08 measurement part 1
Abdm4064 week 07 08 measurement part 1
 
Mba2216 week 07 08 measurement and data collection forms
Mba2216 week 07 08 measurement and data collection formsMba2216 week 07 08 measurement and data collection forms
Mba2216 week 07 08 measurement and data collection forms
 
Is it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQIIs it important to explain a theorem? A case study in UML and ALCQI
Is it important to explain a theorem? A case study in UML and ALCQI
 
Focus on what you learned that made an impression, what may have s.docx
Focus on what you learned that made an impression, what may have s.docxFocus on what you learned that made an impression, what may have s.docx
Focus on what you learned that made an impression, what may have s.docx
 
Data Analysis - How to Make Evidence from Data
Data Analysis - How to Make Evidence from DataData Analysis - How to Make Evidence from Data
Data Analysis - How to Make Evidence from Data
 
Categories for data collection methods
Categories for data collection methodsCategories for data collection methods
Categories for data collection methods
 
Research methods workshop data analysis
Research methods workshop data analysisResearch methods workshop data analysis
Research methods workshop data analysis
 
Real-life Data Visualization - guest lecture for McGill INSY-442
Real-life Data Visualization - guest lecture for McGill INSY-442Real-life Data Visualization - guest lecture for McGill INSY-442
Real-life Data Visualization - guest lecture for McGill INSY-442
 
Running head ONline analytical process1ONline analytical proce.docx
Running head ONline analytical process1ONline analytical proce.docxRunning head ONline analytical process1ONline analytical proce.docx
Running head ONline analytical process1ONline analytical proce.docx
 
Es estadísticas duro
Es estadísticas duroEs estadísticas duro
Es estadísticas duro
 
Research1ResearchStudent’s NameUniversity Affiliation.docx
Research1ResearchStudent’s NameUniversity Affiliation.docxResearch1ResearchStudent’s NameUniversity Affiliation.docx
Research1ResearchStudent’s NameUniversity Affiliation.docx
 
How to Do Ecology (the easy way)
How to Do Ecology (the easy way)How to Do Ecology (the easy way)
How to Do Ecology (the easy way)
 
TheoreticalFramework
TheoreticalFrameworkTheoreticalFramework
TheoreticalFramework
 
Udforskning af problem gennem forslag til dets løsning
Udforskning af problem gennem forslag til dets løsningUdforskning af problem gennem forslag til dets løsning
Udforskning af problem gennem forslag til dets løsning
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Data Visualisation for Data Science

  • 1. Definitions Typologies Good vs bad Tables Principles Before After Visual perception An example What to remember Référ Data Visualization for Data Science Principles in action Christophe Bontemps Toulouse School of Economics, INRA
  • 2. MY JOB
  • 3-6. WHY AM I HERE? From Huff (1993)
  • 7. BEFORE WE START: Let’s do a simple exercise (from Buja et al. (2009)).
  • 8-9. THE “VISUAL PERCEPTION” OF A GRAPHIC (source: Buja et al. (2009))
  • 10-13. “VISUAL PERCEPTION” AS A STATISTICAL TEST
      “The human eye acts as a broad feature detector and general statistical test.” Buja et al. (2009)
      Test:
      H0: {There is "nothing"} = {No relation}
      H1: {There is "something"} = {There is some relation (correlation, linearity, heterogeneity, groups, ...)}
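One way to make the visual test above concrete is the "lineup" idea from Buja et al. (2009): hide the observed data among null datasets generated under H0 and ask a viewer to pick the odd one out. The sketch below only builds the data for such a lineup; the function name `make_lineup`, the simulated data and the choice of 20 panels are illustrative assumptions, and shuffling y is just one possible way of generating "no relation" data.

```python
import random


def make_lineup(x, y, m=20, seed=0):
    """Return (panels, true_position) for a lineup of m panels.

    Each panel is a list of (x, y) pairs. One panel holds the observed
    data; the other m - 1 are null datasets in which y has been shuffled,
    which breaks any relation between x and y (H0: no relation).
    """
    rng = random.Random(seed)
    true_position = rng.randrange(m)
    panels = []
    for k in range(m):
        if k == true_position:
            panels.append(list(zip(x, y)))
        else:
            shuffled = list(y)  # copy, so the observed data stays untouched
            rng.shuffle(shuffled)
            panels.append(list(zip(x, shuffled)))
    return panels, true_position


# Hypothetical data: a linear relation plus noise (not from the slides).
data_rng = random.Random(1)
x = [i / 10 for i in range(50)]
y = [2 * xi + data_rng.gauss(0, 1) for xi in x]

panels, true_position = make_lineup(x, y, m=20)

# Under H0, a viewer who has not seen the data picks the true panel
# with probability 1/m.
p_value_if_spotted = 1 / len(panels)
```

Each panel would then be drawn as a small scatter plot; if the viewer singles out the panel at `true_position`, that is evidence against H0 at level 1/m.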
  • 14-18. “VISUAL PERCEPTION” AS A COMPARISON
      What do you see here? Difficult to see the maximum/minimum of each curve...
      Idea shared by Gelman (2004) and Munzner (2014).
  • 19-27. WHAT IS DATA VISUALIZATION?
      - It is a representation, a function of the data
      - A statistic, too, is a function or a summary of the data
      - So it is a sort of statistic
      - It can be descriptive or inferential
      - Two- or multi-dimensional
      - Static or dynamic
      - Informative or not
      - Misleading or accurately representing the data
      - Beautiful or ugly...
  • 28-31. WHAT IS DATA VISUALIZATION?
      - For Tukey (1977): “The greatest value of a picture is when it forces us to notice what we never expected to see.”
      - Cleveland (1994) says that “graphical methods and techniques are powerful tools for showing the structure of data. The material is relevant for data analysis, when the analyst wants to study data, and for data communication, when the analyst wants to communicate data to others.”
      - Bertin (2005) (translated in Bertin (1983)) defines it as a "visual language" and, as such, as having a semiology, i.e. a theory of the functions of signs and symbols.
      - Tufte (2001): “Graphics are instruments for reasoning about quantitative information. Often the most effective way to describe, explore and summarize a set of numbers - even a large set - is to look at pictures of those numbers.”
  • 32-35. SO WHAT? Data visualisation serves different purposes:
      - Exploratory data analysis
      - Statistical questioning of data patterns
      - Visual display of information for communication
      - Tool for interacting with data
  • 36-39. TWO TYPES OF GRAPHICS: THOSE UNDERSTOOD IMMEDIATELY
      - FIGURE – Seen on HK-TV
      - FIGURE – Where do people run in Paris (N. Yau). Source: http://flowingdata.com/2014/02/05/where-people-run/
      - FIGURE – Climate forecast uncertainty (S. Planton)
  • 40-42. ... AND THOSE NOT UNDERSTOOD IMMEDIATELY:
      - FIGURE – (Dynamic) parallel coordinates plot comparing 5 indicators for 3 countries (Sweden, Nigeria and Germany). Source: http://ncva.itn.liu.se/education-geovisual-analytics/parallel-c
      - FIGURE – Pagerank Algorithm Reveals World’s All-Time Top Soccer Team (MIT Review, March 2015)
      - FIGURE – How people spend their days (NYT).
  • 43. “GOOD” OR “BAD” GRAPHICS? “There are no ‘good’ or ‘bad’ graphics (...), there are graphics answering legitimate questions and graphics that do not answer any question at all.” Bertin (1981)
  • 44-51. FAMOUS EXAMPLES OF “GOOD” VISUALIZATIONS
      - FIGURE – Charles Minard’s (1869) chart showing the number of men in Napoleon’s 1812 Russian campaign army, their movements, as well as the temperature they encountered on the return path.
      - FIGURE – London Cholera Map - John Snow (1854)
      - FIGURE – War Mortality - Florence Nightingale (1855) found that deaths from zymotic diseases (blue) exceeded those from wounds and injuries.
      - Same data with “modern” visualisation tools: FIGURE – War Mortality - Florence Nightingale (1855), redrawn by Gelman and Unwin (2011).
      - FIGURE – Visualizing 5 dimensions: Gapminder (Hans Rosling)
  • 52-56. SO WHAT ARE THE RULES? Can you name some rules for a good (resp. bad) graphic? Your turn!
      - Axis and scale (starting at zero!)?
      - Context?
      - No multiple scales?
      - Colors?
  • 57. YOUR TURN: WHAT’S WRONG WITH THIS GRAPHIC?
  • 58. BANANA SALES HAVE INCREASED! FIGURE – From A. Dix, example of an interactive bar chart
  • 59. WHAT’S WRONG WITH THIS GRAPHIC? FIGURE – Government spending "skyrocketing". Tufte (2001), from Playfair (1786).
  • 60. SCALES ARE MISLEADING! FIGURE – Government spending "skyrocketing" (revisited). Tufte (2001), from Playfair (1786).
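A little arithmetic shows why the starting point of an axis matters so much for bar charts. The sketch below compares the ratio of two bar lengths as drawn when the axis starts at zero and when it is truncated; the figures (100 and 110, axis starting at 95) and the helper `drawn_ratio` are made up for illustration and do not come from the graphics on these slides.

```python
def drawn_ratio(a, b, axis_start=0.0):
    """Ratio of the two bar lengths as drawn when the axis starts at axis_start."""
    return (b - axis_start) / (a - axis_start)


# Hypothetical sales figures for two periods (not from the slides).
before, after = 100.0, 110.0

# Axis starting at zero: bar lengths are proportional to the data.
true_ratio = drawn_ratio(before, after)

# Axis starting at 95: only the part of each bar above 95 is drawn.
truncated_ratio = drawn_ratio(before, after, axis_start=95.0)

print(f"axis from 0:  second bar is {true_ratio:.2f}x the first")
print(f"axis from 95: second bar is {truncated_ratio:.2f}x the first")
```

A 10 % increase in the data ends up drawn as a bar three times as long, which is the kind of distortion the "skyrocketing" example illustrates.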
  • 61. WHAT’S WRONG WITH THIS GRAPHIC? (HARDER) FIGURE – Major Cause of Disability, 1975-2010 (J. Schwabish, 2014).
  • 62. WHAT’S WRONG WITH THIS GRAPHIC? (HARDER) Do you remember a damn thing from this graph?
  • 63. (SMALL) MULTIPLE GRAPHS ARE OFTEN BETTER. FIGURE – Major Cause of Disability, 1975-2010 (J. Schwabish). Cf. "brushing" (e.g. for parallel coordinates plots)
  • 64. WHAT’S WRONG WITH THIS GRAPHIC? (HARDER)
  • 65. KEEP ALL YOUR AUDIENCE. Figure panels: Normal → Color-blind →
  • 66. WHICH MEANS THAT FOR 5% OF MEN: See also the ggplot option + scale_colour_colorblind()
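Outside ggplot, the same idea can be applied by picking colours from a palette designed for colour-vision deficiency. The sketch below hard-codes the eight-colour Okabe-Ito palette and assigns one colour per category; the helper `assign_colours` is hypothetical, and the claim that `scale_colour_colorblind()` (from the ggthemes package) draws on this same palette is an assumption worth checking against that package's documentation.

```python
# Okabe-Ito palette: eight colours chosen to stay distinguishable under
# the common forms of colour-vision deficiency.
OKABE_ITO = {
    "black": "#000000",
    "orange": "#E69F00",
    "sky blue": "#56B4E9",
    "bluish green": "#009E73",
    "yellow": "#F0E442",
    "blue": "#0072B2",
    "vermillion": "#D55E00",
    "reddish purple": "#CC79A7",
}


def assign_colours(categories):
    """Map each category to a palette colour, refusing to recycle colours."""
    colours = list(OKABE_ITO.values())
    if len(categories) > len(colours):
        raise ValueError("more categories than distinguishable colours")
    return dict(zip(categories, colours))


# Example with three series names.
print(assign_colours(["Sweden", "Nigeria", "Germany"]))
```

Refusing to recycle colours is a deliberate choice here: with more than eight categories, another encoding (shape, facets, direct labels) is usually safer than adding hues.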
  • 67-69. DATA VISUALISATION IS USED FOR TWO MAIN PURPOSES
      - Data exploration: graphs as visual tests, comparisons (short time to build and to read)
      - Data representation: summaries, storytelling (long time to build, short time to read)
      The problem is that “communicating implies simplification; data exploration implies exhaustiveness”.
  • 70. TABLES VS GRAPHICS? Several papers have discussed the issue: Gelman et al. (2002), Gelman (2011) and Friendly and Kwan (2012). Here, descriptive statistics of continuous variables.
  • 71. TABLES VS GRAPHICS? Graph version of the table. From Gelman (2011)
  • 72. GRAPHICS reveal DATA: ANSCOMBE (1973) QUARTET We use here 4 pairs of variables: (X1, Y1), (X2, Y2), (X3, Y3) and (X4, Y4). All four data sets have the same descriptive statistics.
        Xs   Mean   Std. Dev.   Ys   Mean   Std. Dev.   corr(Xi, Yi)   N
        X1   9      3.32        Y1   7.5    2.03        0.8164         11
        X2   9      3.32        Y2   7.5    2.03        0.8162         11
        X3   9      3.32        Y3   7.5    2.03        0.8163         11
        X4   9      3.32        Y4   7.5    2.03        0.8165         11
  • 73. ANSCOMBE (1973) QUARTET All four data sets are described by the same linear model (Yi = α + βXi + εi), apparently revealing the same relationships:
        Dependent variable:           Y1          Y2          Y3          Y4
        Regressed on Xi, i=1,...,4    0.500***    0.500***    0.500***    0.500***
        Constant                      3.000**     3.001**     3.002**     3.002**
        R2                            0.667       0.666       0.666       0.667
        Resid. Std. Error             1.237       1.237       1.236       1.236
        F Statistic                   17.990***   17.966***   17.972***   18.003***
        Note: Data from Anscombe (1973). * p < 0.1; ** p < 0.05; *** p < 0.01
  • 74. ANSCOMBE (1973) QUARTET A simple scatter plot (regression overlaid) shows something very different. FIGURE – Four panels: Regression of Yi on Xi (with constant), i = 1,...,4 (x-axis: xi, y-axis: yi)
  • 75. ANSCOMBE (1973) QUARTET NB: Plots of the residuals also show the same differences. FIGURE – Four panels: Residual vs Fitted Plot (x-axis: Fitted values, y-axis: Residuals)
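The identical summary statistics and regression fits quoted for Anscombe's quartet can be checked in a few lines. The sketch below uses the published Anscombe (1973) values and plain Python; the function name `summarize` is only illustrative.

```python
from math import sqrt

# Anscombe (1973) data: the first three sets share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
ANSCOMBE = {
    1: (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Means, sample standard deviations, correlation and OLS fit of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "sd_x": sqrt(sxx / (n - 1)),
        "mean_y": mean_y,
        "sd_y": sqrt(syy / (n - 1)),
        "corr": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


SUMMARIES = {k: summarize(x, y) for k, (x, y) in ANSCOMBE.items()}
for k, s in SUMMARIES.items():
    print(k, {name: round(value, 3) for name, value in s.items()})
```

The four printed rows are nearly indistinguishable, which is exactly why the scatter plots are needed to see that the data sets differ.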
  • 76. TABLES AND MATRICES Data with many 0/1 variables (indicators for towns) Bertin (1981)
  • 77. TABLES AND MATRICES
  • 78. TABLES AND MATRICES Bertin (1981)
  • 79. AND IN MANY DIMENSIONS?
  • 80. TABLES AND MATRICES From Munzner (2014)
  • 81. TABLES AND MATRICES From Munzner (2014)
  • 82. TABLES AND MATRICES From Munzner (2014)
  • 83. REGRESSION TABLES ARE GRAPHICS! Classical "visualisation" of regressions:
        Variable            (Mod. 1) Special      (Mod. 2) Special
        i_under18           -0.0692* (-2.25)      -0.119*** (-3.57)
        log_income          0.116*** (4.31)       0.102*** (3.51)
        i_car               0.00131 (0.03)        -0.112* (-2.00)
        b08_locenv_water    0.0624*** (4.99)      0.0583** (4.28)
        i_can               0.710*** (23.27), reported in one model only
        Constant            -1.467*** (-5.38)     -0.961** (-3.24)
  • 84. REGRESSION TABLES ARE GRAPHICS! The same table without stars:
        Variable            (Mod. 1) Special      (Mod. 2) Special
        i_under18           -0.0692 (-2.25)       -0.119 (-3.57)
        log_income          0.116 (4.31)          0.102 (3.51)
        i_car               0.00131 (0.03)        -0.112 (-2.00)
        b08_locenv_water    0.0624 (4.99)         0.0583 (4.28)
        i_can               0.710 (23.27), reported in one model only
        Constant            -1.467 (-5.38)        -0.961 (-3.24)
        Stars are used as preattentive visual variables!
  • 85. REGRESSION AS A GRAPHIC
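One common way to turn such a regression table into a graphic is a coefficient plot, where each estimate is drawn with an interval around it. The sketch below computes those intervals for the (Mod. 1) column. It assumes that the values in parentheses are t-statistics and uses a normal approximation (z = 1.96); both are assumptions, and the function name `wald_interval` is illustrative.

```python
# (coefficient, value in parentheses) for the (Mod. 1) column of the table.
# Assumption: the parenthesised values are t-statistics.
MODEL_1 = {
    "i_under18": (-0.0692, -2.25),
    "log_income": (0.116, 4.31),
    "i_car": (0.00131, 0.03),
    "b08_locenv_water": (0.0624, 4.99),
}


def wald_interval(coef, t_stat, z=1.96):
    """Approximate 95% interval, recovering the standard error as coef / t."""
    std_err = abs(coef / t_stat)
    return coef - z * std_err, coef + z * std_err


INTERVALS = {name: wald_interval(c, t) for name, (c, t) in MODEL_1.items()}
for name, (low, high) in INTERVALS.items():
    verdict = "excludes 0" if low > 0 or high < 0 else "includes 0"
    print(f"{name:18s} [{low:+.4f}, {high:+.4f}]  {verdict}")
```

Plotting each interval as a horizontal segment around its point estimate gives the reader the same information as the stars, plus the size and uncertainty of each effect on a common scale.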
  • 89. GOOD GRAPHICS? In the excellent Handbook of data visualisation (Chen et al., 2007), we find some good questions. What to Whom, How and Why? A graphic may be linked to three pieces of text: its caption, a headline and an article it accompanies. Ideally, all three should be consistent and complement each other. Present or explore data? Different purposes, different requirements! Choice of graphical form? The choice depends on the type of data to be displayed (e.g. univariate continuous data, bivariate categorical data, etc.) and on what is to be shown. Unique solution? There is not always a unique optimal choice; alternatives can be equally good, or good in different ways, emphasizing different aspects of the same data.
  • 95. EDWARD R. TUFTE’S RULES In his seminal book, Tufte (2001) proposes some principles for displaying quantitative information. Data: Above all, show the data. Question: Induce the viewer to think about the substance rather than about methodology or graphic design. Encourage the eye to compare different pieces of data. Data-ink ratio: Maximize the data-ink ratio. Erase all non-data ink, erase redundant information. Integrity: Avoid distorting what the data have to say. General to specific: Reveal the data at different levels of detail (from broad picture to fine structure). Context: Graphical displays should be closely integrated with the statistical and verbal descriptions of the data set.
  • 96. PRACTICAL EXAMPLE: DATA-INK RATIO Let’s start with a classical graph (R default boxplot). FIGURE – Distribution of a continuous variable across 5 groups (x-axis: Group, g1 to g5; y-axis: Response, 98 to 112)
  • 97. ERASE ALL NON-DATA INK FIGURE – Distribution of a continuous variable across 5 groups (x-axis: Group, 1 to 5; y-axis: Response, 98 to 112)
  • 98. ERASE ALL REDUNDANT INFORMATION! FIGURE – Distribution of a continuous variable across 5 groups (x-axis: Group, 1 to 5; y-axis: Response, 98 to 112)
  • 99. GOING FURTHER... FIGURE – Distribution of a continuous variable across 5 groups (x-axis: Group, 1 to 5; y-axis: Response, 98 to 112)
  • 100. AND SHOW THE DATA... FIGURE – Distribution of a continuous variable across 5 groups (x-axis: Group, 1 to 5; y-axis: Response), with the group medians labelled: 101.0, 100.0, 101.0, 103.8, 109.1
  • 101. HAVE WE LOST SOMETHING? FIGURE – Distribution of a continuous variable across 5 groups: the original R default boxplot next to the final version with labelled medians (101.0, 100.0, 101.0, 103.8, 109.1). Did you notice that group 1 and group 3 have the same median (101.0)? See the ggplot theme + theme_tufte()
  • 102. INTEGRITY: THE LIE FACTOR $\text{Lie Factor} = \dfrac{\text{size of effect shown in graphic}}{\text{size of effect in data}}$ (1) A Lie Factor ≠ 1 indicates a distortion (Tufte treats values above 1.05 or below 0.95 as substantial). FIGURE – Fuel economy standards. (E. Tufte - from NY Times 1978)
  • 103. FIGURE – Fuel economy standards (revisited) The "18 mpg" line measures 1.5 cm (in 1978); the "27.5 mpg" line measures 13 cm (in 1985). The graphic shows an increase of (13 − 1.5)/1.5 ≈ 767%, while the data increase by (27.5 − 18)/18 ≈ 53% → Lie factor ≈ 14.5!
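The lie factor of the fuel economy graphic can be recomputed from the measurements quoted on the slide. In this short sketch, the function names are illustrative and the "size of effect" is taken as the relative change between the two dates.

```python
def relative_change(first, second):
    """Relative change from the first value to the second."""
    return (second - first) / first


def lie_factor(graphic_first, graphic_second, data_first, data_second):
    """Effect shown in the graphic divided by the effect present in the data."""
    shown = relative_change(graphic_first, graphic_second)
    actual = relative_change(data_first, data_second)
    return shown / actual


# Line lengths in cm (1978, 1985) and the standards in mpg (18, 27.5).
FUEL_ECONOMY_LIE_FACTOR = lie_factor(1.5, 13.0, 18.0, 27.5)
print(round(FUEL_ECONOMY_LIE_FACTOR, 1))  # prints 14.5
```

A faithful graphic would give a value close to 1: the drawn lengths would grow by about 53%, like the data.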
  • 104. BERTIN’S APPROACH: A VISUAL LANGUAGE If graphs are used to communicate, they are a form of language. Any language has a grammar, “words” and a logic. Let us study the science that deals with signs or sign language: semiology. TABLE – Bertin’s definition of 8 visual variables: Position (x, y), Size, Value, Texture, Colour, Orientation, Shape
  • 105. THESE VARIABLES SERVE DIFFERENT GOALS Visual variable syntactics designate each visual variable as suited or not for each level of measurement: equivalence, differences, order, proportions. Variable suited for: Position (x, y): =, O, ∝; Size: =, O, ∝; Value: =, O, ∝; Texture: =, O; Colour: =; Orientation: =; Shape: ≡. Legend: ≡ equivalence, = differences, O order, ∝ proportions
  • 106. EXAMPLE: SHAPE IS NOT SUITABLE FOR PROPORTIONALITY Price of land in the East of France Bertin (1970)
  • 107. EXAMPLE: SIZE IS SUITABLE FOR PROPORTIONALITY Price of land in the East of France Bertin (1970)
  • 108. A NOTE ON COLORS “Colors” are not suited for ordering! Try putting the following hues in order from low to high.
  • 109. A NOTE ON COLORS These colors are easy to order from low to high. Few (2008) provides meaningful solutions for choosing palettes of colours, for example for heatmaps. See also the ggplot theme theme_few()
  • 110. A NOTE ON COLORS (FINAL) Colors are sometimes a graphic puzzle (Tufte, 2001). Your eyes will go back and forth from the graph to the legend... (source: http://viz.wtf/image/135265269618)
  • 111. CONJUNCTION OF COLOURS AND PROPORTIONALITY Productivity of Airlines (Demo with googleVis)
  • 112. FLASH QUIZ: If 100% of the US prisoners are represented by the big square... what is the percentage for each group? FIGURE – Ethnic composition of prisoners in jail in 2008 in the USA. (Le Monde 5/12/2014)
  • 113. NOT SO SIMPLE... If 100% of the US prisoners are represented by the big square... what is the percentage for each group? FIGURE – Ethnic composition of prisoners in jail in 2008 in the USA. (Le Monde 5/12/2014)
  • 114. VERIFICATION If 100% of the US prisoners are represented by the big square... what is the percentage for each group? →
  • 115. OR... If 100% of the US prisoners are represented by the big square... what is the percentage for each group?
  • 116. IT MATTERS BECAUSE MANY HIGH-DIMENSIONAL VISUALISATIONS USE AREA... Spinograms: A spinogram is area-proportional just like the histogram, but allows a non-linear x-axis and thus can make all boxes of equal height. Theus and Urbanek (2009)
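The area-proportional construction of a spinogram described above can be sketched as follows: every box has the same height, its width is proportional to the bin count, and the highlighted part fills a share of the height equal to the highlighted proportion in that bin. The function name `spinogram_boxes` and the counts are hypothetical, chosen only for illustration.

```python
def spinogram_boxes(counts, highlighted):
    """Return one box per bin: left edge, width, and highlighted share of height.

    All boxes have height 1; the total width is normalised to 1.
    """
    total = sum(counts)
    boxes, left = [], 0.0
    for count, marked in zip(counts, highlighted):
        width = count / total
        share = marked / count if count else 0.0
        boxes.append({"left": left, "width": width, "highlight_height": share})
        left += width
    return boxes


# Hypothetical bin counts and highlighted counts (not from the slides).
BOXES = spinogram_boxes(counts=[10, 30, 60], highlighted=[5, 6, 30])
for box in BOXES:
    print(box)
```

Because the highlighted area of a box equals width × share, it stays proportional to the highlighted count, while the equal heights make the shares directly comparable across bins.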
  • 117. MOSAIC PLOTS Step 1 of the construction of a mosaic plot (similar to a spineplot here). All surviving passengers are highlighted in all plots. Theus and Urbanek (2009)
  • 118. MOSAIC PLOTS Step 2 of the construction of a mosaic plot. Conditioning on Age. Theus and Urbanek (2009)
  • 119. MOSAIC PLOTS Step 3 of the construction of a mosaic plot. Conditioning on Age and Gender. Theus and Urbanek (2009)
  • 120. MOSAIC PLOTS Final step of the construction of a mosaic plot. Explicit mention of Survived as highlighted. Theus and Urbanek (2009)
  • 121. SCHWABISH (JEP, 2014) BEFORE-AFTER FIGURE – An Unbalanced Chart - Original
  • 122. SCHWABISH (JEP, 2014) BEFORE-AFTER FIGURE – An Unbalanced Chart - Revised
  • 123. SCHWABISH (JEP, 2014) BEFORE-AFTER FIGURE – A Clutterplot Example - Original
  • 124. SCHWABISH (JEP, 2014) BEFORE-AFTER FIGURE – A Clutterplot Example - Revised
  • 125. “GOOD” OR “BAD” GRAPHICS? “There are no “good” nor “bad” graphics (...), there are graphics answering legitimate questions and graphics that do not answer question at all” Bertin (1981) It is easy to criticize... but are there some rules?
  • 126. A NOTE ON PERCEPTION A bird (Duck, Toucan?) on the X axis, a rabbit on the Y axis! Source: http://flowingdata.com/2014/06/25/duck-vs-rabbit-plot/
  • 127. “PREATTENTIVE” VARIABLES How many "3" in that sequence? (from Ware (2012))
  • 128. “PREATTENTIVE” VARIABLES How many "3" in that sequence? (from Ware (2012))
  • 129. “PREATTENTIVE” VARIABLES How many "3" in that sequence? (from Ware (2012))
  • 130. AND NOW... Find the red dot!
  • 131. TEST: FIND THE RED DOT!
  • 132. TEST: FIND THE RED DOT!
  • 133. TEST: FIND THE RED DOT!
  • 134. TEST: FIND THE RED DOT!
  • 135. TEST: FIND THE RED DOT!
  • 136. TEST: FIND THE RED DOT!
  • 137. TEST: FIND THE RED DOT!
  • 138. TEST: FIND THE RED DOT!
  • 139. TEST: FIND THE RED DOT!
  • 140. TEST: FIND THE RED DOT!
  • 141. TEST: FIND THE RED DOT!
  • 142. TEST: FIND THE RED DOT!
  • 143. HARDER: IS THERE A "STRANGER"?
  • 144. HARDER: IS THERE A "STRANGER"?
  • 145. HARDER: IS THERE A "STRANGER"?
  • 146. HARDER: IS THERE A "STRANGER"?
  • 147. HARDER: IS THERE A "STRANGER"?
  • 148. HARDER: IS THERE A "STRANGER"?
  • 149. HARDER: IS THERE A "STRANGER"?
  • 150. THAT WASN’T EASY The preattentive concept: Treisman (1985) and Healey (2007). Some visual elements or patterns are detected immediately. But there may be interference (between colour and form). Very useful (for detection, explanation and presentation). Helpful to highlight a message!
  • 151. TOO MUCH VARIATION DOESN’T HELP From Ware (2012)
  • 152. MOST PREATTENTIVE VISUAL VARIABLES From Ware (2012)
  • 153. VISUAL PERCEPTION AND PIE CHARTS
  • 154. VISUAL PERCEPTION AND PIE CHARTS
  • 155. VISUAL PERCEPTION AND PIE CHARTS https://twitter.com/freakonometrics/status/6127423301609512
  • 156. VISUAL PERCEPTION AND LINES From Cairo (2012)
  • 157. VISUAL PERCEPTION AND LINES When was the biggest negative (positive) difference? From Cairo (2012)
  • 158. VISUAL PERCEPTION AND LINES
  • 159. VISUAL PERCEPTION AND LINES When was the biggest negative (positive) difference?
  • 160. VISUAL PERCEPTION AND LINES When was the biggest negative (positive) difference?
  • 161. VISUAL PERCEPTION AND LINES When was the biggest negative (positive) difference? From Cairo (2012)
  • 162. THE CLEVELAND-MCGILL EFFECT
  • 163. THE CLEVELAND-MCGILL EFFECT
  • 164. THE CLEVELAND-MCGILL EFFECT From Cleveland and McGill (1984)
  • 165. WEBER’S LAW AND FRAMED BOXES
  • 166. WEBER’S LAW AND FRAMED BOXES
  • 167. WEBER’S LAW AND FRAMED BOXES From Cleveland and McGill (1984)
  • 168. THE CLEVELAND-MCGILL SCALE http://hcil2.cs.umd.edu/trs/99-20/99-20.html
  • 175. PARTIAL CONCLUSION Gordon and Finch (2015) give some nice principles: 1. Show the data clearly. 2. Use simplicity in design. 3. Use good alignment on a common scale for quantities to be compared. 4. Keep visual encoding transparent. 5. Use graphical forms consistent with those principles. We may add some others (use preattentive elements, integrity, ...)
  • 176. PARTIAL CONCLUSION Do not forget the big picture
  • 177. PARTIAL CONCLUSION Do not forget the big picture
  • 178. CASE STUDY: VISUALIZING THE WHOLE AND THE DETAILS! 2588 dairy farmers over 11 years. One variable is estimated: risk aversion (AR). 6 regions of study. We don’t know the results. https://xtophedataviz.shinyapps.io/ShinyParallel/
  • 179. CASE STUDY: RISK AVERSION Simple plot: Median value over time.
  • 180. CASE STUDY: RISK AVERSION Simple plot: Median value with dispersion visualized.
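A median-with-dispersion plot needs one summary per period. The slide does not say which dispersion measure is drawn; the sketch below assumes the interquartile range, a common choice, and uses hypothetical values and period labels rather than the case-study data.

```python
from statistics import median, quantiles


def yearly_summary(values_by_year):
    """Median and quartiles for each year, in chronological order."""
    summary = {}
    for year in sorted(values_by_year):
        values = values_by_year[year]
        q1, _, q3 = quantiles(values, n=4)  # three cut points: Q1, Q2, Q3
        summary[year] = {"q1": q1, "median": median(values), "q3": q3}
    return summary


# Hypothetical risk-aversion estimates for two periods (illustration only).
RISK_AVERSION = {
    1: [1.0, 2.0, 3.0, 4.0, 5.0],
    2: [2.0, 4.0, 6.0, 8.0, 10.0],
}
SUMMARY = yearly_summary(RISK_AVERSION)
for year, stats in SUMMARY.items():
    print(year, stats)
```

Drawing the median as a line and the band between `q1` and `q3` around it shows both the trend and how spread out the farms are in each period.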
  • 181. CASE STUDY: RISK AVERSION Classical BoxPlot: There are changes over time.
  • 182. CASE STUDY: HOW TO VISUALIZE FARMS? Points over time: Too much overlapping
  • 183. CASE STUDY: HOW TO VISUALIZE FARMS? Points over time: Jitter helps!
  • 184. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time: Jitter helps!
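Jittering, as used above to reduce overlap, simply adds a small random offset to the plotting position of each point. A minimal sketch, with a hypothetical helper name, hypothetical period indices and an arbitrary jitter width:

```python
import random


def jitter(positions, width=0.2, seed=0):
    """Shift each position by a uniform random offset in [-width, +width]."""
    rng = random.Random(seed)  # fixed seed so the figure is reproducible
    return [p + rng.uniform(-width, width) for p in positions]


# Hypothetical example: six farms observed in two periods all fall on
# the same two x positions before jittering.
PERIODS = [1, 1, 1, 2, 2, 2]
JITTERED = jitter(PERIODS)
print(JITTERED)
```

Only the display position changes; the offsets should stay small relative to the gap between periods so that each point still clearly belongs to its period.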
  • 185. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time: spaghetti plots!
  • 186. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time: spaghetti plots with some brushing!
  • 187. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time by region: multiple spaghetti plots!
  • 188. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time: spaghetti plots with some brushing!
  • 189. CASE STUDY: HOW TO VISUALIZE FARMS? Farms over time by region: highlighting spaghetti plots!
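A highlighted spaghetti plot draws one line per farm, with the farms of a chosen region in a strong color and all others in a muted one. The sketch below only prepares the lines and their styles, leaving the drawing to any plotting library; the record layout (farm, region, year, value), the function `build_lines`, and the colors are assumptions made for illustration.

```python
from collections import defaultdict


def build_lines(records, highlight_region):
    """Turn (farm, region, year, value) records into one styled line per farm."""
    series = defaultdict(list)
    region_of = {}
    for farm, region, year, value in records:
        series[farm].append((year, value))
        region_of[farm] = region

    lines = []
    for farm, points in series.items():
        points.sort()  # chronological order along the x-axis
        highlighted = region_of[farm] == highlight_region
        lines.append({
            "farm": farm,
            "points": points,
            "color": "red" if highlighted else "lightgrey",
            "zorder": 2 if highlighted else 1,
        })
    # Draw muted lines first so highlighted ones end up on top.
    lines.sort(key=lambda line: line["zorder"])
    return lines


# Hypothetical records, not the farm data from the slides.
records = [
    ("f1", "North", 2002, 3.0), ("f1", "North", 2001, 2.0),
    ("f2", "South", 2001, 1.0), ("f2", "South", 2002, 1.5),
]
lines = build_lines(records, highlight_region="North")
```

Repeating the call once per region, each time with a different `highlight_region`, gives one panel per region while keeping every farm visible as grey context.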
  • 190. WHAT TO REMEMBER. Data visualisation serves at least two main purposes. Data exploration: graphs as visual tests, comparisons (short time to build and to read).
  • 191. WHAT TO REMEMBER. Data visualisation serves at least two main purposes. Data exploration: graphs as visual tests, comparisons (short time to build and to read). Data representation: summaries, storytelling (long time to build, short time to read).
  • 192. WHAT TO REMEMBER. Data visualisation serves at least two main purposes. Data exploration: graphs as visual tests, comparisons (short time to build and to read). Data representation: summaries, storytelling (long time to build, short time to read). The problem is that: “Communicating implies simplification; data exploration implies exhaustiveness.”
  • 193. WHAT TO REMEMBER. From the viewer’s perspective, “data visualisations” are implicitly or explicitly comparisons, or even tests (in the statistical sense). Graphics should help questioning.
  • 194. WHAT TO REMEMBER. From the viewer’s perspective, “data visualisations” are implicitly or explicitly comparisons, or even tests (in the statistical sense). Graphics should help questioning. They should provide elements to answer (data at least).
  • 195. WHAT TO REMEMBER. From the viewer’s perspective, “data visualisations” are implicitly or explicitly comparisons, or even tests (in the statistical sense). Graphics should help questioning. They should provide elements to answer (data at least). If the question implies a comparison, they should show that comparison truthfully.
  • 196. WHAT TO REMEMBER. Many “data visualisations” are useless, meaningless or stupid! Some are simply poor: graphs as visual tests, comparisons (short time to build and to read).
  • 197. WHAT TO REMEMBER. Many “data visualisations” are useless, meaningless or stupid! Some are simply poor: graphs as visual tests, comparisons (short time to build and to read). Some are funny:
  • 198. WHAT TO REMEMBER. Many “data visualisations” are useless, meaningless or stupid! Some are simply poor: graphs as visual tests, comparisons (short time to build and to read). Some are funny: Many are ridiculous:
  • 199. CHALLENGES: NETWORKS. Relationships among all the characters of Victor Hugo’s "Les Misérables". http://bl.ocks.org/mbostock/4062045_
  • 200. NETWORKS: ADJACENCY MATRIX PLOT. An adjacency matrix, where each cell ij represents an edge from vertex i to vertex j. Here, vertices represent characters in a book, while edges represent co-occurrence in a chapter.
  • 201. NETWORKS: ADJACENCY MATRIX PLOT. Here again, sorting is very useful!
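The adjacency matrix and the effect of sorting can be sketched as follows. The example assumes undirected co-occurrence edges (so the matrix is symmetric) and sorts vertices by decreasing degree, which is only one possible ordering; the slides do not say which one was used. The functions `order_by_degree` and `adjacency_matrix` and the toy edges with generic labels are hypothetical.

```python
def order_by_degree(edges):
    """Return vertex names sorted by decreasing degree, ties broken by name."""
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    return sorted(degree, key=lambda name: (-degree[name], name))


def adjacency_matrix(edges, order):
    """Build a symmetric 0/1 matrix whose rows and columns follow order."""
    index = {name: i for i, name in enumerate(order)}
    n = len(order)
    matrix = [[0] * n for _ in range(n)]
    for a, b in edges:
        i, j = index[a], index[b]
        matrix[i][j] = 1
        matrix[j][i] = 1  # co-occurrence is undirected
    return matrix


# Toy edges with generic labels (not the actual character network).
edges = [("D", "A"), ("D", "B"), ("D", "C"), ("B", "C")]
order = order_by_degree(edges)
matrix = adjacency_matrix(edges, order)
```

With the vertices sorted, the most connected ones occupy the first rows and columns, so dense blocks of the matrix become visible instead of being scattered by an arbitrary ordering.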
  • 202. WHAT TO REMEMBER: THERE ARE RULES. Data visualisation is a visual language, so there are: elements of language.
  • 203. WHAT TO REMEMBER: THERE ARE RULES. Data visualisation is a visual language, so there are: elements of language; rules of use (spelling).
  • 204. WHAT TO REMEMBER: THERE ARE RULES. Data visualisation is a visual language, so there are: elements of language; rules of use (spelling); grammar.
  • 205. WHAT TO REMEMBER: A GOOD TECHNIQUE DOES NOT PRECLUDE GOOD COMMON SENSE! Let’s... KISS
  • 206. WHAT TO REMEMBER: A GOOD TECHNIQUE DOES NOT PRECLUDE GOOD COMMON SENSE! Let’s... KISS: Keep It Simple, Stupid!
  • 207. WHAT TO REMEMBER: A GOOD TECHNIQUE DOES NOT PRECLUDE GOOD COMMON SENSE! Let’s... KISS: Keep It Simple, Stupid! Keep It Statistical, Stupid!
  • 208. WHAT TO REMEMBER: A GOOD TECHNIQUE DOES NOT PRECLUDE GOOD COMMON SENSE! Let’s... KISS: Keep It Simple, Stupid! Keep It Statistical, Stupid! Keep It Statistical and Simple!
  • 209. REFERENCES I
    Anscombe, F. J. (1973). Graphs in statistical analysis. The American Statistician, 27(1):17–21.
    Bertin, J. (1970). La graphique. Communications, 15(1):169–185.
    Bertin, J. (1981). Théorie matricielle de la graphique. Communication et langages, 48(1):62–74.
    Bertin, J. (1983). Semiology of graphics, translation from Sémiologie graphique (1967).
    Bertin, J. (2005). Sémiologie graphique : Les diagrammes, les réseaux, les cartes. Les Réimpressions des Éditions de l’École des Hautes Études en Sciences Sociales. Éditions de l’École des Hautes Études en Sciences Sociales.
    Buja, A., Cook, D., Hofmann, H., Lawrence, M., Lee, E.-K., Swayne, D. F., and Wickham, H. (2009). Statistical inference for exploratory data analysis and model diagnostics. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 367(1906):4361–4383.
    Cairo, A. (2012). The Functional Art: An introduction to information graphics and visualization. Voices That Matter. Pearson Education.
  • 210. REFERENCES II
    Chen, C.-h., Härdle, W. K., and Unwin, A. (2007). Handbook of data visualization. Springer Science & Business Media.
    Cleveland, W. S. (1994). The Elements of Graphing Data. Hobart Press, Summit, NJ, 2nd edition.
    Cleveland, W. S. and McGill, R. (1984). Graphical perception: Theory, experimentation, and application to the development of graphical methods. Journal of the American Statistical Association, 79(387):531–554.
    Few, S. (2008). Practical rules for using color in charts. Visual Business Intelligence Newsletter, (11).
    Friendly, M. and Kwan, E. (2012). Comment. Journal of Computational and Graphical Statistics.
    Gelman, A. (2004). Exploratory data analysis for complex models. Journal of Computational and Graphical Statistics, 13(4).
    Gelman, A. (2011). Why tables are really much better than graphs. Journal of Computational and Graphical Statistics, 20(1):3–7.
    Gelman, A., Pasarica, C., and Dodhia, R. (2002). Let’s practice what we preach: turning tables into graphs. The American Statistician, 56(2):121–130.
  • 211. REFERENCES III
    Gelman, A. and Unwin, A. (2011). Visualization, graphics, and statistics. Statistical Computing and Graphics, 22(1):9–12.
    Gordon, I. and Finch, S. (2015). Statistician heal thyself: Have we lost the plot? Journal of Computational and Graphical Statistics, 24(4):1210–1229.
    Healey, C. (2007). Perception in visualization.
    Huff, D. (1993). How to Lie with Statistics. W. W. Norton & Company.
    Munzner, T. (2014). Visualization Analysis and Design. AK Peters Visualization Series. A K Peters/CRC Press, 1st edition.
    Theus, M. and Urbanek, S. (2009). Interactive graphics for data analysis: principles and examples. Series in computer science and data analysis. CRC Press.
    Treisman, A. (1985). Preattentive processing in vision. Computer Vision, Graphics, and Image Processing, 31(2):156–177.
    Tufte, E. R. (2001). The Visual Display of Quantitative Information. Graphics Press, 2nd edition.
    Tukey, J. W. (1977). Exploratory data analysis. Reading, Mass.
    Ware, C. (2012). Information visualization: perception for design. Elsevier.