SlideShare a Scribd company logo
1 of 70
Download to read offline
Causal Reasoning
and Learning Systems
Presented by:
Elon Portugaly
Microsoft Research / AdCenter
Léon Bottou
Microsoft Research
Joint work with
Jonas Peters Joaquin Quiñonero Candela
Denis Xavier Charles D. Max Chickering
Elon Portugaly Dipankar Ray
Patrice Simard Ed Snelson
I. MOTIVATION
The pesky little ads
Mainline
Sidebar
Multiple feedback loops
User Advertiser
Queries
Ads &
Bids
Ads Prices
Clicks (and consequences)
Learning
ADVERTISER
FEEDBACK LOOP
LEARNING
FEEDBACK LOOP
USER
FEEDBACK
LOOP
Publisher
Learning to run a marketplace
• The learning machine
is not a machine but
is an organization with lots
of people doing stuff and
making decisions
working in the dark
How can we help?
• Goal: improve marketplace machinery such
that its long term revenue is maximal
• Approximate goal by improving multiple
performance measures related to all players
Learning to run a marketplace
• The learning machine
is not a machine but
is an organization with lots
of people doing stuff and
making decisions
working in the dark
How can we help?
• Goal: improve marketplace machinery such
that its long term revenue is maximal
• Approximate goal by improving multiple
performance measures related to all players
• Provide data for decision making
• Automatically optimize parts of the
system
Current methodologies
• Auction theory
– Handles advertiser loop, but not other loops
• Machine learning
– Handles learning loop, but not other loops
• Historical data analysis
– Cannot detect causal effects, therefore:
– Frequently invalid due to Simpson’s paradox
• Controlled experiments
– Powerful, but slow
– Very slow when trying to handle slow feedback loops
Historical data analysis
• Can detect correlations, but not causality
– Does a highly relevant ad in the 1st position
increase the click through rate of the ad in the 2nd
position?
Ad2
Ad1
All
Ad2 CTR
Low
relevance
124/2000
(6.2%)
High
relevance
149/2000
(7.5%)
Historical data analysis
• Can detect correlations, but not causality
– Does a highly relevant ad in the 1st position
increase the click through rate of the ad in the 2nd
position?
Ad2
Ad1
All Low rel High rel
Ad2 CTR
Low
relevance
124/2000
(6.2%)
High
relevance
149/2000
(7.5%)
Ad2
Ad1
All Low rel High rel
Ad2 CTR
Low
relevance
124/2000
(6.2%)
92/1823
(5.0%)
32/177
(18.1%)
High
relevance
149/2000
(7.5%)
71/1534
(4.6%)
78/466
(16.7%)
Historical data analysis
• Can detect correlations, but not causality
– Does a highly relevant ad in the 1st position
increase the click through rate of the ad in the 2nd
position?
• Problem – cannot detect causal effects:
– Simpson’s paradox would lead to the wrong
conclusions most of the time
– Controlling for it requires knowledge of all
confounding factors
Controlled experiments
Comparing ad placement strategies
• Apply alternative treatments to random traffic buckets.
– Randomly replace the 1st ad with ads of higher/lower
relevance
• Wait several days and compare performance metrics.
Issues
• Need full implementation and several days.
• Controlling for slow feedback means experiments must
run even longer
• Need to know your questions in advance.
Outline from here on
II. Causal inference
III. Counterfactual measurements
IV. Algorithmic toolbox
V. Equilibrium analysis
Since we must deal with causation…
II. CAUSAL INFERENCE
What is causation?
Manipulability theories
• “The electric fan spins because the switch is up.”
≈ “If one moves the switch up, the fan will spin.”
• We can carry out experiments and collect data.
“No causation without manipulation” (Rubin, 1986)
Counter-example?
• “The apple falls because the earth attracts it.”
• Describe the corresponding manipulation.
Structural equation model (SEM)
Direct causes / Known and unknown functions
Noise variables / Exogenous variables
Intervention
Interventions as algebraic manipulation of the SEM.
*
NEW Q=𝒇 𝟒
∗
Isolation assumption
What to do with unknown functions?
• Replace knowledge by statistics.
• Statistics need repeated isolated experiments.
• Isolate experiments by assuming an
unknown but invariant joint distribution
for the exogenous variables.
• What about the feedback loops?
𝑃(𝑢, 𝑣)
Markov factorization
A Bayes network is born (Pearl, 1988)
Markov interventions
Many interrelated Bayes networks are born (Pearl, 2000)
– They are interrelated because they share some factors.
– More complex algebraic interventions are of course possible.
*
*
Distribution under
intervention
III. COUNTERFACTUAL
MEASUREMENTS
Counterfactuals
Measuring something that did not happen
“How would the system have performed if, when the data
was collected, we had used 𝑃∗ 𝑞|𝑥, 𝑎 instead of
𝑃 𝑞|𝑥, 𝑎 ? ”
Learning procedure
• Collect data that describes the operation of the system
during a past time period.
• Find changes that would have increased the performance
of the system if they had been applied during the data
collection period.
• Implement and verify…
Replaying past data
OCR Example
• Collect labeled data in existing setup
• Replay the past data to evaluate what the performance
would have been if we had used OCR system θ.
• Requires knowledge of all functions connecting the
point of intervention to the point of measurement.
Zip Code
Scan
Zip Code
Label
OCR Output
Performance
Score
Replaying past data
OCR Example
• Collect labeled data in existing setup
• Replay the past data to evaluate what the performance
would have been if we had used OCR system θ.
• Requires knowledge of all functions connecting the
point of intervention to the point of measurement.
Zip Code
Scan
Zip Code
Label
OCR Output
Performance
Score
OCR Output
Replaying past data
OCR Example
• Collect labeled data in existing setup
• Replay the past data to evaluate what the performance
would have been if we had used OCR system θ.
• Requires knowledge of all functions connecting the
point of intervention to the point of measurement.
Zip Code
Scan
Zip Code
Label
OCR Output
Performance
Score
OCR Output
Replaying past data
OCR Example
• Collect labeled data in existing setup
• Replay the past data to evaluate what the performance
would have been if we had used OCR system θ.
• Requires knowledge of all functions connecting the
point of intervention to the point of measurement.
Zip Code
Scan
Zip Code
Label
OCR Output
Performance
Score
OCR Output
Randomized experiments
Randomly select who to treat with penicillin
• Selection independent of all confounding factors
• Therefore eliminates Simpson’s paradox and allows
Counterfactual estimate
• If we had given penicillin to 𝑥% of the patients,
the success rate would have been
3680
4000
× 𝑥 +
840
6000
× 1 − 𝑥
(not real data)
All Survived Died
Survival
Rate
with drug 4,000 3,680 320 92%
without drug 6,000 840 5,160 14%
Total 10,000 4,520 5,480 45%
Importance sampling
*
*
Distribution under
intervention
• Can we estimate the results of the intervention
counterfactually (without actually performing the
intervention)
– Yes if P and P* are non-deterministic (and close enough)
Importance sampling
Actual expectation
𝑌 = ℓ 𝜔 𝑃(𝜔)
𝜔
Counterfactual expectation
𝑌∗ = ℓ 𝜔 𝑃∗(𝜔)
𝜔
= ℓ 𝜔
𝑃∗(𝜔)
𝑃 𝜔
𝑃(𝜔)
𝜔
≈
1
𝑛
𝑃∗(𝜔𝑖)
𝑃 𝜔𝑖
𝑛
𝑖=1
ℓ 𝜔𝑖
Importance sampling
Principle
• Reweight past examples to emulate the
probability they would have had under the
counterfactual distribution.
𝑤 𝜔𝑖 =
𝑃∗
(𝜔𝑖)
𝑃 𝜔𝑖
=
𝑃∗
(𝑞|𝑥, 𝑎)
𝑃(𝑞|𝑥, 𝑎)
• Only requires the knowledge of the function
under intervention (before and after)
Factors in P* not in P
Factors in P not in P*
Exploration
• Confidence intervals on the counterfactuals
define the level of exploration.
• Successful exploration
= Ability to measure reliable counterfactuals
𝑃(𝜔) 𝑃∗(𝜔)
Quality of the estimation
• Good when distribs overlap
• Bad otherwise
• Note that if either are
deterministic, they do not overlap
Confidence intervals
𝑌∗
= ℓ 𝜔 𝑤 𝜔 𝑃(𝜔)
𝜔
≈
1
𝑛
ℓ 𝜔𝑖 𝑤 𝜔𝑖
𝑛
𝑖=1
Using the central limit theorem?
• 𝑤 𝜔𝑖 very large when 𝑃(𝜔𝑖) small.
• A few samples in poorly explored regions
dominate the sum with their noisy contributions.
• Solution: ignore them.
Confidence intervals (ii)
Zero-clipped weights
𝑤 𝜔 =
𝑤(𝜔) if less than 𝑅,
0 otherwise.
Easier estimate
𝑌∗
= ℓ 𝜔 𝑤 𝜔 𝑃(𝜔)
𝜔
≈
1
𝑛
ℓ 𝜔𝑖 𝑤 𝜔𝑖
𝑛
𝑖=1
Confidence intervals (iii)
Bounding the bias
• Observe 𝑤 𝜔 𝑃(𝜔)𝜔
=
𝑃∗ 𝜔
𝑃 𝜔
𝑃 𝜔 = 1𝜔
.
• Assuming 0 ≤ ℓ 𝜔 ≤ 𝑀 we have
0 ≤ 𝑌∗
− 𝑌∗
= 𝑤 − 𝑤 ℓ(𝜔) 𝑃(
𝜔
𝜔) ≤ 𝑀 𝑤 − 𝑤 𝑃(
𝜔
𝜔)
= 𝑀 1 − 𝑤(𝜔)𝑃(
𝜔
𝜔) ≈ 𝑀 1 −
1
𝑛
𝑤(𝜔𝑖)
𝑛
𝑖=1
• This is easy to estimate because 𝑤(𝜔) is bounded.
• This represents what we miss because of insufficient exploration.
Two-parts confidence interval
Outer confidence interval
• Bounds Y∗
− Y 𝑛
∗
• When this is too large, we must sample more.
Inner confidence interval
• Bounds 𝑌∗
− 𝑌∗
• When this is too large, we must explore more.
Playing with mainline reserves
Mainline reserves (MLRs)
• Thresholds that control whether ads are
displayed in the mainline (north ads)
Randomized bucket
• Random log-normal multiplier applied to MLRs.
• 22M auctions over five weeks
Control buckets
• Same setup with 18% lower mainline reserves
• Same setup without randomization
Playing with mainline reserves (ii)
Inner interval
Outer interval
Control with no
randomization
Control with
18% lower MLR
Playing with mainline reserves (iii)
This is easy to
estimate
Playing with mainline reserves (iv)
Revenue has
always high
variance
More with the same data
Examples
• Estimates for different randomization variance
 Good to determine how much to explore.
• Query-dependent reserves
 Just another counterfactual distribution!
This is the big advantage
• Collect data first, choose question later.
• Randomizing more stuff increases opportunities.
IV. ALGORITHMIC TOOLBOX
Algorithmic toolbox
• Improving the confidence intervals:
– Exploiting causal graph for much better behaved
weights
– Incorporating predictors invariant to the manipulation
• Counterfactual derivatives and optimization
– Counterfactual differences
– Counterfactual derivatives
– Policy gradients
– Optimization (= learning)
Derivatives and optimization
Tuning squashing exponents and reserves
• Ads ranked by decreasing 𝑏𝑖𝑑 × 𝑝𝐶𝑙𝑖𝑐𝑘 𝛼
• Lahaie & McAfee (2011) show that 𝛼 < 1 is good
when click probability estimation gets less accurate.
• Different 𝛼 𝑘 and reserves 𝜌 𝑘 for each query cluster 𝑘.
• Squashing can increase prices.
Optimize advertiser value instead of revenue.
“Making the pie larger instead of our slice of the pie!”
Derivatives and optimization
Objective function
• 𝑉 𝜶, 𝝆 : Lower bound for advertiser value.
• 𝑁 𝜶, 𝝆 : Upper bound for number of mainline ads
max
𝜶,𝝆
𝑉 𝜶, 𝝆 subject to 𝑁 𝜶, 𝝆 < 𝑁0
• We can estimate these functions and their derivatives.
Therefore we can optimize easily.
• The alternative is auction simulation.
Users are then assumed to behave like 𝑝𝐶𝑙𝑖𝑐𝑘 says.
Derivatives and optimization
Variation of the
average number
of mainline ads.
Estimated
advertiser value
(arbitrary units)
Level curves for one particular query cluster
Optimizing counterfactuals = Learning
• Does it generalize?
Yes, we can obtain uniform confidence intervals.
• Sequential design?
Thompson sampling comes naturally in this context.
• Metering exploration wisely?
Inner confidence interval tells how much exploration
we need to answer a counterfactual question.
But it does not tell which questions we should ask.
This was not a problem in practice…
V. EQUILIBRIUM ANALYSIS
Revisiting the feedback loops
Queries
Ads &
Bids
Ads Prices
Clicks (and consequences)
ADVERTISER
FEEDBACK LOOP
LEARNING
FEEDBACK LOOP
USER
FEEDBACK
LOOP
Revisiting the feedback loops
Different time scales
• Auctions happen thousands of times per second.
• Learning feedback: a couple hours.
• User and advertiser feedback: several weeks.
Tracking the equilibrium
• Assume the user and advertiser feedbacks converge to
equilibrium
• “What will have been the total revenue had we
changed MLR and waited until a new equilibrium was
reached”
User feedback
We make the following causal assumptions
• We can measure a quantity 𝑔 that quantifies the
relevance of the displayed ads.
– For instance using human labelers.
• The user reaction can be expressed by the effect
on the click yield of the average relevance 𝑔
experienced by each user in the recent past.
– Realistically, we should also consider an effect on the
number of queries issued by each user. We omit this
for simplicity.
User feedback
Let 𝑌(𝜃, 𝑔) = ℓ(𝜔) 𝑃 𝜃
(𝜔|𝑔)𝜔
and 𝐺 𝜃, 𝑔 = 𝑔 𝑃 𝜃
𝜔 𝑔𝜔
.
Equilibrium condition : 𝐺 = 𝑔.
Total derivative
Total derivatives
d𝑌 =
𝜕𝑌
𝜕𝜃
d𝜃 +
𝜕𝑌
𝜕𝑔
d𝑔
d𝐺 =
𝜕𝐺
𝜕𝜃
d𝜃 +
𝜕𝐺
𝜕𝑔
d𝑔 = d𝑔
Solving gives
d𝑔 =
𝜕𝐺
𝜕𝜃
d𝜃
And substituting gives the answer to our question
d𝑌 =
𝜕𝑌
𝜕𝜃
+
𝜕𝑌
𝜕𝑔
𝜕𝐺
𝜕𝜃
d𝜃
Equilibrium
condition
Zero because of the
graph structure
Estimating derivatives
• Estimate
𝜕𝑌
𝜕𝜃
and
𝜕𝐺
𝜕𝜃
like policy gradients.
• Estimate
𝜕𝑌
𝜕𝑔
as follows:
1. Randomly submit users to different treatments.
2. At some point switch to same treatment and
observe click yield 𝑌 as functions of 𝑔.
Advertiser feedback
Same thing with total derivatives.
For instance, using economics to characterize the equilibrium,
avoiding the extra experiment.
𝑦𝑎, 𝑧 𝑎
𝑦𝑎, 𝑧 𝑎
Pricing theory
Rational advertisers keep Va =
𝜕𝑧 𝑎
𝜕𝑦 𝑎
=
𝜕𝑧 𝑎
𝜕𝑏 𝑎
𝜕𝑦 𝑎
𝜕𝑏 𝑎
constant.
Number of clicks 𝑦𝑎
Totalpaid𝑧𝑎
Pricing curve
Adjusting the bid 𝑏 𝑎
moves 𝑦𝑎, 𝑧 𝑎 on
this curve.
Value curve
Advertiser will not pay
more than this.
Maximum surplus
Best deal for the advertisers.
The slope of the pricing
curve reveals their value.
Estimating values
• At equilibrium 𝑦𝑎 = 𝑌𝑎 and 𝑧 𝑎 = 𝑍 𝑎.
• Therefore we can compute
Va =
𝜕𝑧 𝑎
𝜕𝑏 𝑎
𝜕𝑦𝑎
𝜕𝑏 𝑎
=
𝜕𝑍 𝑎
𝜕𝑏 𝑎
𝜕𝑌𝑎
𝜕𝑏 𝑎
• So vector Φ = …
𝜕𝑍 𝑎
𝜕𝑏 𝑎
− 𝑉𝑎
𝜕𝑌𝑎
𝜕𝑏 𝑎
… = 0
• Then we can use the policy gradient equation.
Advertiser feedback equilibrium
Φ was zero before 𝑑𝜃, then it converges to zero when
equilibrium is returned so:
dΦ =
𝜕Φ
𝜕𝜃
d𝜃 +
𝜕Φ
𝜕𝑏 𝑎
𝑎
d𝑏 𝑎 = 0
• Solve the linear system for
d𝑏 𝑎
d𝜃
• Then answer the counterfactual question
d𝑌 =
𝜕𝑌
𝜕𝜃
+
𝜕𝑌
𝜕𝑏 𝑎
d𝑏 𝑎
d𝜃
𝑎
d𝜃
Equilibrium
condition
Multiple feedback loops
Same procedure:
1. Write total derivatives.
2. Solve the linear system formed
by all the equilibrium conditions.
3. Substitute into the total derivative
of the counterfactual expectation of interest.
VI. CONCLUSION
Main messages
• There are systems in the real world that are too complex to
formally define
– ML can assist humans in running these systems
• Causal inference clarifies many problems
– Ignoring causality => Simpson’s paradox
– Randomness allows inferring causality
• The counterfactual framework is modular
– Randomize in advance, ask later
– Compatible with other methodologies, e.g. optimization using
gradients, equilibrium analysis
ADDITIONAL SLIDES
IV. ALGORITHMIC TOOLBOX
Shifting the reweighting point
• Users make click decisions on the basis of what they see.
• They cannot see the scores, the reserves, the prices, etc.
Shifting the reweighting point
Standard weights
𝑤 𝜔𝑖 =
𝑃∗
(𝜔𝑖)
𝑃 𝜔𝑖
=
𝑃∗
(𝑞|𝑥, 𝑎)
𝑃(𝑞|𝑥, 𝑎)
Shifted weights
𝑤 𝜔𝑖 =
𝑃∗
(𝑠|𝑥, 𝑎, 𝑏)
𝑃(𝑠|𝑥, 𝑎, 𝑏)
with 𝑃⋄
𝑠 𝑥, 𝑎, 𝑏 = 𝑃 𝑠 𝑎, 𝑞, 𝑏 𝑃⋄
(𝑞|𝑥, 𝑎)𝑞
.
Shifting the reweighting point
When can we do this?
• 𝑃⋄
(𝜔) factorizes in the right way iff
1. Reweighting variables intercept every causal path
connecting the point(s) of intervention to the
point of measurement.
2. All functional dependencies between the point(s)
of intervention and the reweighting variables are
known.
Shifting the reweighting point
Experimental validation
• Mainline reserves
Score reweighting Slate reweighting
Invariant predictors
• Some variables 𝑣 not affected by the intervention
have a strong impact on the outcome: time, location, …
• Let 𝜁(𝑣) be an arbitrary predictor of ℓ(𝜔)
𝑌∗ = ℓ 𝜔 − 𝜁(𝜈) 𝑤 𝜔 𝑃 𝜔
𝜔
+ 𝜁 𝜈 𝑃(𝜈)
𝜈
≈
1
𝑛
ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤 𝜔𝑖
𝑛
𝑖=1
+
1
𝑛
𝜁 𝑣𝑖
𝑛
𝑖=1
No multiplier 𝑤(𝜔) for the
variance captured by 𝜁 𝜈 .
Reduced variance if the
predictor 𝜁 𝜈 is any good.
Counterfactual differences
• Which scoring model works best?
• Comparing expectations under counterfactual
distributions 𝑃+(𝜔) and 𝑃∗(𝜔).
𝑌+ − 𝑌∗ = ℓ 𝜔 − 𝜁 𝜈 Δ𝑤 𝜔
𝜔
𝑃 𝜔
≈
1
𝑛
ℓ 𝜔𝑖 − 𝜁(𝜈𝑖) Δ𝑤 𝜔𝑖
𝑛
𝑖=1
with Δ𝑤 𝜔 =
𝑃+ 𝜔
𝑃 𝜔
−
𝑃∗ 𝜔
𝑃 𝜔
Variance captured
by 𝜁 𝜈 is gone!
Counterfactual derivatives
• Counterfactual distribution 𝑃 𝜃 𝜔
𝜕𝑌 𝜃
𝜕𝜃
= ℓ 𝜔 − 𝜁 𝜈 𝑤 𝜃
′
𝜔 𝑃 𝜔
𝜔
≈
1
𝑛
ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤 𝜃
′
𝜔𝑖
𝑛
𝑖=1
with 𝑤 𝜃
′
𝜔 =
𝜕𝑤 𝜃(𝜔)
𝜕𝜃
= 𝑤 𝜃 𝜔
𝜕 log 𝑃 𝜃 𝜔
𝜕𝜃
𝑤 𝜃 𝜔 can be large but there are ways…
Policy gradient
Infinitesimal interventions
• Assuming 𝑃 𝜔 = 𝑃0
𝜔 and using the previous result:
𝜕𝑌 𝜃
𝜕𝜃 𝜃=0
≈
1
𝑛
ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤0
′
𝜔𝑖
𝑛
𝑖=1
with 𝑤0
′
𝜔 =
𝜕𝑤 𝜃 𝜔
𝜕𝜃 𝜃=0
=
𝜕 log 𝑃 𝜃 𝜔
𝜕𝜃
𝜃=0
Potentially large
𝑤 𝜃 𝜔 is gone!

More Related Content

What's hot

Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal Models
Amit Sharma
 
E00 program-level modeling and simulation experiences
E00   program-level modeling and simulation experiencesE00   program-level modeling and simulation experiences
E00 program-level modeling and simulation experiences
therealreverendbayes
 

What's hot (20)

Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
Python and the Holy Grail of Causal Inference - Dennis Ramondt, Huib Keemink
Python and the Holy Grail of Causal Inference - Dennis Ramondt, Huib KeeminkPython and the Holy Grail of Causal Inference - Dennis Ramondt, Huib Keemink
Python and the Holy Grail of Causal Inference - Dennis Ramondt, Huib Keemink
 
sigir2020
sigir2020sigir2020
sigir2020
 
Introduction to Deep Reinforcement Learning
Introduction to Deep Reinforcement LearningIntroduction to Deep Reinforcement Learning
Introduction to Deep Reinforcement Learning
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal Models
 
Proximal Policy Optimization Algorithms, Schulman et al, 2017
Proximal Policy Optimization Algorithms, Schulman et al, 2017Proximal Policy Optimization Algorithms, Schulman et al, 2017
Proximal Policy Optimization Algorithms, Schulman et al, 2017
 
sigir2018tutorial
sigir2018tutorialsigir2018tutorial
sigir2018tutorial
 
Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)
 
Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)Hypothesis Testing: Spread (Compare 1:Standard)
Hypothesis Testing: Spread (Compare 1:Standard)
 
Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)Hypothesis Testing: Relationships (Compare 2+ Factors)
Hypothesis Testing: Relationships (Compare 2+ Factors)
 
Temporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender SystemTemporal Learning and Sequence Modeling for a Job Recommender System
Temporal Learning and Sequence Modeling for a Job Recommender System
 
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021
 
Optimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designsOptimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designs
 
Beyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modelingBeyond Churn Prediction : An Introduction to uplift modeling
Beyond Churn Prediction : An Introduction to uplift modeling
 
Metaheuristics
MetaheuristicsMetaheuristics
Metaheuristics
 
Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)Hypothesis Testing: Spread (Compare 1:1)
Hypothesis Testing: Spread (Compare 1:1)
 
E00 program-level modeling and simulation experiences
E00   program-level modeling and simulation experiencesE00   program-level modeling and simulation experiences
E00 program-level modeling and simulation experiences
 
Can We Automate Predictive Analytics
Can We Automate Predictive AnalyticsCan We Automate Predictive Analytics
Can We Automate Predictive Analytics
 
Satisfaction and loyalty
Satisfaction and loyaltySatisfaction and loyalty
Satisfaction and loyalty
 

Viewers also liked

COM 110 Interview questions
COM 110 Interview questionsCOM 110 Interview questions
COM 110 Interview questions
turnercom
 
COM 110 Ch 10 Group Problem Solving
COM 110 Ch 10 Group Problem SolvingCOM 110 Ch 10 Group Problem Solving
COM 110 Ch 10 Group Problem Solving
turnercom
 
Group problem solving by mihaela-alexandrina cenusa
Group problem solving by mihaela-alexandrina cenusaGroup problem solving by mihaela-alexandrina cenusa
Group problem solving by mihaela-alexandrina cenusa
budur eleonora
 
COM 110 Ch 12 Step 3 Organizing part 2
COM 110 Ch 12 Step 3 Organizing part 2COM 110 Ch 12 Step 3 Organizing part 2
COM 110 Ch 12 Step 3 Organizing part 2
turnercom
 
Chapter 10 - The problem solving process
Chapter 10 - The problem solving processChapter 10 - The problem solving process
Chapter 10 - The problem solving process
Anh Truc Ha Nguyen
 
Categorical syllogism
Categorical syllogismCategorical syllogism
Categorical syllogism
3842
 
Ch03 basic logical_concepts
Ch03 basic logical_conceptsCh03 basic logical_concepts
Ch03 basic logical_concepts
Hariz Mustafa
 

Viewers also liked (15)

Causal Reasoning
Causal ReasoningCausal Reasoning
Causal Reasoning
 
COM 110 Interview questions
COM 110 Interview questionsCOM 110 Interview questions
COM 110 Interview questions
 
COM 110 Ch 10 Group Problem Solving
COM 110 Ch 10 Group Problem SolvingCOM 110 Ch 10 Group Problem Solving
COM 110 Ch 10 Group Problem Solving
 
Group problem solving by mihaela-alexandrina cenusa
Group problem solving by mihaela-alexandrina cenusaGroup problem solving by mihaela-alexandrina cenusa
Group problem solving by mihaela-alexandrina cenusa
 
University Transcript
University TranscriptUniversity Transcript
University Transcript
 
Reasoning unit
Reasoning unitReasoning unit
Reasoning unit
 
COM 110 Ch 12 Step 3 Organizing part 2
COM 110 Ch 12 Step 3 Organizing part 2COM 110 Ch 12 Step 3 Organizing part 2
COM 110 Ch 12 Step 3 Organizing part 2
 
Chapter 10 - The problem solving process
Chapter 10 - The problem solving processChapter 10 - The problem solving process
Chapter 10 - The problem solving process
 
Categorical syllogism
Categorical syllogismCategorical syllogism
Categorical syllogism
 
Ch03 basic logical_concepts
Ch03 basic logical_conceptsCh03 basic logical_concepts
Ch03 basic logical_concepts
 
Group problem solving
Group problem solvingGroup problem solving
Group problem solving
 
Categorical propositions
Categorical propositionsCategorical propositions
Categorical propositions
 
Critical Reasoning
Critical ReasoningCritical Reasoning
Critical Reasoning
 
CATEGORICAL SYLLOGISM
CATEGORICAL SYLLOGISMCATEGORICAL SYLLOGISM
CATEGORICAL SYLLOGISM
 
A Critical Reading
A Critical ReadingA Critical Reading
A Critical Reading
 

Similar to Causal reasoning and Learning Systems

Optimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOEOptimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOE
Yelp Engineering
 
Scott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SFScott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SF
MLconf
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Tech Triveni
 
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
height
 

Similar to Causal reasoning and Learning Systems (20)

Ed Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual AnalysisEd Snelson. Counterfactual Analysis
Ed Snelson. Counterfactual Analysis
 
Optimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOEOptimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOE
 
Scott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SFScott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SF
 
Machine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache SparkMachine Learning with Big Data using Apache Spark
Machine Learning with Big Data using Apache Spark
 
Deep Q-learning from Demonstrations DQfD
Deep Q-learning from Demonstrations DQfDDeep Q-learning from Demonstrations DQfD
Deep Q-learning from Demonstrations DQfD
 
PCA.pptx
PCA.pptxPCA.pptx
PCA.pptx
 
Marketing analytics
Marketing analyticsMarketing analytics
Marketing analytics
 
Scientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talkScientific Revenue USF 2016 talk
Scientific Revenue USF 2016 talk
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text Data
 
Ml ppt at
Ml ppt atMl ppt at
Ml ppt at
 
Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models
 
Six Sigma Overview
Six Sigma OverviewSix Sigma Overview
Six Sigma Overview
 
Credit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research PaperCredit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research Paper
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
 
Random Forest Decision Tree.pptx
Random Forest Decision Tree.pptxRandom Forest Decision Tree.pptx
Random Forest Decision Tree.pptx
 
Analytics for large-scale time series and event data
Analytics for large-scale time series and event dataAnalytics for large-scale time series and event data
Analytics for large-scale time series and event data
 
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
Approximate Dynamic Programming: A New Paradigm for Process Control & Optimiz...
 
Recommendation engine Using Genetic Algorithm
Recommendation engine Using Genetic AlgorithmRecommendation engine Using Genetic Algorithm
Recommendation engine Using Genetic Algorithm
 
Application of Machine Learning in Agriculture
Application of Machine  Learning in AgricultureApplication of Machine  Learning in Agriculture
Application of Machine Learning in Agriculture
 
Propagating Data Policies - A User Study
Propagating Data Policies - A User StudyPropagating Data Policies - A User Study
Propagating Data Policies - A User Study
 

More from Trieu Nguyen

[Notes] Customer 360 Analytics with LEO CDP
[Notes] Customer 360 Analytics with LEO CDP[Notes] Customer 360 Analytics with LEO CDP
[Notes] Customer 360 Analytics with LEO CDP
Trieu Nguyen
 

More from Trieu Nguyen (20)

Building Your Customer Data Platform with LEO CDP in Travel Industry.pdf
Building Your Customer Data Platform with LEO CDP in Travel Industry.pdfBuilding Your Customer Data Platform with LEO CDP in Travel Industry.pdf
Building Your Customer Data Platform with LEO CDP in Travel Industry.pdf
 
Building Your Customer Data Platform with LEO CDP - Spa and Hotel Business
Building Your Customer Data Platform with LEO CDP - Spa and Hotel BusinessBuilding Your Customer Data Platform with LEO CDP - Spa and Hotel Business
Building Your Customer Data Platform with LEO CDP - Spa and Hotel Business
 
Building Your Customer Data Platform with LEO CDP
Building Your Customer Data Platform with LEO CDP Building Your Customer Data Platform with LEO CDP
Building Your Customer Data Platform with LEO CDP
 
How to track and improve Customer Experience with LEO CDP
How to track and improve Customer Experience with LEO CDPHow to track and improve Customer Experience with LEO CDP
How to track and improve Customer Experience with LEO CDP
 
[Notes] Customer 360 Analytics with LEO CDP
[Notes] Customer 360 Analytics with LEO CDP[Notes] Customer 360 Analytics with LEO CDP
[Notes] Customer 360 Analytics with LEO CDP
 
Leo CDP - Pitch Deck
Leo CDP - Pitch DeckLeo CDP - Pitch Deck
Leo CDP - Pitch Deck
 
LEO CDP - What's new in 2022
LEO CDP  - What's new in 2022LEO CDP  - What's new in 2022
LEO CDP - What's new in 2022
 
Lộ trình triển khai LEO CDP cho ngành bất động sản
Lộ trình triển khai LEO CDP cho ngành bất động sảnLộ trình triển khai LEO CDP cho ngành bất động sản
Lộ trình triển khai LEO CDP cho ngành bất động sản
 
Why is LEO CDP important for digital business ?
Why is LEO CDP important for digital business ?Why is LEO CDP important for digital business ?
Why is LEO CDP important for digital business ?
 
From Dataism to Customer Data Platform
From Dataism to Customer Data PlatformFrom Dataism to Customer Data Platform
From Dataism to Customer Data Platform
 
Data collection, processing & organization with USPA framework
Data collection, processing & organization with USPA frameworkData collection, processing & organization with USPA framework
Data collection, processing & organization with USPA framework
 
Part 1: Introduction to digital marketing technology
Part 1: Introduction to digital marketing technologyPart 1: Introduction to digital marketing technology
Part 1: Introduction to digital marketing technology
 
Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?
 
How to build a Personalized News Recommendation Platform
How to build a Personalized News Recommendation PlatformHow to build a Personalized News Recommendation Platform
How to build a Personalized News Recommendation Platform
 
How to grow your business in the age of digital marketing 4.0
How to grow your business  in the age of digital marketing 4.0How to grow your business  in the age of digital marketing 4.0
How to grow your business in the age of digital marketing 4.0
 
Video Ecosystem and some ideas about video big data
Video Ecosystem and some ideas about video big dataVideo Ecosystem and some ideas about video big data
Video Ecosystem and some ideas about video big data
 
Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)
 
Open OTT - Video Content Platform
Open OTT - Video Content PlatformOpen OTT - Video Content Platform
Open OTT - Video Content Platform
 
Apache Hadoop and Spark: Introduction and Use Cases for Data Analysis
Apache Hadoop and Spark: Introduction and Use Cases for Data AnalysisApache Hadoop and Spark: Introduction and Use Cases for Data Analysis
Apache Hadoop and Spark: Introduction and Use Cases for Data Analysis
 
Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)
 

Recently uploaded

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
amitlee9823
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 

Recently uploaded (20)

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 

Causal reasoning and Learning Systems

  • 1. Causal Reasoning and Learning Systems Presented by: Elon Portugaly Microsoft Research / AdCenter Léon Bottou Microsoft Research
  • 2. Joint work with Jonas Peters Joaquin Quiñonero Candela Denis Xavier Charles D. Max Chickering Elon Portugaly Dipankar Ray Patrice Simard Ed Snelson
  • 4. The pesky little ads Mainline Sidebar
  • 5. Multiple feedback loops User Advertiser Queries Ads & Bids Ads Prices Clicks (and consequences) Learning ADVERTISER FEEDBACK LOOP LEARNING FEEDBACK LOOP USER FEEDBACK LOOP Publisher
  • 6. Learning to run a marketplace • The learning machine is not a machine but is an organization with lots of people doing stuff and making decisions working in the dark How can we help? • Goal: improve marketplace machinery such that its long term revenue is maximal • Approximate goal by improving multiple performance measures related to all players
  • 7. Learning to run a marketplace • The learning machine is not a machine but is an organization with lots of people doing stuff and making decisions working in the dark How can we help? • Goal: improve marketplace machinery such that its long term revenue is maximal • Approximate goal by improving multiple performance measures related to all players • Provide data for decision making • Automatically optimize parts of the system
  • 8. Current methodologies • Auction theory – Handles advertiser loop, but not other loops • Machine learning – Handles learning loop, but not other loops • Historical data analysis – Cannot detect causal effects, therefore: – Frequently invalid due to Simpson’s paradox • Controlled experiments – Powerful, but slow – Very slow when trying to handle slow feedback loops
  • 9. Historical data analysis • Can detect correlations, but not causality – Does a highly relevant ad in the 1st position increase the click through rate of the ad in the 2nd position? Ad2 Ad1 All Ad2 CTR Low relevance 124/2000 (6.2%) High relevance 149/2000 (7.5%)
  • 10. Historical data analysis • Can detect correlations, but not causality – Does a highly relevant ad in the 1st position increase the click through rate of the ad in the 2nd position? Ad2 Ad1 All Low rel High rel Ad2 CTR Low relevance 124/2000 (6.2%) High relevance 149/2000 (7.5%) Ad2 Ad1 All Low rel High rel Ad2 CTR Low relevance 124/2000 (6.2%) 92/1823 (5.0%) 32/177 (18.1%) High relevance 149/2000 (7.5%) 71/1534 (4.6%) 78/466 (16.7%)
  • 11. Historical data analysis • Can detect correlations, but not causality – Does a highly relevant ad in the 1st position increase the click through rate of the ad in the 2nd position? • Problem – cannot detect causal effects: – Simpson’s paradox would lead to the wrong conclusions most of the time – Controlling for it requires knowledge of all confounding factors
  • 12. Controlled experiments Comparing ad placement strategies • Apply alternative treatments to random traffic buckets. – Randomly replace the 1st ad with ads of higher/lower relevance • Wait several days and compare performance metrics. Issues • Need full implementation and several days. • Controlling for slow feedback means experiments must run even longer • Need to know your questions in advance.
  • 13. Outline from here on II. Causal inference III. Counterfactual measurements IV. Algorithmic toolbox V. Equilibrium analysis Since we must deal with causation…
  • 15. What is causation? Manipulability theories • “The electric fan spins because the switch is up.” ≈ “If one moves the switch up, the fan will spin.” • We can carry out experiments and collect data. “No causation without manipulation” (Rubin, 1986) Counter-example? • “The apple falls because the earth attracts it.” • Describe the corresponding manipulation.
  • 16. Structural equation model (SEM) Direct causes / Known and unknown functions Noise variables / Exogenous variables
  • 17. Intervention Interventions as algebraic manipulation of the SEM. * NEW Q=𝒇 𝟒 ∗
  • 18. Isolation assumption What to do with unknown functions? • Replace knowledge by statistics. • Statistics need repeated isolated experiments. • Isolate experiments by assuming an unknown but invariant joint distribution for the exogenous variables. • What about the feedback loops? 𝑃(𝑢, 𝑣)
  • 19. Markov factorization A Bayes network is born (Pearl, 1988)
  • 20. Markov interventions Many interrelated Bayes networks are born (Pearl, 2000) – They are interrelated because they share some factors. – More complex algebraic interventions are of course possible. * * Distribution under intervention
  • 22. Counterfactuals Measuring something that did not happen “How would the system have performed if, when the data was collected, we had used 𝑃∗ 𝑞|𝑥, 𝑎 instead of 𝑃 𝑞|𝑥, 𝑎 ? ” Learning procedure • Collect data that describes the operation of the system during a past time period. • Find changes that would have increased the performance of the system if they had been applied during the data collection period. • Implement and verify…
  • 23. Replaying past data OCR Example • Collect labeled data in existing setup • Replay the past data to evaluate what the performance would have been if we had used OCR system θ. • Requires knowledge of all functions connecting the point of intervention to the point of measurement. Zip Code Scan Zip Code Label OCR Output Performance Score
  • 24. Replaying past data OCR Example • Collect labeled data in existing setup • Replay the past data to evaluate what the performance would have been if we had used OCR system θ. • Requires knowledge of all functions connecting the point of intervention to the point of measurement. Zip Code Scan Zip Code Label OCR Output Performance Score OCR Output
  • 25. Replaying past data OCR Example • Collect labeled data in existing setup • Replay the past data to evaluate what the performance would have been if we had used OCR system θ. • Requires knowledge of all functions connecting the point of intervention to the point of measurement. Zip Code Scan Zip Code Label OCR Output Performance Score OCR Output
  • 26. Replaying past data OCR Example • Collect labeled data in existing setup • Replay the past data to evaluate what the performance would have been if we had used OCR system θ. • Requires knowledge of all functions connecting the point of intervention to the point of measurement. Zip Code Scan Zip Code Label OCR Output Performance Score OCR Output
  • 27. Randomized experiments Randomly select who to treat with penicillin • Selection independent of all confounding factors • Therefore eliminates Simpson’s paradox and allows Counterfactual estimate • If we had given penicillin to 𝑥% of the patients, the success rate would have been 3680 4000 × 𝑥 + 840 6000 × 1 − 𝑥 (not real data) All Survived Died Survival Rate with drug 4,000 3,680 320 92% without drug 6,000 840 5,160 14% Total 10,000 4,520 5,480 45%
  • 28. Importance sampling * * Distribution under intervention • Can we estimate the results of the intervention counterfactually (without actually performing the intervention) – Yes if P and P* are non-deterministic (and close enough)
  • 29. Importance sampling Actual expectation 𝑌 = ℓ 𝜔 𝑃(𝜔) 𝜔 Counterfactual expectation 𝑌∗ = ℓ 𝜔 𝑃∗(𝜔) 𝜔 = ℓ 𝜔 𝑃∗(𝜔) 𝑃 𝜔 𝑃(𝜔) 𝜔 ≈ 1 𝑛 𝑃∗(𝜔𝑖) 𝑃 𝜔𝑖 𝑛 𝑖=1 ℓ 𝜔𝑖
  • 30. Importance sampling Principle • Reweight past examples to emulate the probability they would have had under the counterfactual distribution. 𝑤 𝜔𝑖 = 𝑃∗ (𝜔𝑖) 𝑃 𝜔𝑖 = 𝑃∗ (𝑞|𝑥, 𝑎) 𝑃(𝑞|𝑥, 𝑎) • Only requires the knowledge of the function under intervention (before and after) Factors in P* not in P Factors in P not in P*
  • 31. Exploration • Confidence intervals on the counterfactuals define the level of exploration. • Successful exploration = Ability to measure reliable counterfactuals 𝑃(𝜔) 𝑃∗(𝜔) Quality of the estimation • Good when distribs overlap • Bad otherwise • Note that if either are deterministic, they do not overlap
  • 32. Confidence intervals 𝑌∗ = ℓ 𝜔 𝑤 𝜔 𝑃(𝜔) 𝜔 ≈ 1 𝑛 ℓ 𝜔𝑖 𝑤 𝜔𝑖 𝑛 𝑖=1 Using the central limit theorem? • 𝑤 𝜔𝑖 very large when 𝑃(𝜔𝑖) small. • A few samples in poorly explored regions dominate the sum with their noisy contributions. • Solution: ignore them.
  • 33. Confidence intervals (ii) Zero-clipped weights 𝑤 𝜔 = 𝑤(𝜔) if less than 𝑅, 0 otherwise. Easier estimate 𝑌∗ = ℓ 𝜔 𝑤 𝜔 𝑃(𝜔) 𝜔 ≈ 1 𝑛 ℓ 𝜔𝑖 𝑤 𝜔𝑖 𝑛 𝑖=1
  • 34. Confidence intervals (iii) Bounding the bias • Observe 𝑤 𝜔 𝑃(𝜔)𝜔 = 𝑃∗ 𝜔 𝑃 𝜔 𝑃 𝜔 = 1𝜔 . • Assuming 0 ≤ ℓ 𝜔 ≤ 𝑀 we have 0 ≤ 𝑌∗ − 𝑌∗ = 𝑤 − 𝑤 ℓ(𝜔) 𝑃( 𝜔 𝜔) ≤ 𝑀 𝑤 − 𝑤 𝑃( 𝜔 𝜔) = 𝑀 1 − 𝑤(𝜔)𝑃( 𝜔 𝜔) ≈ 𝑀 1 − 1 𝑛 𝑤(𝜔𝑖) 𝑛 𝑖=1 • This is easy to estimate because 𝑤(𝜔) is bounded. • This represents what we miss because of insufficient exploration.
  • 35. Two-parts confidence interval Outer confidence interval • Bounds Y∗ − Y 𝑛 ∗ • When this is too large, we must sample more. Inner confidence interval • Bounds 𝑌∗ − 𝑌∗ • When this is too large, we must explore more.
  • 36. Playing with mainline reserves Mainline reserves (MLRs) • Thresholds that control whether ads are displayed in the mainline (north ads) Randomized bucket • Random log-normal multiplier applied to MLRs. • 22M auctions over five weeks Control buckets • Same setup with 18% lower mainline reserves • Same setup without randomization
  • 37. Playing with mainline reserves (ii) Inner interval Outer interval Control with no randomization Control with 18% lower MLR
  • 38. Playing with mainline reserves (iii) This is easy to estimate
  • 39. Playing with mainline reserves (iv) Revenue has always high variance
  • 40. More with the same data Examples • Estimates for different randomization variance  Good to determine how much to explore. • Query-dependent reserves  Just another counterfactual distribution! This is the big advantage • Collect data first, choose question later. • Randomizing more stuff increases opportunities.
  • 42. Algorithmic toolbox • Improving the confidence intervals: – Exploiting causal graph for much better behaved weights – Incorporating predictors invariant to the manipulation • Counterfactual derivatives and optimization – Counterfactual differences – Counterfactual derivatives – Policy gradients – Optimization (= learning)
  • 43. Derivatives and optimization Tuning squashing exponents and reserves • Ads ranked by decreasing 𝑏𝑖𝑑 × 𝑝𝐶𝑙𝑖𝑐𝑘 𝛼 • Lahaie & McAfee (2011) show that 𝛼 < 1 is good when click probability estimation gets less accurate. • Different 𝛼 𝑘 and reserves 𝜌 𝑘 for each query cluster 𝑘. • Squashing can increase prices. Optimize advertiser value instead of revenue. “Making the pie larger instead of our slice of the pie!”
  • 44. Derivatives and optimization Objective function • 𝑉 𝜶, 𝝆 : Lower bound for advertiser value. • 𝑁 𝜶, 𝝆 : Upper bound for number of mainline ads max 𝜶,𝝆 𝑉 𝜶, 𝝆 subject to 𝑁 𝜶, 𝝆 < 𝑁0 • We can estimate these functions and their derivatives. Therefore we can optimize easily. • The alternative is auction simulation. Users are then assumed to behave like 𝑝𝐶𝑙𝑖𝑐𝑘 says.
  • 45. Derivatives and optimization Variation of the average number of mainline ads. Estimated advertiser value (arbitrary units) Level curves for one particular query cluster
  • 46. Optimizing counterfactuals = Learning • Does it generalize? Yes, we can obtain uniform confidence intervals. • Sequential design? Thompson sampling comes naturally in this context. • Metering exploration wisely? Inner confidence interval tells how much exploration we need to answer a counterfactual question. But it does not tell which questions we should ask. This was not a problem in practice…
  • 48. Revisiting the feedback loops Queries Ads & Bids Ads Prices Clicks (and consequences) ADVERTISER FEEDBACK LOOP LEARNING FEEDBACK LOOP USER FEEDBACK LOOP
  • 49. Revisiting the feedback loops Different time scales • Auctions happen thousands of times per second. • Learning feedback: a couple hours. • User and advertiser feedback: several weeks. Tracking the equilibrium • Assume the user and advertiser feedbacks converge to equilibrium • “What will have been the total revenue had we changed MLR and waited until a new equilibrium was reached”
  • 50. User feedback We make the following causal assumptions • We can measure a quantity 𝑔 that quantifies the relevance of the displayed ads. – For instance using human labelers. • The user reaction can be expressed by the effect on the click yield of the average relevance 𝑔 experienced by each user in the recent past. – Realistically, we should also consider an effect on the number of queries issued by each user. We omit this for simplicity.
  • 51. User feedback Let 𝑌(𝜃, 𝑔) = ℓ(𝜔) 𝑃 𝜃 (𝜔|𝑔)𝜔 and 𝐺 𝜃, 𝑔 = 𝑔 𝑃 𝜃 𝜔 𝑔𝜔 . Equilibrium condition : 𝐺 = 𝑔.
  • 52. Total derivative Total derivatives d𝑌 = 𝜕𝑌 𝜕𝜃 d𝜃 + 𝜕𝑌 𝜕𝑔 d𝑔 d𝐺 = 𝜕𝐺 𝜕𝜃 d𝜃 + 𝜕𝐺 𝜕𝑔 d𝑔 = d𝑔 Solving gives d𝑔 = 𝜕𝐺 𝜕𝜃 d𝜃 And substituting gives the answer to our question d𝑌 = 𝜕𝑌 𝜕𝜃 + 𝜕𝑌 𝜕𝑔 𝜕𝐺 𝜕𝜃 d𝜃 Equilibrium condition Zero because of the graph structure
  • 53. Estimating derivatives • Estimate 𝜕𝑌 𝜕𝜃 and 𝜕𝐺 𝜕𝜃 like policy gradients. • Estimate 𝜕𝑌 𝜕𝑔 as follows: 1. Randomly submit users to different treatments. 2. At some point switch to same treatment and observe click yield 𝑌 as functions of 𝑔.
  • 54. Advertiser feedback Same thing with total derivatives. For instance, using economics to characterize the equilibrium, avoiding the extra experiment. 𝑦𝑎, 𝑧 𝑎 𝑦𝑎, 𝑧 𝑎
  • 55. Pricing theory Rational advertisers keep Va = 𝜕𝑧 𝑎 𝜕𝑦 𝑎 = 𝜕𝑧 𝑎 𝜕𝑏 𝑎 𝜕𝑦 𝑎 𝜕𝑏 𝑎 constant. Number of clicks 𝑦𝑎 Totalpaid𝑧𝑎 Pricing curve Adjusting the bid 𝑏 𝑎 moves 𝑦𝑎, 𝑧 𝑎 on this curve. Value curve Advertiser will not pay more than this. Maximum surplus Best deal for the advertisers. The slope of the pricing curve reveals their value.
  • 56. Estimating values • At equilibrium 𝑦𝑎 = 𝑌𝑎 and 𝑧 𝑎 = 𝑍 𝑎. • Therefore we can compute Va = 𝜕𝑧 𝑎 𝜕𝑏 𝑎 𝜕𝑦𝑎 𝜕𝑏 𝑎 = 𝜕𝑍 𝑎 𝜕𝑏 𝑎 𝜕𝑌𝑎 𝜕𝑏 𝑎 • So vector Φ = … 𝜕𝑍 𝑎 𝜕𝑏 𝑎 − 𝑉𝑎 𝜕𝑌𝑎 𝜕𝑏 𝑎 … = 0 • Then we can use the policy gradient equation.
  • 57. Advertiser feedback equilibrium Φ was zero before 𝑑𝜃, then it converges to zero when equilibrium is returned so: dΦ = 𝜕Φ 𝜕𝜃 d𝜃 + 𝜕Φ 𝜕𝑏 𝑎 𝑎 d𝑏 𝑎 = 0 • Solve the linear system for d𝑏 𝑎 d𝜃 • Then answer the counterfactual question d𝑌 = 𝜕𝑌 𝜕𝜃 + 𝜕𝑌 𝜕𝑏 𝑎 d𝑏 𝑎 d𝜃 𝑎 d𝜃 Equilibrium condition
  • 58. Multiple feedback loops Same procedure: 1. Write total derivatives. 2. Solve the linear system formed by all the equilibrium conditions. 3. Substitute into the total derivative of the counterfactual expectation of interest.
  • 60. Main messages • There are systems in the real world that are too complex to formally define – ML can assist humans in running these systems • Causal inference clarifies many problems – Ignoring causality => Simpson’s paradox – Randomness allows inferring causality • The counterfactual framework is modular – Randomize in advance, ask later – Compatible with other methodologies, e.g. optimization using gradients, equilibrium analysis
  • 63. Shifting the reweighting point • Users make click decisions on the basis of what they see. • They cannot see the scores, the reserves, the prices, etc.
  • 64. Shifting the reweighting point Standard weights 𝑤 𝜔𝑖 = 𝑃∗ (𝜔𝑖) 𝑃 𝜔𝑖 = 𝑃∗ (𝑞|𝑥, 𝑎) 𝑃(𝑞|𝑥, 𝑎) Shifted weights 𝑤 𝜔𝑖 = 𝑃∗ (𝑠|𝑥, 𝑎, 𝑏) 𝑃(𝑠|𝑥, 𝑎, 𝑏) with 𝑃⋄ 𝑠 𝑥, 𝑎, 𝑏 = 𝑃 𝑠 𝑎, 𝑞, 𝑏 𝑃⋄ (𝑞|𝑥, 𝑎)𝑞 .
  • 65. Shifting the reweighting point When can we do this? • 𝑃⋄ (𝜔) factorizes in the right way iff 1. Reweighting variables intercept every causal path connecting the point(s) of intervention to the point of measurement. 2. All functional dependencies between the point(s) of intervention and the reweighting variables are known.
  • 66. Shifting the reweighting point Experimental validation • Mainline reserves Score reweighting Slate reweighting
  • 67. Invariant predictors • Some variables 𝑣 not affected by the intervention have a strong impact on the outcome: time, location, … • Let 𝜁(𝑣) be an arbitrary predictor of ℓ(𝜔) 𝑌∗ = ℓ 𝜔 − 𝜁(𝜈) 𝑤 𝜔 𝑃 𝜔 𝜔 + 𝜁 𝜈 𝑃(𝜈) 𝜈 ≈ 1 𝑛 ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤 𝜔𝑖 𝑛 𝑖=1 + 1 𝑛 𝜁 𝑣𝑖 𝑛 𝑖=1 No multiplier 𝑤(𝜔) for the variance captured by 𝜁 𝜈 . Reduced variance if the predictor 𝜁 𝜈 is any good.
  • 68. Counterfactual differences • Which scoring model works best? • Comparing expectations under counterfactual distributions 𝑃+(𝜔) and 𝑃∗(𝜔). 𝑌+ − 𝑌∗ = ℓ 𝜔 − 𝜁 𝜈 Δ𝑤 𝜔 𝜔 𝑃 𝜔 ≈ 1 𝑛 ℓ 𝜔𝑖 − 𝜁(𝜈𝑖) Δ𝑤 𝜔𝑖 𝑛 𝑖=1 with Δ𝑤 𝜔 = 𝑃+ 𝜔 𝑃 𝜔 − 𝑃∗ 𝜔 𝑃 𝜔 Variance captured by 𝜁 𝜈 is gone!
  • 69. Counterfactual derivatives • Counterfactual distribution 𝑃 𝜃 𝜔 𝜕𝑌 𝜃 𝜕𝜃 = ℓ 𝜔 − 𝜁 𝜈 𝑤 𝜃 ′ 𝜔 𝑃 𝜔 𝜔 ≈ 1 𝑛 ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤 𝜃 ′ 𝜔𝑖 𝑛 𝑖=1 with 𝑤 𝜃 ′ 𝜔 = 𝜕𝑤 𝜃(𝜔) 𝜕𝜃 = 𝑤 𝜃 𝜔 𝜕 log 𝑃 𝜃 𝜔 𝜕𝜃 𝑤 𝜃 𝜔 can be large but there are ways…
  • 70. Policy gradient Infinitesimal interventions • Assuming 𝑃 𝜔 = 𝑃0 𝜔 and using the previous result: 𝜕𝑌 𝜃 𝜕𝜃 𝜃=0 ≈ 1 𝑛 ℓ 𝜔𝑖 − 𝜁 𝜈𝑖 𝑤0 ′ 𝜔𝑖 𝑛 𝑖=1 with 𝑤0 ′ 𝜔 = 𝜕𝑤 𝜃 𝜔 𝜕𝜃 𝜃=0 = 𝜕 log 𝑃 𝜃 𝜔 𝜕𝜃 𝜃=0 Potentially large 𝑤 𝜃 𝜔 is gone!