Developing Reading Machines
Sebastian Riedel

UCL Machine Reading 

Bloomsbury AI
Machine Reading
@riedelcastro
@uclmr
UCL Machine Reading
Research Team at UCL Computer Science
Teaching Machines how to read!
2
Guillaume
Overview
What is Machine Reading?
How is it done?
In the past: knowledge and rules
Today: a lot of data and generic Deep Learning
But data is scarce
How do we get knowledge back in?
4
5
“Which proteins interact with wild-type TES2?”
Machine Reading for Science …
6
“Which proteins interact with wild-type TES2?”
NF-kB is a protein complex that controls the transcription of DNA. ... It plays a key role in regulating the immune response to infection. ... Incorrect regulation of NF-kB has been linked to many diseases and disorders.
read
Machine Reading for Science …
7
“Which proteins interact with wild-type TES2?”
Reading Machine
“TRADD, …”
NF-kB is a protein complex that controls the transcription of DNA. ... It plays a key role in regulating the immune response to infection. ... Incorrect regulation of NF-kB has been linked to many diseases and disorders.
read
For Journalists …
8
“Who is in the upper echelon of the Iranian Military?”
Reading Machine
“Ataollah Salehi, …”
TEHRAN — Iran’s
supreme leader, in an unexpected move, replaced the general in charge of the Iranian armed forces on Tuesday with the general’s deputy, a member of the Islamic Revolutionary Guards Corps…
read
For Everyone
9
“What pension scheme should I use?”
Reading Machine
“A Self-Invested Personal Pension (SIPP)”
It makes sense to put some money away for when you’re older and that’s what pension schemes help you do. You save a little of your income regularly during your working life so you can have an income in later life, when you want to work less or retire.
read
10
Developing Reading Machines: from the 1960s to now
The Reading Machine User
11
NF-kB is a protein complex that controls the transcription of DNA. ... It plays a key role in regulating the immune response to infection. ... Incorrect regulation of NF-kB has been linked to many diseases and disorders.
Reading Machine
“Which proteins interact with wild-type TES2?”
“TRADD, …”
And the Developer
12
“Which proteins interact with wild-type TES2?”
“TRADD, …”
Reading Machine
“I want to build the best reading machine!”
And the Developer
13
Reading Machine
“I want to build the best reading machine!”
And the Developer
14
Reading Machine
1960
1990
“I have a lot of knowledge about language!”
If Protein A is the syntactic subject of an “activates” verb and Protein B is the object, and …
then A interacts with B
…
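A hand-written rule of this kind can be sketched in a few lines of Python. This is only an illustration: the dependency edges are typed in by hand rather than produced by a parser, and the names `extract_interactions`, `ProtA` and `ProtB` are hypothetical.

```python
from typing import List, Set, Tuple

# A dependency edge: (head word, relation label, dependent word).
# Hand-written here; a real system would take these from a parser.
Edge = Tuple[str, str, str]

def extract_interactions(edges: List[Edge], proteins: Set[str]) -> List[Tuple[str, str]]:
    """Apply one rule: the subject and object of an 'activates' verb interact."""
    subjects = [d for h, rel, d in edges if h == "activates" and rel == "nsubj"]
    objects = [d for h, rel, d in edges if h == "activates" and rel == "dobj"]
    return [(a, b) for a in subjects for b in objects
            if a in proteins and b in proteins]

# Hypothetical parse of the made-up sentence "ProtA activates ProtB".
edges = [("activates", "nsubj", "ProtA"), ("activates", "dobj", "ProtB")]
print(extract_interactions(edges, {"ProtA", "ProtB"}))  # [('ProtA', 'ProtB')]
```

Every new verb or construction needs another rule, which is why this style depends on a lot of knowledge about language.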
“I want to build the best reading machine!”
And the Developer
Reading Machine
1990
“I have a lot of knowledge about language!”
2014
15
“I want to build the best reading machine!”
“I have some knowledge about language”
“I also have some data though!”
“I want to build the best reading machine!”
distance < 10
“activates” in between
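The two features named on this slide can be computed as below. This is a minimal sketch: the function name `pair_features`, the whitespace tokenisation and the example sentence are assumptions, and the classifier that would learn weights for these features from data is not shown.

```python
from typing import Dict, List

def pair_features(tokens: List[str], i: int, j: int) -> Dict[str, bool]:
    """Binary features for a candidate protein pair at token positions i < j."""
    between = tokens[i + 1:j]
    return {
        "distance<10": (j - i) < 10,
        "activates_between": "activates" in between,
    }

# Made-up sentence with candidate proteins at positions 0 and 3.
tokens = "ProtA strongly activates ProtB in vitro".split()
print(pair_features(tokens, 0, 3))
# {'distance<10': True, 'activates_between': True}
```

Compared with a hard rule, the developer only proposes cues, and the data decides how much each one counts.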
And the Developer
Reading Machine
2014
16
“I have some knowledge about language”
“I also have some data though!”
“I have no knowledge about language*”
“But I have a high capacity learner…”
*“rather: I don’t want to bother”
“I want to build the best reading machine!”
Encoder
Decoder
“and lots of data!”
Generic Recurrent Neural Network
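To make the encoder/decoder picture concrete, here is a toy, untrained recurrent encoder-decoder in plain Python. Everything in it is an assumption made for illustration (the four-token vocabulary, hidden size, random weights, and greedy decoding); it shows only the data flow, not a trained reading machine.

```python
import math
import random
from typing import List

random.seed(0)
VOCAB = ["<eos>", "a", "b", "c"]  # toy vocabulary (assumption)
H = 8                             # hidden size (arbitrary)

def rand_matrix(rows: int, cols: int) -> List[List[float]]:
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def matvec(m: List[List[float]], v: List[float]) -> List[float]:
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def one_hot(token: str) -> List[float]:
    return [1.0 if t == token else 0.0 for t in VOCAB]

def rnn_step(w_in, w_h, x, h):
    """Elman update: h' = tanh(W_in x + W_h h)."""
    return [math.tanh(a + b) for a, b in zip(matvec(w_in, x), matvec(w_h, h))]

# Untrained, randomly initialised parameters.
enc_in, enc_h = rand_matrix(H, len(VOCAB)), rand_matrix(H, H)
dec_in, dec_h = rand_matrix(H, len(VOCAB)), rand_matrix(H, H)
w_out = rand_matrix(len(VOCAB), H)

def encode(tokens: List[str]) -> List[float]:
    """Fold the input sequence into a single hidden state."""
    h = [0.0] * H
    for tok in tokens:
        h = rnn_step(enc_in, enc_h, one_hot(tok), h)
    return h

def decode(h: List[float], max_len: int = 5) -> List[str]:
    """Unroll from the encoder state, emitting the highest-scoring token."""
    out, prev = [], "<eos>"  # <eos> doubles as the start symbol here
    for _ in range(max_len):
        h = rnn_step(dec_in, dec_h, one_hot(prev), h)
        scores = matvec(w_out, h)
        prev = VOCAB[scores.index(max(scores))]  # greedy choice
        if prev == "<eos>":
            break
        out.append(prev)
    return out

print(decode(encode(["a", "b", "c"])))
```

Nothing in this code knows anything about language; all behaviour would have to come from fitting the weight matrices to data, which is why this approach needs so much of it.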
17
But usually we don’t have lots of data
Reading Machine
2014
18
“I have some knowledge about language”
“I also have some data though!”
“I have no knowledge about language*”
“But I have a high capacity learner…”
“I want to build the best reading machine!”
Encoder
Decoder
“and lots of data!”
2015
Reading Machine
19
“I have no knowledge about language*”
“But I have a high capacity learner…”
Encoder
Decoder
“and lots of data!”
2015
“I want to build the best reading machine!”
“Cody ran to Melanie's farm. The distance is 18 yards from Cody's farm to Melanie's farm. It took Cody 2 hours to get there. How fast did Cody go?”
18/2
~200 training instances
Math Word Problems
Math Word Problems
20
“I have very little data!”
A has X items, (gives away | gets) Y items. How many does (he|she) have now?
X (+|-) Y
“But I know something!”
“Cody ran to Melanie's farm. The distance is 18 yards from Cody's farm to Melanie's farm. It took Cody 2 hours to get there. How fast did Cody go?”
18/2
Reading Machine
Encoder
Decoder
Learning to Generate Training Data
21
Reading
Machine
generates data for
to do well on
A has X items, (gives away | gets) Y items. How many does (he|she) have now?
X (+|-) Y
Data Generation Model
“But I know something!”
“Cody ran to Melanie's farm. The distance is 18 yards from Cody's farm to Melanie's farm. It took Cody 2 hours to get there. How fast did Cody go?”
18/2
“I have very little data!”
defines
Bouchard & Stenetorp, EMNLP 2016
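The template on this slide can be turned into synthetic training pairs with a simple sampler. This sketch only samples from the fixed template; it does not learn the generation model as the slide title describes, and the names, number ranges and the function `generate_examples` are assumptions.

```python
import random
from typing import List, Tuple

def generate_examples(n: int, seed: int = 0) -> List[Tuple[str, str]]:
    """Sample (question, equation) pairs from the add/subtract template."""
    rng = random.Random(seed)
    people = [("Anna", "she"), ("Ben", "he")]  # illustrative names
    examples = []
    for _ in range(n):
        name, pronoun = rng.choice(people)
        x = rng.randint(2, 20)
        verb, op = rng.choice([("gives away", "-"), ("gets", "+")])
        # Keep subtraction results non-negative.
        y = rng.randint(1, x) if op == "-" else rng.randint(1, 20)
        question = (f"{name} has {x} items, {verb} {y} items. "
                    f"How many does {pronoun} have now?")
        examples.append((question, f"{x} {op} {y}"))
    return examples

for question, equation in generate_examples(3):
    print(question, "->", equation)
```

The generated pairs can be mixed with the small set of real problems, so the developer's knowledge reaches the reading machine as extra data.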
How to Generate What?
22
A has X items, (gives away | gets) Y items. How many does (he|she) have now?
X (+|-) Y
[Bar chart, vertical axis 67 to 78: Prev. Work, Data, Knowledge, K+D]
“I have very little data!”
“Cody ran to Melanie's farm. The distance is 18 yards from Cody's farm to Melanie's farm. It took Cody 2 hours to get there. How fast did Cody go?”
18/2
“But I know something!”
Information Extraction
23
Sebastian is a Reader at UCL
Sebastian lives in London
Who does Sebastian work for?
UCL
“Readers are University employees”
“But I know something!”
“I have very little data!”
Reading Machine
Regularisation
24
“Readers are University employees”
“But I know something!”
Knowledge Loss
optimise on
optimise on
convert
Sebastian is a Reader at UCL
Sebastian lives in London
Who does Sebastian work for?
UCL
“I have very little data!”
Reading Machine
Rocktäschel, Demeester, Singh, NAACL 2015, EMNLP 2016
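One simple way to picture a knowledge loss for the rule “Readers are University employees” is a hinge penalty that fires whenever the model scores the rule body higher than the rule head. This is an illustrative assumption, not the formulation from the cited papers; `implication_loss`, `total_loss` and the example scores are hypothetical.

```python
from typing import Iterable, Tuple

def implication_loss(score_body: float, score_head: float) -> float:
    """Penalty for violating body => head, with scores in [0, 1]."""
    return max(0.0, score_body - score_head)

def total_loss(data_loss: float,
               rule_scores: Iterable[Tuple[float, float]],
               weight: float = 1.0) -> float:
    """Data loss plus weighted penalties for each (body, head) score pair."""
    return data_loss + weight * sum(implication_loss(b, h) for b, h in rule_scores)

# Suppose the model scores reader_at(Sebastian, UCL) at 0.9
# but employee_of(Sebastian, UCL) at only 0.4: the rule is violated.
print(implication_loss(0.9, 0.4))
print(total_loss(0.3, [(0.9, 0.4)]))
```

Optimising the combined loss pushes the model towards predictions that respect the rule even for facts never seen in the training data.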
Regularisation
25
“Readers are University employees”
“But I know something!”
[Bar chart, vertical axis 0 to 60: Data, Knowledge, K+D, Ours]
Sebastian is a Reader at UCL
Sebastian lives in London
Who does Sebastian work for?
UCL
“I have very little data!”
Learning to Program
26
Sort animal, pug, dog, mammal
animal, mammal, dog, pug
The program is recursive, and uses an (unknown) comparison function
“But I know something!”
“I have very little data!”
Machine
Learning to Program
27
Sort animal, pug, dog, mammal
animal, mammal, dog, pug
def sort(input):
    ???
    compare(???, ???)
    ???
    sort(???(input))
“But I know something!”
“I have very little data!”
Machine
Knowledge Compilation
28
Sort animal, pug, dog, mammal
animal, mammal, dog, pug
def sort(input):
    ???
    compare(???, ???)
    ???
    sort(???(input))
“But I know something!”
Machine
compile to
optimise on
“I have very little data!”
Bosnjak, Rocktäschel, 2016, arxiv
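The idea of a program sketch with a hole can be illustrated without any neural machinery: fix the recursive structure, leave the comparison open, and pick the comparison that reproduces the single training example. The discrete search below is a stand-in chosen for brevity, not the approach on the slide, which compiles the sketch into a model and optimises it; the candidate functions and the `GENERALITY` table are hypothetical.

```python
from typing import Callable, Dict, List

def sort_with(compare: Callable[[str, str], bool], items: List[str]) -> List[str]:
    """Recursive insertion sort; compare(a, b) is True when a goes before b."""
    if not items:
        return []
    head, rest = items[0], sort_with(compare, items[1:])
    for k, other in enumerate(rest):
        if compare(head, other):
            return rest[:k] + [head] + rest[k:]
    return rest + [head]

# Candidate fillers for the unknown comparison (all hypothetical).
GENERALITY = {"animal": 0, "mammal": 1, "dog": 2, "pug": 3}
CANDIDATES: Dict[str, Callable[[str, str], bool]] = {
    "alphabetical": lambda a, b: a < b,
    "by_length": lambda a, b: len(a) < len(b),
    "by_generality": lambda a, b: GENERALITY[a] < GENERALITY[b],
}

def fit(example_in: List[str], example_out: List[str]) -> List[str]:
    """Return names of candidates that reproduce the single training example."""
    return [name for name, fn in CANDIDATES.items()
            if sort_with(fn, example_in) == example_out]

print(fit(["animal", "pug", "dog", "mammal"], ["animal", "mammal", "dog", "pug"]))
# ['by_generality']
```

Because the recursion is given, one example is enough to pin down the missing piece, which is the point of supplying the sketch as prior knowledge.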
Knowledge Compilation
29
Sort animal, pug, dog, mammal
animal, mammal, dog, pug
“And I have very little data!”
Machine
optimise on
A Neural Program Trace:
Summary
Machine Reading today:
high capacity deep learner & a lot of data
But Machine Reading reality: Not much data
How to inject knowledge into deep learning?
By (generating) data
By regularising model
By compiling knowledge into model structure
30
