SlideShare a Scribd company logo
1 of 33
A Hybrid Diagnosis Approach Combining 
Black-Box and White-Box Reasoning 
Mingmin Chen1 Shizhuo Yu1 
Nico Franz2 Shawn Bowers3 Bertram Ludäscher1 4 
1 Department of Computer Science , University of California, Davis 
2 School of Life Sciences, Arizona State University 
3 Department of Computer Science, Gonzaga University 
4 GSLIS & NCSA, University of Illinois at Urbana-Champaign
Outline 
• The Taxonomy Alignment Problem 
T1 + T2 + A => T3 (ambiguous .. unique .. inconsistent) 
• Model-based Diagnosis [Reiter’87] 
– Black-box 
• Hybrid Approach 
– Black-box & White-box combined 
• Benchmark Results 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 2
Meet Prof. Nico Franz: 
Curator of Insects @ ASU 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 3
What Nico et al. do for a living … 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 4
What Nico does for a living (cont’d): 
The Indoors Part 
• First: go fun places, find new bugs, study them … 
– “Bugs-R-Us” (see taxonbytes.org) 
• Now: Compare, align and revise taxonomies, based on 
careful observation, “character” data, expertise … 
• Formally: 
– Input: T1 + T2 (taxonomies) + A (expert articulations) 
– Output: revised, “merged” taxonomy (-ies) T3 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 5
Taxonomy Alignment Problem (TAP) 
• Given: 
– Taxonomies T1 , T2 
• incl. constraints (coverage, disjointness) 
– Set of articulations (alignment) A 
• Find: 
– Combined (“merged”) taxonomy T3 (= T1 + T2 + A) 
• Is it a taxonomy? Or a DAG? 
– Optional: 
• Final alignment (should be minimal) 
6 
T1 
T2 
T3 
A
TAP: Possible Outcomes 
Input Alignment 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 7
TAP: Possible Outcomes 
Input Alignment 
Inconsistent! 
 Diagnosis (Reiter) 
= Black-Box 
Provenance 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 8
TAP: Possible Outcomes 
Input Alignment 
Inconsistent! 
 Diagnosis (Reiter) 
= Black-Box 
Provenance 
Ambiguous! 
 Multiple 
Possible 
Worlds 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 9
TAP: Possible Outcomes 
Input Alignment 
Inconsistent! 
 Diagnosis (Reiter) 
= Black-Box 
Provenance 
Ambiguous! 
 Multiple 
Possible 
Worlds 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 10
TAP: Possible Outcomes 
Input Alignment 
Inconsistent! 
 Diagnosis (Reiter) 
= Black-Box 
Provenance 
Ambiguous! 
 Multiple 
Possible 
Worlds 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 11
Euler/X Toolkit and Workflow 
• FO reasoning about 
taxonomies (MFOL) 
• Earlier: CleanTax 
– Prover9/Mace4 
• Now: Euler 
– ASP Reasoners (DLV, 
Clingo) 
– Specialized reasoners 
(PyRCC) 
– … 
– X = ASP, RCC, … 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 12
Real-world examples: Turn this … 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 13
… into this! 
(Perellescus Alignment Result) 
• T3 := T1 and T2 are “merged” 
– Blue dashed: overlaps  resolve via “zoom-in view” 
14
Common Outcome: Inconsistency! 
Input Alignment 
Inconsistent! 
 Diagnosis (Reiter) 
= Black-Box 
Provenance 
• Need to debug the 
input articulations  
(black-box) diagnosis! 
• Focus: 
– How do we efficiently 
compute the diagnostic 
lattice? 
• Also: 
– How to visualize.. 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 15
Example Instance 
(from synthetic benchmark suite) 
• Here: N = 10 taxa in T1, T2 
• Euler/X finds: 
inconsistent! 
•  diagnostic lattice of 210 
= 1024 nodes 
 Find minimal inconsistent 
subset (MIS) 
 maximal consistent subset 
(MCS) .. 
 show to user! 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 16
Visualizing Diagnoses 
N = 10 articulations  210 = 1024 node diagnostic lattice 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 17
Better Idea: Just show MIS, MCS 
N = 4 articulations  24 = 16 node diagnostic lattice, 
but 3 MCS and 2 MIS are enough! 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 18
Visualizing Diagnoses 
.. but 4 MCS and 
1 MIC tell it all! 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 19 
1024 node lattice
Visualizing Diagnoses 
Example from 
paper: N=12  
4096 nodes 
.. but 7 MCS and 
5 MIC tell it all! 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 20
Black-Box Inconsistency Analysis 
(Diagnostic Lattice) 
• Then: 
What happens if you 
can’t have all (here: 4) 
articulations together? 
– Repair: find & revise minimal inconsistent subsets (Min-Incons) 
– Expand: find maximal consistent subsets (Max-Cons) & revise outs 
21
Inconsistency Analysis (Diagnostic Lattice) 
• The Min-Incons (MIS) and 
Max-Cons (MCS) sets 
determine all others 
 Repair MIS and/or Expand 
MCS 
• Black-box Analysis (Hitting Set algo.) yields a Diagnosis (lattice) 
– for n=4 articulations, there are 168 possible diagnoses 
– depending on expected “red/green areas”  explore space differently 
• |articulations| = n  
|possible diagnoses| = |monotonic Boolean functions| 
= Dedekind Number (n): 2, 3, 6, 20, 168, 7581, 7828354, ... 
22
Improving Diagnosis 
• Reiter’s “black-box” (model-based) diagnosis 
helps debug the articulations 
• Limited scalability (inherent complexity) 
• But every bit helps: 
– Hitting Set Algorithm (“logarithmic extraction”) 
• Our idea: 
– Exploit “white-box” reasoning information 
 RULES to the rescue 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 23
Key Idea: exploit white-box info 
• We use Answer Set Programming (ASP) to solve 
Taxonomy Alignment Problem (TAP) 
• Inconsistency = “False” is derived in the head: 
False :- <denial of integrity constraint> 
• Apply provenance trick from databases  
– What articulations contribute to a derivation of 
“False” ? 
– Eliminate those that don’t! 
 an example of reusing inferences across separate 
black-box tests! 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 24
The Provenance “Trick” 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 25
Hybrid Provenance 
A3: c < f Black-box Provenance 
Input Alignment 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 26
Hybrid Provenance 
A3: c < f Black-box Provenance 
A1+A2 + … => f < c 
White-box 
Provenance 
Input Alignment 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 27
The Hybrid Approach 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 28
Hybrid Approach 
What articulations 
contribute to some 
inconsistency? 
Good old black-box 
(HST) 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 29
Benchmark 
Results 
• White-box < Hybrid < Black-box (runtimes) 
• Note: white-box does not give you a diagnosis 
• Potassco < DLV 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 30
Benchmark 
DLV 
• White-box < Hybrid < Black-box (runtimes) 
• Potassco < DLV 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 31
Benchmark 
Clingo 
• White-box < Hybrid < Black-box (runtimes) 
• Potassco < DLV 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 32
Conclusions 
• ASP rules can be used to efficiently solve real-world 
taxonomy reasoning problems 
• Reiter’s diagnosis useful to debug inconsistent 
alignments 
• Adding a “white-box” provenance approach 
speeds up state-of-the-art HST algorithm by 
eliminating independent articulations 
• Future work: 
– Further improvements, including parallelism: 
• Trade-off with sharing inferences across parallel instances 
Hybrid Diagnosis in Euler @ RuleML2014, Prague 33

More Related Content

Similar to Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptxRichard Gill
 
Yet another statistical analysis of the data of the ‘loophole free’ experime...
Yet another statistical analysis of the data of the  ‘loophole free’ experime...Yet another statistical analysis of the data of the  ‘loophole free’ experime...
Yet another statistical analysis of the data of the ‘loophole free’ experime...Richard Gill
 
Causally regularized machine learning
Causally regularized machine learningCausally regularized machine learning
Causally regularized machine learningWanjin Yu
 
optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptxRichard Gill
 
20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrisonComputer Science Club
 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsChristian Robert
 
ensembles_emptytemplate_v2
ensembles_emptytemplate_v2ensembles_emptytemplate_v2
ensembles_emptytemplate_v2Shrayes Ramesh
 
Integrated modelling Cape Town
Integrated modelling Cape TownIntegrated modelling Cape Town
Integrated modelling Cape TownBob O'Hara
 
Chi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptxChi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptxZayYa9
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis Baivab Nag
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014PyData
 
ObjRecog2-17 (1).pptx
ObjRecog2-17 (1).pptxObjRecog2-17 (1).pptx
ObjRecog2-17 (1).pptxssuserc074dd
 

Similar to Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning (20)

optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptx
 
Yet another statistical analysis of the data of the ‘loophole free’ experime...
Yet another statistical analysis of the data of the  ‘loophole free’ experime...Yet another statistical analysis of the data of the  ‘loophole free’ experime...
Yet another statistical analysis of the data of the ‘loophole free’ experime...
 
Causally regularized machine learning
Causally regularized machine learningCausally regularized machine learning
Causally regularized machine learning
 
optimizedBell.pptx
optimizedBell.pptxoptimizedBell.pptx
optimizedBell.pptx
 
20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison20130928 automated theorem_proving_harrison
20130928 automated theorem_proving_harrison
 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximations
 
ensembles_emptytemplate_v2
ensembles_emptytemplate_v2ensembles_emptytemplate_v2
ensembles_emptytemplate_v2
 
Chi square test
Chi square test Chi square test
Chi square test
 
Integrated modelling Cape Town
Integrated modelling Cape TownIntegrated modelling Cape Town
Integrated modelling Cape Town
 
2주차
2주차2주차
2주차
 
Phylogenetics2
Phylogenetics2Phylogenetics2
Phylogenetics2
 
HPC_NIST_SHA3
HPC_NIST_SHA3HPC_NIST_SHA3
HPC_NIST_SHA3
 
Chi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptxChi square distribution and analysis of frequencies.pptx
Chi square distribution and analysis of frequencies.pptx
 
Kboom phenoday-2016
Kboom phenoday-2016Kboom phenoday-2016
Kboom phenoday-2016
 
CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)CSC446: Pattern Recognition (LN3)
CSC446: Pattern Recognition (LN3)
 
4646150.ppt
4646150.ppt4646150.ppt
4646150.ppt
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014Low-rank matrix approximations in Python by Christian Thurau PyData 2014
Low-rank matrix approximations in Python by Christian Thurau PyData 2014
 
ObjRecog2-17 (1).pptx
ObjRecog2-17 (1).pptxObjRecog2-17 (1).pptx
ObjRecog2-17 (1).pptx
 
08 entropie
08 entropie08 entropie
08 entropie
 

More from taxonbytes

De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
 
Anzaldo franz 2017 ecn your daily weevil
Anzaldo franz 2017 ecn your daily weevilAnzaldo franz 2017 ecn your daily weevil
Anzaldo franz 2017 ecn your daily weeviltaxonbytes
 
Franz et al 2017 ecn creating and publishing a symbiota based checklist version
Franz et al 2017 ecn creating and publishing a symbiota based checklist versionFranz et al 2017 ecn creating and publishing a symbiota based checklist version
Franz et al 2017 ecn creating and publishing a symbiota based checklist versiontaxonbytes
 
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologyFranz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologytaxonbytes
 
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledge
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledgeFranz 2017 uiuc cirss non unitary syntheses of systematic knowledge
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledgetaxonbytes
 
Franz et al tdwg 2016 new developments for libraries of life
Franz et al tdwg 2016 new developments for libraries of lifeFranz et al tdwg 2016 new developments for libraries of life
Franz et al tdwg 2016 new developments for libraries of lifetaxonbytes
 
Franz et al tdwg 2016 introducing lep net
Franz et al tdwg 2016 introducing lep netFranz et al tdwg 2016 introducing lep net
Franz et al tdwg 2016 introducing lep nettaxonbytes
 
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portals
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portalsFranz et al TDWG 2016 Updates on multiple neotropical symbiota portals
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portalstaxonbytes
 
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...taxonbytes
 
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningFranz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningtaxonbytes
 
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...taxonbytes
 
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...taxonbytes
 
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...taxonbytes
 
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...taxonbytes
 
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...taxonbytes
 
Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable  Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable taxonbytes
 
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...taxonbytes
 
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variableFranz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variabletaxonbytes
 
Zhang Franz ESCJAM 2015 Exophthalmus Reclassification
Zhang Franz ESCJAM 2015 Exophthalmus ReclassificationZhang Franz ESCJAM 2015 Exophthalmus Reclassification
Zhang Franz ESCJAM 2015 Exophthalmus Reclassificationtaxonbytes
 
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity dataFranz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity datataxonbytes
 

More from taxonbytes (20)

De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...
 
Anzaldo franz 2017 ecn your daily weevil
Anzaldo franz 2017 ecn your daily weevilAnzaldo franz 2017 ecn your daily weevil
Anzaldo franz 2017 ecn your daily weevil
 
Franz et al 2017 ecn creating and publishing a symbiota based checklist version
Franz et al 2017 ecn creating and publishing a symbiota based checklist versionFranz et al 2017 ecn creating and publishing a symbiota based checklist version
Franz et al 2017 ecn creating and publishing a symbiota based checklist version
 
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biologyFranz 2017 sols cbs seminar the limits of synthesis for integrative biology
Franz 2017 sols cbs seminar the limits of synthesis for integrative biology
 
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledge
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledgeFranz 2017 uiuc cirss non unitary syntheses of systematic knowledge
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledge
 
Franz et al tdwg 2016 new developments for libraries of life
Franz et al tdwg 2016 new developments for libraries of lifeFranz et al tdwg 2016 new developments for libraries of life
Franz et al tdwg 2016 new developments for libraries of life
 
Franz et al tdwg 2016 introducing lep net
Franz et al tdwg 2016 introducing lep netFranz et al tdwg 2016 introducing lep net
Franz et al tdwg 2016 introducing lep net
 
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portals
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portalsFranz et al TDWG 2016 Updates on multiple neotropical symbiota portals
Franz et al TDWG 2016 Updates on multiple neotropical symbiota portals
 
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...
 
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoningFranz ludaescher tdwg 2016 an update on taxonomic concept reasoning
Franz ludaescher tdwg 2016 an update on taxonomic concept reasoning
 
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...
Franz Zhang et al Weevil Workshop 2016 Neotropical Entiminae Systematics evol...
 
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
Franz et al ice 2016 addressing the name meaning drift challenge in open ende...
 
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...
 
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...
 
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...
Zhang et al evol 2016 beyond otus phylogenetic identification of bacterial sy...
 
Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable  Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable
 
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...
Franz 2016 Phenotype RCN Representing Taxonomy and Phylogeny as Logically Tra...
 
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variableFranz et al 2015 escjam 2015 logic resolution taxonomic variable
Franz et al 2015 escjam 2015 logic resolution taxonomic variable
 
Zhang Franz ESCJAM 2015 Exophthalmus Reclassification
Zhang Franz ESCJAM 2015 Exophthalmus ReclassificationZhang Franz ESCJAM 2015 Exophthalmus Reclassification
Zhang Franz ESCJAM 2015 Exophthalmus Reclassification
 
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity dataFranz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
 

Recently uploaded

Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsNurulAfiqah307317
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...Lokesh Kothari
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 

Recently uploaded (20)

Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 

Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning

  • 1. A Hybrid Diagnosis Approach Combining Black-Box and White-Box Reasoning Mingmin Chen1 Shizhuo Yu1 Nico Franz2 Shawn Bowers3 Bertram Ludäscher1 4 1 Department of Computer Science , University of California, Davis 2 School of Life Sciences, Arizona State University 3 Department of Computer Science, Gonzaga University 4 GSLIS & NCSA, University of Illinois at Urbana-Champaign
  • 2. Outline • The Taxonomy Alignment Problem T1 + T2 + A => T3 (ambiguous .. unique .. inconsistent) • Model-based Diagnosis [Reiter’87] – Black-box • Hybrid Approach – Black-box & White-box combined • Benchmark Results Hybrid Diagnosis in Euler @ RuleML2014, Prague 2
  • 3. Meet Prof. Nico Franz: Curator of Insects @ ASU Hybrid Diagnosis in Euler @ RuleML2014, Prague 3
  • 4. What Nico et al. do for a living … Hybrid Diagnosis in Euler @ RuleML2014, Prague 4
  • 5. What Nico does for a living (cont’d): The Indoors Part • First: go fun places, find new bugs, study them … – “Bugs-R-Us” (see taxonbytes.org) • Now: Compare, align and revise taxonomies, based on careful observation, “character” data, expertise … • Formally: – Input: T1 + T2 (taxonomies) + A (expert articulations) – Output: revised, “merged” taxonomy (-ies) T3 Hybrid Diagnosis in Euler @ RuleML2014, Prague 5
  • 6. Taxonomy Alignment Problem (TAP) • Given: – Taxonomies T1 , T2 • incl. constraints (coverage, disjointness) – Set of articulations (alignment) A • Find: – Combined (“merged”) taxonomy T3 (= T1 + T2 + A) • Is it a taxonomy? Or a DAG? – Optional: • Final alignment (should be minimal) 6 T1 T2 T3 A
  • 7. TAP: Possible Outcomes Input Alignment Hybrid Diagnosis in Euler @ RuleML2014, Prague 7
  • 8. TAP: Possible Outcomes Input Alignment Inconsistent!  Diagnosis (Reiter) = Black-Box Provenance Hybrid Diagnosis in Euler @ RuleML2014, Prague 8
  • 9. TAP: Possible Outcomes Input Alignment Inconsistent!  Diagnosis (Reiter) = Black-Box Provenance Ambiguous!  Multiple Possible Worlds Hybrid Diagnosis in Euler @ RuleML2014, Prague 9
  • 10. TAP: Possible Outcomes Input Alignment Inconsistent!  Diagnosis (Reiter) = Black-Box Provenance Ambiguous!  Multiple Possible Worlds Hybrid Diagnosis in Euler @ RuleML2014, Prague 10
  • 11. TAP: Possible Outcomes Input Alignment Inconsistent!  Diagnosis (Reiter) = Black-Box Provenance Ambiguous!  Multiple Possible Worlds Hybrid Diagnosis in Euler @ RuleML2014, Prague 11
  • 12. Euler/X Toolkit and Workflow • FO reasoning about taxonomies (MFOL) • Earlier: CleanTax – Prover9/Mace4 • Now: Euler – ASP Reasoners (DLV, Clingo) – Specialized reasoners (PyRCC) – … – X = ASP, RCC, … Hybrid Diagnosis in Euler @ RuleML2014, Prague 12
  • 13. Real-world examples: Turn this … Hybrid Diagnosis in Euler @ RuleML2014, Prague 13
  • 14. … into this! (Perellescus Alignment Result) • T3 := T1 and T2 are “merged” – Blue dashed: overlaps  resolve via “zoom-in view” 14
  • 15. Common Outcome: Inconsistency! Input Alignment Inconsistent!  Diagnosis (Reiter) = Black-Box Provenance • Need to debug the input articulations  (black-box) diagnosis! • Focus: – How do we efficiently compute the diagnostic lattice? • Also: – How to visualize.. Hybrid Diagnosis in Euler @ RuleML2014, Prague 15
  • 16. Example Instance (from synthetic benchmark suite) • Here: N = 10 taxa in T1, T2 • Euler/X finds: inconsistent! •  diagnostic lattice of 210 = 1024 nodes  Find minimal inconsistent subset (MIS)  maximal consistent subset (MCS) ..  show to user! Hybrid Diagnosis in Euler @ RuleML2014, Prague 16
  • 17. Visualizing Diagnoses N = 10 articulations  210 = 1024 node diagnostic lattice Hybrid Diagnosis in Euler @ RuleML2014, Prague 17
  • 18. Better Idea: Just show MIS, MCS N = 4 articulations  24 = 16 node diagnostic lattice, but 3 MCS and 2 MIS are enough! Hybrid Diagnosis in Euler @ RuleML2014, Prague 18
  • 19. Visualizing Diagnoses .. but 4 MCS and 1 MIC tell it all! Hybrid Diagnosis in Euler @ RuleML2014, Prague 19 1024 node lattice
  • 20. Visualizing Diagnoses Example from paper: N=12  4096 nodes .. but 7 MCS and 5 MIC tell it all! Hybrid Diagnosis in Euler @ RuleML2014, Prague 20
  • 21. Black-Box Inconsistency Analysis (Diagnostic Lattice) • Then: What happens if you can’t have all (here: 4) articulations together? – Repair: find & revise minimal inconsistent subsets (Min-Incons) – Expand: find maximal consistent subsets (Max-Cons) & revise outs 21
  • 22. Inconsistency Analysis (Diagnostic Lattice) • The Min-Incons (MIS) and Max-Cons (MCS) sets determine all others  Repair MIS and/or Expand MCS • Black-box Analysis (Hitting Set algo.) yields a Diagnosis (lattice) – for n=4 articulations, there are 168 possible diagnoses – depending on expected “red/green areas”  explore space differently • |articulations| = n  |possible diagnoses| = |monotonic Boolean functions| = Dedekind Number (n): 2, 3, 6, 20, 168, 7581, 7828354, ... 22
  • 23. Improving Diagnosis • Reiter’s “black-box” (model-based) diagnosis helps debug the articulations • Limited scalability (inherent complexity) • But every bit helps: – Hitting Set Algorithm (“logarithmic extraction”) • Our idea: – Exploit “white-box” reasoning information  RULES to the rescue Hybrid Diagnosis in Euler @ RuleML2014, Prague 23
  • 24. Key Idea: exploit white-box info • We use Answer Set Programming (ASP) to solve Taxonomy Alignment Problem (TAP) • Inconsistency = “False” is derived in the head: False :- <denial of integrity constraint> • Apply provenance trick from databases  – What articulations contribute to a derivation of “False” ? – Eliminate those that don’t!  an example of reusing inferences across separate black-box tests! Hybrid Diagnosis in Euler @ RuleML2014, Prague 24
  • 25. The Provenance “Trick” Hybrid Diagnosis in Euler @ RuleML2014, Prague 25
  • 26. Hybrid Provenance A3: c < f Black-box Provenance Input Alignment Hybrid Diagnosis in Euler @ RuleML2014, Prague 26
  • 27. Hybrid Provenance A3: c < f Black-box Provenance A1+A2 + … => f < c White-box Provenance Input Alignment Hybrid Diagnosis in Euler @ RuleML2014, Prague 27
  • 28. The Hybrid Approach Hybrid Diagnosis in Euler @ RuleML2014, Prague 28
  • 29. Hybrid Approach What articulations contribute to some inconsistency? Good old black-box (HST) Hybrid Diagnosis in Euler @ RuleML2014, Prague 29
  • 30. Benchmark Results • White-box < Hybrid < Black-box (runtimes) • Note: white-box does not give you a diagnosis • Potassco < DLV Hybrid Diagnosis in Euler @ RuleML2014, Prague 30
  • 31. Benchmark DLV • White-box < Hybrid < Black-box (runtimes) • Potassco < DLV Hybrid Diagnosis in Euler @ RuleML2014, Prague 31
  • 32. Benchmark Clingo • White-box < Hybrid < Black-box (runtimes) • Potassco < DLV Hybrid Diagnosis in Euler @ RuleML2014, Prague 32
  • 33. Conclusions • ASP rules can be used to efficiently solve real-world taxonomy reasoning problems • Reiter’s diagnosis useful to debug inconsistent alignments • Adding a “white-box” provenance approach speeds up state-of-the-art HST algorithm by eliminating independent articulations • Future work: – Further improvements, including parallelism: • Trade-off with sharing inferences across parallel instances Hybrid Diagnosis in Euler @ RuleML2014, Prague 33