SlideShare a Scribd company logo
1 of 13
Download to read offline
Extracting information
  from clinical notes

  H. Yang, I. Spasic, F. Sarafraz,
  John A. Keane, Goran Nenadic


     School of Computer Science
      University of Manchester
Motivation & aim
 Electronic clinical notes
    electronic medical/health records
    hospital discharge summaries
 Extract information on
    individual patients and their diseases
    clinical practice
      treatments, drugs used, etc.
 Aim: support data analytics
       e.g. monitoring quality
 Huge interest locally and internationally
Clinical notes
 Highly condensed text
    sometimes without proper sentences
    hospital discharge summaries are more structured
    list of medications, symptoms, etc.


 Terminological variability
    orthographic, acronyms, local conventions


 Various sections
    previous history, social/family background
Health care  special interest-i2b2
NLP challenges in clinical data
 A series of international challenges in information
  extraction from clinical narratives
      organisers: Informatics for Integrating Biology & the
       Bedside (i2b2)

 3 shared tasks so far
   −   De-identification of medical records and identification of
       smokers from their clinical records (2007)
       Identification of obesity & related diseases in patients from
       hospital discharge documents (2008)
       Extraction of medications and related information from
       patients’ discharge documents (2009)

 2010 challenge
      concept, assertions, relations
i2b2 2008
 Extract status of diseases in patients
       obesity, diabetes mellitus, hypercholesterolemia,
        hypertriglyceridemia, hypertension, heart failure (16 in total)
       status: yes, no, unmentioned, questionable
       on textual and “intuitive” level

 28 teams worldwide
       UoM ranked 1st in textual and 7th in intuitive

 Our methodology
       Term-based exact and approximate matching
       Context-based pattern- and rule-based matching
       Machine learning approach


Yang, H., Spasic, I., Keane, J., Nenadic, G.: A Text Mining Approach to the Prediction of a
Disease Status from Clinical Discharge Summaries, JAMIA 16(4):596-600
Methodology
                    Linguistic      section splitting, sentence splitting,
                 pre-processing     chunking, POS tagging, parsing




                   Information        textual evidence extraction,
                    extraction        section filtering, morphological
  Medical
                 (rules, machine      clues (e.g. drug/disease name
 resources
                     learning)        affixes)

•Disease names
•Drug names
•Body parts                        Template filling, filtering negative
•Symptoms                          results, relations and heuristics:
•Abbreviations    Constructing             Organ : Symptom,
•Synonyms           results                Symptom : Disease,
                                           Disease : Drug,
                                           Drug : Mode of application
Rule-based IE
 Disease status patterns
 - context-based patterns
   [N] negative for CHF
   [Q] question of asthma
   [U] no known diagnosis of CAD
   [U] we should consider further asthma studies as an
   outpatient

 - semantics-based patterns
   [N] normal coronaries, a thin black man

 Clinical resources used in sentence extraction
    clinical inference rules e.g., weight>90kg,
     LDL>160mg/dl, HDL<35mg/dl
    medications e.g., ‘anti-depressant’
Textual Annotation Results

 Performance on Disease Status (Ranked 1st)
Micro-average: Accuracy (0.9723)
Macro-average: P (0.8482), R (0.7737), F-score (0.8052)



      #Eval   #Corr   #Gold   Precision   Recall   F-score

  Y   2267    2132    2192    0.9404      0.9726   0.9562

  N   56      40      65      0.7142      0.6153   0.6611

  Q   12      9       17      0.7500      0.5294   0.6206

  U   5709    5640    5770    0.9879      0.9774   0.9826
Intuitive Annotation Results

 Performance on Disease Status (Ranked 7th)
Micro-average: Accuracy (0.9572)
Macro-average: P (0.6383), R (0.6294), F-score (0.6336)




      #Eval   #Corr   #Gold    Precision   Recall   F-Score

  Y   2160    2068    2285     0.9574      0.9050   0.9304

  N   5236    5014    5100     0.9576      0.9831   0.9702

  Q   3       0       14       0           0        0
i2b2 2009
 Extract mentions of medication and related
  information
   drugs the patient takes
   dose, mode of application, frequency, duration, etc.
    (for each mention)
 19 teams worldwide
   UoM ranked 3rd
 Our approach was based on combining
   extensive dictionaries
   morphological and derivational patterns
Evaluation (F-measure)


              Medication                              83.59%
              Dosage                                  82.67%
              Frequency                               83.49%
              Mode                                    85.33%
              Duration                                51.00%
              Reason                                  38.81%

              All fields                              78.47%




Spasić I, Sarafraz F, Keane JA, Nenadic G: “Medication Information Extraction
  with Linguistic Pattern Matching and Semantic Rules”, JAMIA (to appear)
Summary
 NLP and text mining techniques are useful for extraction
  of clinical data
  - disease status extraction: 95-97% accuracy
  - medication information extraction: 80% F-measure

 Construction of reliable and sufficient resources
  - clinical terms and abbreviations (e.g., disease synonyms,
   symptoms, drugs)
  - context patterns related to diseases, medication, etc.

 Domain knowledge required
      construction of domain- and task-specific resources
      complex clinical facts and conditions for inference
        more comprehensive knowledge representation needed

More Related Content

What's hot

Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...ScHARR HEDS
 
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationProcess Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationSanjana Nair
 
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Rodrigo Vargas Zapana
 
Otol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittOtol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittMichael (Mick) Merritt
 
Prescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsPrescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsSatish Veerla
 
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaAccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaIMSHealthRWES
 
The Envisia Genomic Classifier
The Envisia Genomic ClassifierThe Envisia Genomic Classifier
The Envisia Genomic ClassifierPhil J. Morrison
 
Consenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSConsenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSFlávia Salame
 
Chapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesChapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesNilesh Kucha
 
Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...home
 
Nursesí practices and perception of delirium in the intensive care units of ...
Nursesí  practices and perception of delirium in the intensive care units of ...Nursesí  practices and perception of delirium in the intensive care units of ...
Nursesí practices and perception of delirium in the intensive care units of ...Alexander Decker
 

What's hot (20)

Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
 
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationProcess Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
 
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
 
Otol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittOtol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-Merritt
 
London 21.11.2008
London 21.11.2008London 21.11.2008
London 21.11.2008
 
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
 
Journal of Immune Research
Journal of Immune Research Journal of Immune Research
Journal of Immune Research
 
Prescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsPrescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage Systems
 
Annotation Editorial
Annotation EditorialAnnotation Editorial
Annotation Editorial
 
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaAccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
 
The Envisia Genomic Classifier
The Envisia Genomic ClassifierThe Envisia Genomic Classifier
The Envisia Genomic Classifier
 
Informed consent
Informed consentInformed consent
Informed consent
 
Bio 152 Paper
Bio 152 PaperBio 152 Paper
Bio 152 Paper
 
Aaa
AaaAaa
Aaa
 
Nódulos pulmonares
Nódulos pulmonares Nódulos pulmonares
Nódulos pulmonares
 
Consenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSConsenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATS
 
Chapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesChapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responses
 
Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...
 
159th publication jamdsr- 3rd name
159th publication  jamdsr- 3rd name159th publication  jamdsr- 3rd name
159th publication jamdsr- 3rd name
 
Nursesí practices and perception of delirium in the intensive care units of ...
Nursesí  practices and perception of delirium in the intensive care units of ...Nursesí  practices and perception of delirium in the intensive care units of ...
Nursesí practices and perception of delirium in the intensive care units of ...
 

Viewers also liked

Viewers also liked (19)

the_life_cycle_of_a_wireframe
the_life_cycle_of_a_wireframethe_life_cycle_of_a_wireframe
the_life_cycle_of_a_wireframe
 
BioNLP09 Winners
BioNLP09 WinnersBioNLP09 Winners
BioNLP09 Winners
 
Eoy
EoyEoy
Eoy
 
Rosario Hearst
Rosario HearstRosario Hearst
Rosario Hearst
 
Language
LanguageLanguage
Language
 
Crf
CrfCrf
Crf
 
Edu2
Edu2Edu2
Edu2
 
Susan Gray
Susan GraySusan Gray
Susan Gray
 
Workshop negations
Workshop negationsWorkshop negations
Workshop negations
 
Bionlp09
Bionlp09Bionlp09
Bionlp09
 
I2b209
I2b209I2b209
I2b209
 
Edu
EduEdu
Edu
 
Artspoken.com
Artspoken.comArtspoken.com
Artspoken.com
 
Six Month
Six MonthSix Month
Six Month
 
Tinsleys 7 Accomplishments
Tinsleys 7 AccomplishmentsTinsleys 7 Accomplishments
Tinsleys 7 Accomplishments
 
Nacsa úJ 4.1 Jav.
Nacsa úJ 4.1 Jav.Nacsa úJ 4.1 Jav.
Nacsa úJ 4.1 Jav.
 
Defense
DefenseDefense
Defense
 
Olivia Contradictions
Olivia ContradictionsOlivia Contradictions
Olivia Contradictions
 
Ambiguity
AmbiguityAmbiguity
Ambiguity
 

Similar to Health care special interest-i2b2

Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsKarin Verspoor
 
nuevos criterios de sepsis
nuevos criterios de sepsisnuevos criterios de sepsis
nuevos criterios de sepsisVeronica Dubay
 
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...Crimsonpublishers-Rehabilitation
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learningEman Abdelrazik
 
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTSess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTguestfbf1e1
 
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...lawrenceanchah
 
iOMICS Clinical & Omnia
iOMICS Clinical & OmniaiOMICS Clinical & Omnia
iOMICS Clinical & OmniaInterpretOmics
 
Metanalisis tratamientos ttm
Metanalisis tratamientos ttmMetanalisis tratamientos ttm
Metanalisis tratamientos ttmReynold Muñoz
 
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlAnalysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlHealth Informatics New Zealand
 
EmergencyMedicine Research
EmergencyMedicine ResearchEmergencyMedicine Research
EmergencyMedicine Researchzybernav
 
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Nelson Hendler
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxssuser6b571f
 
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesCentral mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesMNTan1
 
Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Lucia Tacanga
 
TADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsTADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsHealth Informatics New Zealand
 

Similar to Health care special interest-i2b2 (20)

Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questions
 
nuevos criterios de sepsis
nuevos criterios de sepsisnuevos criterios de sepsis
nuevos criterios de sepsis
 
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
 
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTSess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
 
Cavernoma JC
Cavernoma JCCavernoma JC
Cavernoma JC
 
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
 
iOMICS Clinical & Omnia
iOMICS Clinical & OmniaiOMICS Clinical & Omnia
iOMICS Clinical & Omnia
 
Metanalisis tratamientos ttm
Metanalisis tratamientos ttmMetanalisis tratamientos ttm
Metanalisis tratamientos ttm
 
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlAnalysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
 
EmergencyMedicine Research
EmergencyMedicine ResearchEmergencyMedicine Research
EmergencyMedicine Research
 
Emergency Medicine Research
Emergency Medicine ResearchEmergency Medicine Research
Emergency Medicine Research
 
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptx
 
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesCentral mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
 
EMRs: Meaningful Use and Research
EMRs: Meaningful Use and ResearchEMRs: Meaningful Use and Research
EMRs: Meaningful Use and Research
 
Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Nejm early goal shock septico 2019
Nejm early goal shock septico 2019
 
CIBM
CIBMCIBM
CIBM
 
TADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsTADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for Anaesthetists
 
20150300.0 00027
20150300.0 0002720150300.0 00027
20150300.0 00027
 

Recently uploaded

UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1DianaGray10
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8DianaGray10
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1DianaGray10
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.YounusS2
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarPrecisely
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024SkyPlanner
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxMatsuo Lab
 
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemBird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemAsko Soukka
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Brian Pichman
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopBachir Benyammi
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfJamie (Taka) Wang
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Commit University
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxUdaiappa Ramachandran
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationIES VE
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IES VE
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding TeamAdam Moalla
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaborationbruanjhuli
 

Recently uploaded (20)

UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptx
 
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemBird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystem
 
Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )Building Your Own AI Instance (TBLC AI )
Building Your Own AI Instance (TBLC AI )
 
NIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 WorkshopNIST Cybersecurity Framework (CSF) 2.0 Workshop
NIST Cybersecurity Framework (CSF) 2.0 Workshop
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)Crea il tuo assistente AI con lo Stregatto (open source python framework)
Crea il tuo assistente AI con lo Stregatto (open source python framework)
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptx
 
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve DecarbonizationUsing IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
Using IESVE for Loads, Sizing and Heat Pump Modeling to Achieve Decarbonization
 
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team9 Steps For Building Winning Founding Team
9 Steps For Building Winning Founding Team
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
 

Health care special interest-i2b2

  • 1. Extracting information from clinical notes H. Yang, I. Spasic, F. Sarafraz, John A. Keane, Goran Nenadic School of Computer Science University of Manchester
  • 2. Motivation & aim  Electronic clinical notes  electronic medical/health records  hospital discharge summaries  Extract information on  individual patients and their diseases  clinical practice  treatments, drugs used, etc.  Aim: support data analytics  e.g. monitoring quality  Huge interest locally and internationally
  • 3. Clinical notes  Highly condensed text  sometimes without proper sentences  hospital discharge summaries are more structured  list of medications, symptoms, etc.  Terminological variability  orthographic, acronyms, local conventions  Various sections  previous history, social/family background
  • 5. NLP challenges in clinical data  A series of international challenges in information extraction from clinical narratives  organisers: Informatics for Integrating Biology & the Bedside (i2b2)  3 shared tasks so far − De-identification of medical records and identification of smokers from their clinical records (2007) Identification of obesity & related diseases in patients from hospital discharge documents (2008) Extraction of medications and related information from patients’ discharge documents (2009)  2010 challenge  concept, assertions, relations
  • 6. i2b2 2008  Extract status of diseases in patients  obesity, diabetes mellitus, hypercholesterolemia, hypertriglyceridemia, hypertension, heart failure (16 in total)  status: yes, no, unmentioned, questionable  on textual and “intuitive” level  28 teams worldwide  UoM ranked 1st in textual and 7th in intuitive  Our methodology  Term-based exact and approximate matching  Context-based pattern- and rule-based matching  Machine learning approach Yang, H., Spasic, I., Keane, J., Nenadic, G.: A Text Mining Approach to the Prediction of a Disease Status from Clinical Discharge Summaries, JAMIA 16(4):596-600
  • 7. Methodology Linguistic section splitting, sentence splitting, pre-processing chunking, POS tagging, parsing Information textual evidence extraction, extraction section filtering, morphological Medical (rules, machine clues (e.g. drug/disease name resources learning) affixes) •Disease names •Drug names •Body parts Template filling, filtering negative •Symptoms results, relations and heuristics: •Abbreviations Constructing Organ : Symptom, •Synonyms results Symptom : Disease, Disease : Drug, Drug : Mode of application
  • 8. Rule-based IE  Disease status patterns - context-based patterns [N] negative for CHF [Q] question of asthma [U] no known diagnosis of CAD [U] we should consider further asthma studies as an outpatient - semantics-based patterns [N] normal coronaries, a thin black man  Clinical resources used in sentence extraction  clinical inference rules e.g., weight>90kg, LDL>160mg/dl, HDL<35mg/dl  medications e.g., ‘anti-depressant’
  • 9. Textual Annotation Results  Performance on Disease Status (Ranked 1st) Micro-average: Accuracy (0.9723) Macro-average: P (0.8482), R (0.7737), F-score (0.8052) #Eval #Corr #Gold Precision Recall F-score Y 2267 2132 2192 0.9404 0.9726 0.9562 N 56 40 65 0.7142 0.6153 0.6611 Q 12 9 17 0.7500 0.5294 0.6206 U 5709 5640 5770 0.9879 0.9774 0.9826
  • 10. Intuitive Annotation Results  Performance on Disease Status (Ranked 7th) Micro-average: Accuracy (0.9572) Macro-average: P (0.6383), R (0.6294), F-score (0.6336) #Eval #Corr #Gold Precision Recall F-Score Y 2160 2068 2285 0.9574 0.9050 0.9304 N 5236 5014 5100 0.9576 0.9831 0.9702 Q 3 0 14 0 0 0
  • 11. i2b2 2009  Extract mentions of medication and related information  drugs the patient takes  dose, mode of application, frequency, duration, etc. (for each mention)  19 teams worldwide  UoM ranked 3rd  Our approach was based on combining  extensive dictionaries  morphological and derivational patterns
  • 12. Evaluation (F-measure) Medication 83.59% Dosage 82.67% Frequency 83.49% Mode 85.33% Duration 51.00% Reason 38.81% All fields 78.47% Spasić I, Sarafraz F, Keane JA, Nenadic G: “Medication Information Extraction with Linguistic Pattern Matching and Semantic Rules”, JAMIA (to appear)
  • 13. Summary  NLP and text mining techniques are useful for extraction of clinical data - disease status extraction: 95-97% accuracy - medication information extraction: 80% F-measure  Construction of reliable and sufficient resources - clinical terms and abbreviations (e.g., disease synonyms, symptoms, drugs) - context patterns related to diseases, medication, etc.  Domain knowledge required  construction of domain- and task-specific resources  complex clinical facts and conditions for inference  more comprehensive knowledge representation needed