SlideShare a Scribd company logo
1 of 50
How to Use Text Analytics in
Healthcare to Improve Outcomes:
Why You Need More than NLP
Eric Just
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Analytics in Healthcare
Eighty percent of clinical data is locked
away in unstructured physician notes that
can’t be read by an EHR and so can’t be
accessed by advanced decision support
and quality improvement applications.”
Peter J. Embi, MD, MS,
President and CEO
Unfortunately, more than 95 percent of health systems can’t utilize
this valuable clinical data because the analytics needed to access
it is difficult, expensive, and requires an advanced technical skillset
and infrastructure.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Analytics in Healthcare
For example, a team of analysts in Indiana set
out to identify peripheral arterial disease (PAD)
patients across two health systems.
The team hit a roadblock when they identified
the structured EMR and claims data failed to
identify over 75 percent of patients with PAD.
To better understand high-risk populations,
such as PAD patients, health systems must
leverage text analytics, including text search
refined with natural language processing (NLP).
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Analytics in Healthcare
So far most health systems’ use of text
analytics has been limited to research
departments within academic medical
centers.
This presentation explains why text
analytics in healthcare is important in all
areas of the industry and demonstrates
how health systems can leverage it.
It describes the four critical components
of text analytics, from optimizing text
search to pragmatically integrating
text analytics system-wide.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
Health systems simply aren’t leveraging
text analytics—Gartner estimates less
than 5 percent do as of July 2016.
Most systems manually curate patient
registries and rely only on coded data—
an approach that significantly limits their
understanding of patient populations.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
Picking back up on the search for
patients with PAD from above, let’s
understand why the analytics team was
looking at this high-risk cohort.
PAD is a condition in which narrowed
arteries reduce blood flow to limbs; it
affects more than 3 million patients
every year in the United States.
Patients with PAD are high-risk for
coronary heart disease, heart attack,
and stroke, and translates to both sicker
patients with higher financial cost.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
At first, the Indiana team identified
less than 10,000 PAD patients using a
traditional approach (ICD and CPTcodes).
Hoping for a more complete patient
list, the team tasked a sophisticated
NLP group to write algorithms to
identify more PAD patients.
Integrating text analytics led to the
discovery of over 41,000 PAD patients.
There is a clear need to leverage
unstructured text data in healthcare
analytics despite the limited use today.
Identifying PAD Patients
Traditional
Methods
NLP
Algorithms
>300%
increase
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
In the typical text analytics scenario, a
data scientist writes an NLP algorithm
to determine the number of PAD
patients, validates the algorithm, and
returns the result to the investigator.
In this scenario, only two people in the
health system understand the PAD
patient population.
Systems need more than just a handful
of specialized algorithms serving small
groups of stakeholders.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
As a healthcare system administrator, I want to understand my high-
risk population better. I want to find all patients with peripheral
arterial disease (PAD). I know there are more patients than I was able
to find by simply querying diagnosis and procedure codes.”
Data scientist
develops PAD text
mining algorithm
Algorithm
validated
The patient
cohort is
defined
Results returned
to investigator
Typical Scenario
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
In a much better text analytics scenario,
the data scientist does more than create
and validate an algorithm.
The scientist deploys the algorithm to the
system’s analytics environment to be run
on a nightly basis, allowing any authorized
analytics user to make use of the results.
In this improved scenario, the health
system knows how many PAD patients it
has, how many new patients came into the
system, and what the care gaps are.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
As a healthcare system administrator, I want to understand my high-
risk population better. I want to find all patients with peripheral
arterial disease (PAD). I know there are more patients than I was able
to find by simply querying diagnosis and procedure codes.”
Data scientist
develops PAD text
mining algorithm
Algorithm
validated
The patient
cohort is
defined
PAD algorithm run
nightly and stored in
data warehouse
Better Scenario
PAD algorithm output
combined with coded data
to create PAD registry
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Google: The Text Search Gold Standard
When it comes to text search, Google is
the gold standard.
People overwhelmingly believe that
Google is easy to use, fast, and accurate.
It finishes our sentences for us and, most
of the time, gives us exactly what we’re
looking for.
Even though Google search is simple
and straight forward, it’s a lot more
complicated than it seems.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Google: The Text Search Gold Standard
Google analyzes every web page clicked,
looks at other web pages that link to those
pages, and examines synonyms too.
Google seems simple but is a
sophisticated text analytics machine.
Google is an incredible text analytics tool
for finding information in web pages online.
What if finding healthcare data in medical
records were as simple as using Google?
Let’s look at what it would take to build
“Google for the Medical Record.”
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How Text Search Works
At the core of how text search works is
an inverted index—an index similar to
one at the end of a book that lists words
in the text and where they appear.
Search engines index documents.
They read each document, break them
into all the different words that appear in
the documents, and create a sorted list of
all the words.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How Text Search Works
For example the following slide features
three simple clinical documents, starting
with document 0.
• Document 0 states that the patient is a
67-year-old female with NIDDM
(noninsulin-dependent diabetes mellitus)
and hypertension.
• Document 1 states that the patient has
no diabetes or hypertension.
• Document 2 states that the patient’s
mother and sister are diabetic.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
The Current State of Text Analytics in Healthcare
The Basis of Text Search:
The Inverted Index
Patient is a 67 year old female
with NIDDM and hypertension.
Document 0
The patient has no diabetes
or hypertension.
Document 1
Patient’s mother is diabetic.
Patient’s sister is diabetic.
Document 2
Words Document Inverted Index
67 0 {(0,3)}
diabet 1,2 {(1,4),(2,3),(2,7)}
female 0 {(0,6)}
hyperten 0,1 {(0,8),(1,6)}
mother 2 {(2,1)}
niddm 0 {(0,8)}
no 1 {(1,3)}
old 0 {(0,5)}
patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)}
sister 2 {(2,5)}
year 0 {(0,4)}
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How Text Search Works
The second column in the
inverted index (“document”)
indicates in which document
each word appears.
The third column (“inverted
index”) indicates the position of
the word in the document.
For example, the word “old”
appears in document 0 in the
fifth position.
Words Document Inverted Index
67 0 {(0,3)}
diabet 1,2 {(1,4),(2,3),(2,7)}
female 0 {(0,6)}
hyperten 0,1 {(0,8),(1,6)}
mother 2 {(2,1)}
niddm 0 {(0,8)}
no 1 {(1,3)}
old 0 {(0,5)}
patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)}
sister 2 {(2,5)}
year 0 {(0,4)}
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How Text Search Works
The term “diabet” also appears,
which is a word stem.
Words that tend to have high
level of variance are broken into
word stems—a simpler version
of the word—to broaden search
capacity.
In this example, “diabet” is
mapped to documents 1 and 2.
Words Document Inverted Index
67 0 {(0,3)}
diabet 1,2 {(1,4),(2,3),(2,7)}
female 0 {(0,6)}
hyperten 0,1 {(0,8),(1,6)}
mother 2 {(2,1)}
niddm 0 {(0,8)}
no 1 {(1,3)}
old 0 {(0,5)}
patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)}
sister 2 {(2,5)}
year 0 {(0,4)}
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Search Tools
Three Great Text Search Tools for Indexing Text
and Providing Search Capability
• Tool #1 – Lucene
• Tool #2 – Solr
• Tool #3 – Elasticsearch
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Search Tools
Written in 1999, Lucene is a Java-
based API that provides the foundation
for more advanced search engine
capabilities.
Lucene creates and maintains the
search index and does hit ranking and
result sorting.
Most users access Lucene through one
of two technologies, Solr and
Elastsicsearch.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Search Tools
Solr is an open source, Java-based
enterprise search tool which pioneered
advances on top of Lucene starting in 2004.
Health systems can use any programming
language to develop with Solr, which makes
it easy to create an inverted index, simple
web page, and search interface like Google.
Many leading technology organizations use
Solr, such as Apple and NASA.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Text Search Tools
Elasticsearch is a newer open source
enterprise search tool, written in 2010.
Elasticsearch, like Solr, uses Lucene as
a foundation.
Elasticsearch has built its own set of APIs
and functionality over the inverted index.
Netflix and Facebook use Elasticsearch.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
As demonstrated on the next slide, the search
for diabetes surfaced two indexed documents.
Using word stemming, the search found both
diabetes and diabetic.
But there are two problems with this search:
it missed the mention of NIDDM (a synonym)
and neither result is relevant to a medical
cohort query for diabetics.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
diabetes Go
Results: 2 records, 0.0 ms
Document 2:
Patient’s mother is diabetic. Patient’s sister
is diabetic.
Document 1:
The patient has no diabetes or hypertension.
Found both diabetes and
diabetic (word stemming)
Missed mention of NIDDM
(synonyms)
Neither result is relevant to a
medical cohort query for
diabetics (context)
Although the simple, familiar interface and fast results (generated by the inverted index)
worked well, three text search must-haves for medical searches—display, medical
terminologies, and context—are missing.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #1: Optimize Results Display for Use Cases
We must create a solution that matches
the needs of those using the data.
For example, users of data may not
have permission to view PHI.
If we aggregate the results of the text
analysis, then we provide users with
access to the key results they need
without compromising privacy.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #2: Expand Search with Medical Terminologies
If you recall, the diabetes search didn’t
include NIDDM, a synonym of diabetes.
Health systems will produce more
sophisticated analyses by leveraging
medical terminologies to automatically
expand synonym lists.
Providing a quick and easy way for users
to identify synonyms valuable to their
search will increase the quantity and
accuracy of the results.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #3: Refine Results with Context
Context matters.
To be useful for clinical applications (e.g.,
searching for genotype/phenotype
correlations), retrieving patients eligible for
a clinical trial, or identifying disease
outbreaks, simply identifying clinical
conditions in the text is insufficient.
Information described in the context of the
clinical condition is critical for
understanding the patient’s state.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #3: Refine Results with Context
If we are searching for patients with
pneumonia, then searching for “pneumonia”
without considering the context would result in
identifying the following types of phrases:
• “ruled out pneumonia”
• “history of pneumonia”
• “family history of pneumonia”
None of these indicate the patient currently
has pneumonia.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #3: Refine Results with Context
ConText, an NLP pattern matching algorithm
published in 2009, solves for these scenarios
through enhancing text search and refining
results by detecting conditions and
determining if they are negated, historical, or
experienced by someone else.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Must-Have #3: Refine Results with Context
The following slide shows how the ConText
algorithm analyzes the sentence:
“No history of chest tightness but family
history of CHF”
It looks for several things, including:
• Conditions (e.g., chest tightness)
• Negation triggers (e.g., the word “no”)
• Historical triggers (e.g., the word “history”)
• Termination (e.g., the word “but”)
• Experiencer triggers (e.g., the word “family”)
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Wendy W. Chapman, David Chu, John N. Dowling
J Biomed Inform. 2009 Oct; 42(5): 839–851.
Chest tightness
Negation: affirmed
Experiencer: patient
Temporality: recent
”No history of chest tightness but family history of CHF.”
ConText
CHF
Negation: affirmed
Experiencer: patient
Temporality: recent
Chest tightness
Negation: negated
Experiencer: patient
Temporality: historical
CHF
Negation: affirmed
Experiencer: other
Temporality: historical
Negation trigger
Historical trigger Termination
Termination Historical triggerCondition Condition
Other experiencer trigger
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Applying Context and Extracting Values with an NLP Pipeline
NLP pipelines can be run using one of
several Java-based, open source tools,
such as Apache Unstructured Information
Management Architecture (UIMA) and
General Architecture for Text Engineering
(GATE).
After starting a query with a clinical
search and expanding that search to
include meaningful terms, the next step is
passing the search onto an NLP pipeline.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Applying Context and Extracting Values with an NLP Pipeline
In this example, a series of three analyses
performed on the text that relate to and build
on each other:
1. Sentence detection.
2. Entity recognition (e.g., diabetes).
3. Context algorithm.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Applying Context and Extracting Values with an NLP Pipeline
• Analysis of context uses a sentence as an operand
• Identifying sentences in clinical text is not straightforward
̶ Have you ever seen punctuation in a clinical note?
• An NLP analysis pipeline ties it all together
Search results
Sentence
detection
Entity
recognition
(i.e. diabetes)
Context
Algorithm
Present user
with additional
filters
NLP Pipeline Frameworks
• Apache Unstructured Information Management Architecture (UIMA)
• General Architecture for Text Engineering (GATE)
• Natural Language Toolkit (NLTK)
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
After the third and final step in this NLP
pipeline (context algorithm), users are
presented with additional filters in which
they can choose their desired context
(next slide).
When it comes to text analytics
frameworks, flexibility to run algorithms
based on a user’s specific question is key.
The NLP pipeline should create this
flexibility by expanding search capacity
to refine results.
Applying Context and Extracting Values with an NLP Pipeline
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Filter Diabetes Results
Context: Apply
to Search
Results
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
The NLP pipeline should also extract
values as demanded by each use case.
Many health systems are burdened by
regulatory reporting, especially when
certain measures like ejection fraction
are not stored as discrete values.
In lieu of automated reporting, health
systems are obligated to engage team
members to be “chart abstractors” to
manually open thousands of patient
charts to find ejection fraction values
found in clinical notes.
Applying Context and Extracting Values with an NLP Pipeline
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
An NLP pipeline should not only
identify when ejection fraction is
documented in a note, but also save
each value in such a way that the
organization’s analytics platform can
utilize the discrete value in their
automated reporting.
Other common extraction projects
include aortic root size, PHQ
depression scores, and ankle
brachial index..
Applying Context and Extracting Values with an NLP Pipeline
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Three Text Search Must-Haves: Display, Medical
Terminologies, and Context
Other Pieces to the NLP Pipeline: Extract Values
ef_phrase qualifiers ef_low ef_high ef_mid ef_word
ejection fraction is at least 70-75 is at least 70 75 72.5 NULL
ejection fraction of about 20 of about 20 20 20 NULL
ejection fraction of 60 of 60 60 60 NULL
ejection fraction of greater than 65 of greater than 65 65 65 NULL
ejection fraction of 55 of 55 55 55 NULL
ejection fraction by visual inspection is 65 by visual inspection is 65 65 65 NULL
LVEF is normal is NULL NULL NULL normal
b(((LV)?EF)|(Ejections+Fraction))s+(?<qualifiers>([^sd]+s+){0,5})(?(((?<ef_low>d+)-
(?<ef_high>d+))|(?<ef_mid_txt>d+)|(?<ef_word>([^s]*?normal)|(moderate)|(severe)))
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Always Validate the Algorithm
Quantifying algorithm accuracy is critical.
Every algorithm needs a validation workflow,
which typically includes four steps:
1. Build studies to review query results.
2. Assign team members to review results.
3. Randomly select records to represent study.
4. Highlight key words for easy chart review.
When evaluating NLP tools or devising an
NLP strategy, health systems should ensure
validation is either built into the tool or very
easy to do.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Focus on Interoperability and Pragmatic Integration
Health systems need to combine text analytics
with discrete data, and an enterprise data
warehouse (EDW) is a great place to do that.
A siloed approach to text analytics won’t work;
make sure to integrate validated text analytics
with all other analytics within the organization.
For example, a health system would put its
diabetes and PAD cohorts into the EDW and
make that data available system-wide. Health
systems need a platform that can pull in data
on a regular, timely basis to ensure it’s current.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Focus on Interoperability and Pragmatic Integration
Text Analytics Must Be Interoperable!
Diabetes Cohort
PAD Cohort
Ejection Fraction
Tumor Sizes
Validated Text Analytics
…
Data Warehouse
• Population Analytics
• Care Improvement
• Operational Improvement
• Financial Improvement
• Research
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Focus on Interoperability and Pragmatic Integration
Text analytics will open a new world of
data analysis to health systems.
It’s unlikely that anyone can identify the
exact data uses and needs for text data in
the next two, three, or five years.
Instead of trying to make lasting decisions
on a data model up front, we recommend
focusing on what’s needed to perform
timely, relevant analytics; healthcare
analytics that quickly adapt to new
questions and use cases.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Focus on Interoperability and Pragmatic Integration
This approach follows the Health
Catalyst® Late-Binding™ architecture
that allows for the flexibility that’s so
critical for the constantly evolving
world of text analytics.
Health Catalyst
Data Warehouse eBook.
FREE
DOWNLOAD
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How to Use Text Analytics to Improve Outcomes
The power of text analytics in healthcare to
create precise patient registries is undeniable.
Rather than manually curating patient
registries and relying solely on coded data,
health systems should leverage data from
coded data sets, NLP, and extraction projects
to create precise patient registries.
Rather than limiting the use of text analytics to
data scientists, architects, and analysts,
health systems should make these tools
widely accessible and empower the entire
organization to leverage clinical text analytics.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How to Use Text Analytics to Improve Outcomes
Health systems can start using text analytics
to improve outcomes today by focusing on
four key components:
#1: Optimize Text Search (Display, Medical
Terminologies, and Context)
#2: Enhance Context and Extract Values
with an NLP Pipeline
#3: Always Validate the Algorithm
#4: Focus on Interoperability and Integration
Using a Late-Binding Approach
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
How to Use Text Analytics to Improve Outcomes
This broad, all-encompassing approach
to text analytics in healthcare will
position health systems for clinical and
financial success as they strive to create
precise patient registries, enhance their
understanding of high-risk patient
populations, and improve outcomes.
Health systems should invest in a
solution that not only solves today’s
problems, but can also be expanded to
solve future use cases.
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
For more information:
“This book is a fantastic piece of work”
– Robert Lindeman MD, FAAP, Chief Physician Quality Officer
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
More about this topic
Link to original article for a more in-depth discussion.
How to Use Text Analytics in Healthcare to Improve Outcomes—Why You Need More than NLP
Regenstrief Institute and Health Catalyst Team to Reveal Hidden Meaning in Clinical Data for
Better Patient Care – Health Catalyst News
Healthcare Analytics Adoption Model: A Framework and Roadmap (white paper)
Dale Sanders, Executive VP of Software
Healthcare Analytics: Realizing the Value of Health IT
Brian Ahier – Guest Blogger
3 Reasons Why Comparative Analytics, Predictive Analytics, and NLP
Won’t Solve Healthcare’s Problems ̶ Dale Sanders, Executive VP of Software
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Doug Adamson, Chief Technology Officer, VP
© 2016 Health Catalyst
Proprietary. Feel free to share but we would appreciate a Health Catalyst citation.
Eric Just joined the Health Catalyst family in August of 2011 as Vice President of Technology,
bringing over 10 years of biomedical informatics experience. Prior to Health Catalyst, he
managed the research arm of the Northwestern Medical Data Warehouse at Northwestern
University's Feinberg School of Medicine. In this role, he led the development of technology,
processes, and teams to leverage the clinical data warehouse. Previously, as a senior data
architect, he helped create the data warehouse technical foundation and innovated new ways
to extract and load medical data. In addition, he led the development effort for a genome database. Eric
holds a Master of Science in Chemistry from Northwestern University and a Bachelors of Science in
Chemistry from the College of William and Mary.
Other Clinical Quality Improvement Resources
Click to read additional information at www.healthcatalyst.com

More Related Content

What's hot

Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Alia Hamwi
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalTony Russell-Rose
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Machine Learning in Healthcare
Machine Learning in HealthcareMachine Learning in Healthcare
Machine Learning in HealthcareBigR.io
 
Repot on-hospital-manegment-system
Repot on-hospital-manegment-systemRepot on-hospital-manegment-system
Repot on-hospital-manegment-systemPNEC
 
Top 10 clinical data manager interview questions and answers
Top 10 clinical data manager interview questions and answersTop 10 clinical data manager interview questions and answers
Top 10 clinical data manager interview questions and answerscadderlux
 
Source Documents in Clinical Trials_part1
Source Documents in Clinical Trials_part1Source Documents in Clinical Trials_part1
Source Documents in Clinical Trials_part1Valentyna Korniyenko
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using wekaPrashant Menon
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceSrishti44
 
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Nirav Raje
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Very brief overview of AI in drug discovery
Very brief overview of AI in drug discoveryVery brief overview of AI in drug discovery
Very brief overview of AI in drug discoveryDr. Gerry Higgins
 
Natural Language Processing In Healthcare
Natural Language Processing In HealthcareNatural Language Processing In Healthcare
Natural Language Processing In HealthcareLaxmiMPriya
 
Hospital information system[1]
Hospital information system[1]Hospital information system[1]
Hospital information system[1]Prasit Chanarat
 
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...DataNumen, Inc.
 
Proposal final
Proposal finalProposal final
Proposal finalMido Razaz
 

What's hot (20)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information Retrieval
 
Budgeting study
Budgeting studyBudgeting study
Budgeting study
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Machine Learning in Healthcare
Machine Learning in HealthcareMachine Learning in Healthcare
Machine Learning in Healthcare
 
Repot on-hospital-manegment-system
Repot on-hospital-manegment-systemRepot on-hospital-manegment-system
Repot on-hospital-manegment-system
 
Top 10 clinical data manager interview questions and answers
Top 10 clinical data manager interview questions and answersTop 10 clinical data manager interview questions and answers
Top 10 clinical data manager interview questions and answers
 
Source Documents in Clinical Trials_part1
Source Documents in Clinical Trials_part1Source Documents in Clinical Trials_part1
Source Documents in Clinical Trials_part1
 
Kaiser Permanente HealthConnect - EHR and SNOMED
Kaiser Permanente HealthConnect - EHR and SNOMEDKaiser Permanente HealthConnect - EHR and SNOMED
Kaiser Permanente HealthConnect - EHR and SNOMED
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using weka
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Very brief overview of AI in drug discovery
Very brief overview of AI in drug discoveryVery brief overview of AI in drug discovery
Very brief overview of AI in drug discovery
 
Natural Language Processing In Healthcare
Natural Language Processing In HealthcareNatural Language Processing In Healthcare
Natural Language Processing In Healthcare
 
Hospital information system[1]
Hospital information system[1]Hospital information system[1]
Hospital information system[1]
 
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...
2 Solutions to Fix Scanpst Error "An unexpected error prevented access to thi...
 
Proposal final
Proposal finalProposal final
Proposal final
 

Similar to How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need More than NLP

When Healthcare Data Analysts Fulfill the Data Detective Role
When Healthcare Data Analysts Fulfill the Data Detective RoleWhen Healthcare Data Analysts Fulfill the Data Detective Role
When Healthcare Data Analysts Fulfill the Data Detective RoleHealth Catalyst
 
Data Science for Healthcare: What Today’s Leaders Must Know
Data Science for Healthcare: What Today’s Leaders Must KnowData Science for Healthcare: What Today’s Leaders Must Know
Data Science for Healthcare: What Today’s Leaders Must KnowHealth Catalyst
 
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...Health Catalyst
 
Healthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealth Catalyst
 
Analyzing healthcare data
Analyzing healthcare dataAnalyzing healthcare data
Analyzing healthcare dataRepustate
 
Going Beyond Genomics in Precision Medicine: What's Next
Going Beyond Genomics in Precision Medicine: What's NextGoing Beyond Genomics in Precision Medicine: What's Next
Going Beyond Genomics in Precision Medicine: What's NextHealth Catalyst
 
The Best Way to Optimize Physician Workflow
The Best Way to Optimize Physician WorkflowThe Best Way to Optimize Physician Workflow
The Best Way to Optimize Physician WorkflowHealth Catalyst
 
Data science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxData science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxArpitaDebnath20
 
Harnessing the potential of unstructured healthcare data converted
Harnessing the potential of unstructured healthcare data convertedHarnessing the potential of unstructured healthcare data converted
Harnessing the potential of unstructured healthcare data convertedezDI
 
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715Balaji Krishnapuram
 
Harnessing the potential of unstructured healthcare data
Harnessing the potential of unstructured healthcare dataHarnessing the potential of unstructured healthcare data
Harnessing the potential of unstructured healthcare dataezDI
 
Running head HEALTH PROMOTION IN MINORITY POPULATIONS .docx
Running head  HEALTH PROMOTION IN MINORITY POPULATIONS           .docxRunning head  HEALTH PROMOTION IN MINORITY POPULATIONS           .docx
Running head HEALTH PROMOTION IN MINORITY POPULATIONS .docxhealdkathaleen
 
The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation
 The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation
The Healthcare Analytics Ecosystem: A Must-Have in Today’s TransformationHealth Catalyst
 
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in HealthcareHealth Catalyst
 
A novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseA novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseeSAT Journals
 
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareThree Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareHealth Catalyst
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSemantic Web San Diego
 
Use of data analytics in health care
Use of data analytics in health careUse of data analytics in health care
Use of data analytics in health careAkanshabhushan
 
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...Health Catalyst
 
Fall 2014 Communique by Anesthesia Business Consultants (ABC)
Fall 2014 Communique by Anesthesia Business Consultants (ABC)Fall 2014 Communique by Anesthesia Business Consultants (ABC)
Fall 2014 Communique by Anesthesia Business Consultants (ABC)Anesthesia Business Consultants
 

Similar to How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need More than NLP (20)

When Healthcare Data Analysts Fulfill the Data Detective Role
When Healthcare Data Analysts Fulfill the Data Detective RoleWhen Healthcare Data Analysts Fulfill the Data Detective Role
When Healthcare Data Analysts Fulfill the Data Detective Role
 
Data Science for Healthcare: What Today’s Leaders Must Know
Data Science for Healthcare: What Today’s Leaders Must KnowData Science for Healthcare: What Today’s Leaders Must Know
Data Science for Healthcare: What Today’s Leaders Must Know
 
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...
Use ACE Scores in Machine Learning to Predict Disease Earlier and Improve Out...
 
Healthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and Technology
 
Analyzing healthcare data
Analyzing healthcare dataAnalyzing healthcare data
Analyzing healthcare data
 
Going Beyond Genomics in Precision Medicine: What's Next
Going Beyond Genomics in Precision Medicine: What's NextGoing Beyond Genomics in Precision Medicine: What's Next
Going Beyond Genomics in Precision Medicine: What's Next
 
The Best Way to Optimize Physician Workflow
The Best Way to Optimize Physician WorkflowThe Best Way to Optimize Physician Workflow
The Best Way to Optimize Physician Workflow
 
Data science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxData science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptx
 
Harnessing the potential of unstructured healthcare data converted
Harnessing the potential of unstructured healthcare data convertedHarnessing the potential of unstructured healthcare data converted
Harnessing the potential of unstructured healthcare data converted
 
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715
Krishnapuram_TransformingHealth_HIMSS_CIO_Summit_100715
 
Harnessing the potential of unstructured healthcare data
Harnessing the potential of unstructured healthcare dataHarnessing the potential of unstructured healthcare data
Harnessing the potential of unstructured healthcare data
 
Running head HEALTH PROMOTION IN MINORITY POPULATIONS .docx
Running head  HEALTH PROMOTION IN MINORITY POPULATIONS           .docxRunning head  HEALTH PROMOTION IN MINORITY POPULATIONS           .docx
Running head HEALTH PROMOTION IN MINORITY POPULATIONS .docx
 
The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation
 The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation
The Healthcare Analytics Ecosystem: A Must-Have in Today’s Transformation
 
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
 
A novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy databaseA novel methodology for diagnosing the heart disease using fuzzy database
A novel methodology for diagnosing the heart disease using fuzzy database
 
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in HealthcareThree Reasons Augmented Intelligence Is the Future of AI in Healthcare
Three Reasons Augmented Intelligence Is the Future of AI in Healthcare
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applications
 
Use of data analytics in health care
Use of data analytics in health careUse of data analytics in health care
Use of data analytics in health care
 
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
 
Fall 2014 Communique by Anesthesia Business Consultants (ABC)
Fall 2014 Communique by Anesthesia Business Consultants (ABC)Fall 2014 Communique by Anesthesia Business Consultants (ABC)
Fall 2014 Communique by Anesthesia Business Consultants (ABC)
 

More from Health Catalyst

2024 HCAT Healthcare Technology Insights
2024 HCAT Healthcare Technology Insights2024 HCAT Healthcare Technology Insights
2024 HCAT Healthcare Technology InsightsHealth Catalyst
 
Three Keys to a Successful Margin: Charges, Costs, and Labor
Three Keys to a Successful Margin: Charges, Costs, and LaborThree Keys to a Successful Margin: Charges, Costs, and Labor
Three Keys to a Successful Margin: Charges, Costs, and LaborHealth Catalyst
 
2024 CPT® Updates (Professional Services Focused) - Part 3
2024 CPT® Updates (Professional Services Focused) - Part 32024 CPT® Updates (Professional Services Focused) - Part 3
2024 CPT® Updates (Professional Services Focused) - Part 3Health Catalyst
 
2024 CPT® Code Updates (HIM Focused) - Part 2
2024 CPT® Code Updates (HIM Focused) - Part 22024 CPT® Code Updates (HIM Focused) - Part 2
2024 CPT® Code Updates (HIM Focused) - Part 2Health Catalyst
 
2024 CPT® Code Updates (CDM Focused) - Part 1
2024 CPT® Code Updates (CDM Focused) - Part 12024 CPT® Code Updates (CDM Focused) - Part 1
2024 CPT® Code Updates (CDM Focused) - Part 1Health Catalyst
 
What’s Next for Hospital Price Transparency in 2024 and Beyond
What’s Next for Hospital Price Transparency in 2024 and BeyondWhat’s Next for Hospital Price Transparency in 2024 and Beyond
What’s Next for Hospital Price Transparency in 2024 and BeyondHealth Catalyst
 
Automated Patient Reported Outcomes (PROs) for Hip & Knee Replacement
Automated Patient Reported Outcomes (PROs) for Hip & Knee ReplacementAutomated Patient Reported Outcomes (PROs) for Hip & Knee Replacement
Automated Patient Reported Outcomes (PROs) for Hip & Knee ReplacementHealth Catalyst
 
2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates
2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates
2024 Medicare Physician Fee Schedule (MPFS) Final Rule UpdatesHealth Catalyst
 
What's Next for OPPS: A Look at the 2024 Final Rule
What's Next for OPPS: A Look at the 2024 Final RuleWhat's Next for OPPS: A Look at the 2024 Final Rule
What's Next for OPPS: A Look at the 2024 Final RuleHealth Catalyst
 
Insight into the 2024 ICD-10 PCS Updates - Part 2
Insight into the 2024 ICD-10 PCS Updates - Part 2Insight into the 2024 ICD-10 PCS Updates - Part 2
Insight into the 2024 ICD-10 PCS Updates - Part 2Health Catalyst
 
Vitalware Insight Into the 2024 ICD10 CM Updates.pdf
Vitalware Insight Into the 2024 ICD10 CM Updates.pdfVitalware Insight Into the 2024 ICD10 CM Updates.pdf
Vitalware Insight Into the 2024 ICD10 CM Updates.pdfHealth Catalyst
 
Driving Value: Boosting Clinical Registry Value Using ARMUS Solutions
Driving Value: Boosting Clinical Registry Value Using ARMUS SolutionsDriving Value: Boosting Clinical Registry Value Using ARMUS Solutions
Driving Value: Boosting Clinical Registry Value Using ARMUS SolutionsHealth Catalyst
 
Tech-Enabled Managed Services: Not Your Average Outsourcing
Tech-Enabled Managed Services: Not Your Average OutsourcingTech-Enabled Managed Services: Not Your Average Outsourcing
Tech-Enabled Managed Services: Not Your Average OutsourcingHealth Catalyst
 
2023 Mid-Year CPT/HCPCS Code Set Updates
2023 Mid-Year CPT/HCPCS Code Set Updates2023 Mid-Year CPT/HCPCS Code Set Updates
2023 Mid-Year CPT/HCPCS Code Set UpdatesHealth Catalyst
 
How Managing Chronic Conditions Is Streamlined with Digital Technology
How Managing Chronic Conditions Is Streamlined with Digital TechnologyHow Managing Chronic Conditions Is Streamlined with Digital Technology
How Managing Chronic Conditions Is Streamlined with Digital TechnologyHealth Catalyst
 
COVID-19: After the Public Health Emergency Ends
COVID-19: After the Public Health Emergency EndsCOVID-19: After the Public Health Emergency Ends
COVID-19: After the Public Health Emergency EndsHealth Catalyst
 
Automated Medication Compliance Tools for the Provider and Patient
Automated Medication Compliance Tools for the Provider and PatientAutomated Medication Compliance Tools for the Provider and Patient
Automated Medication Compliance Tools for the Provider and PatientHealth Catalyst
 
A Facility-Focused Guide to Applying Modifiers Corectly.pptx
A Facility-Focused Guide to Applying Modifiers Corectly.pptxA Facility-Focused Guide to Applying Modifiers Corectly.pptx
A Facility-Focused Guide to Applying Modifiers Corectly.pptxHealth Catalyst
 
Self-Service Analytics: How to Use Healthcare Business Intelligence
Self-Service Analytics: How to Use Healthcare Business IntelligenceSelf-Service Analytics: How to Use Healthcare Business Intelligence
Self-Service Analytics: How to Use Healthcare Business IntelligenceHealth Catalyst
 
Optimize Your Labor Management with Health Catalyst PowerLabor™
Optimize Your Labor Management with Health Catalyst PowerLabor™Optimize Your Labor Management with Health Catalyst PowerLabor™
Optimize Your Labor Management with Health Catalyst PowerLabor™Health Catalyst
 

More from Health Catalyst (20)

2024 HCAT Healthcare Technology Insights
2024 HCAT Healthcare Technology Insights2024 HCAT Healthcare Technology Insights
2024 HCAT Healthcare Technology Insights
 
Three Keys to a Successful Margin: Charges, Costs, and Labor
Three Keys to a Successful Margin: Charges, Costs, and LaborThree Keys to a Successful Margin: Charges, Costs, and Labor
Three Keys to a Successful Margin: Charges, Costs, and Labor
 
2024 CPT® Updates (Professional Services Focused) - Part 3
2024 CPT® Updates (Professional Services Focused) - Part 32024 CPT® Updates (Professional Services Focused) - Part 3
2024 CPT® Updates (Professional Services Focused) - Part 3
 
2024 CPT® Code Updates (HIM Focused) - Part 2
2024 CPT® Code Updates (HIM Focused) - Part 22024 CPT® Code Updates (HIM Focused) - Part 2
2024 CPT® Code Updates (HIM Focused) - Part 2
 
2024 CPT® Code Updates (CDM Focused) - Part 1
2024 CPT® Code Updates (CDM Focused) - Part 12024 CPT® Code Updates (CDM Focused) - Part 1
2024 CPT® Code Updates (CDM Focused) - Part 1
 
What’s Next for Hospital Price Transparency in 2024 and Beyond
What’s Next for Hospital Price Transparency in 2024 and BeyondWhat’s Next for Hospital Price Transparency in 2024 and Beyond
What’s Next for Hospital Price Transparency in 2024 and Beyond
 
Automated Patient Reported Outcomes (PROs) for Hip & Knee Replacement
Automated Patient Reported Outcomes (PROs) for Hip & Knee ReplacementAutomated Patient Reported Outcomes (PROs) for Hip & Knee Replacement
Automated Patient Reported Outcomes (PROs) for Hip & Knee Replacement
 
2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates
2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates
2024 Medicare Physician Fee Schedule (MPFS) Final Rule Updates
 
What's Next for OPPS: A Look at the 2024 Final Rule
What's Next for OPPS: A Look at the 2024 Final RuleWhat's Next for OPPS: A Look at the 2024 Final Rule
What's Next for OPPS: A Look at the 2024 Final Rule
 
Insight into the 2024 ICD-10 PCS Updates - Part 2
Insight into the 2024 ICD-10 PCS Updates - Part 2Insight into the 2024 ICD-10 PCS Updates - Part 2
Insight into the 2024 ICD-10 PCS Updates - Part 2
 
Vitalware Insight Into the 2024 ICD10 CM Updates.pdf
Vitalware Insight Into the 2024 ICD10 CM Updates.pdfVitalware Insight Into the 2024 ICD10 CM Updates.pdf
Vitalware Insight Into the 2024 ICD10 CM Updates.pdf
 
Driving Value: Boosting Clinical Registry Value Using ARMUS Solutions
Driving Value: Boosting Clinical Registry Value Using ARMUS SolutionsDriving Value: Boosting Clinical Registry Value Using ARMUS Solutions
Driving Value: Boosting Clinical Registry Value Using ARMUS Solutions
 
Tech-Enabled Managed Services: Not Your Average Outsourcing
Tech-Enabled Managed Services: Not Your Average OutsourcingTech-Enabled Managed Services: Not Your Average Outsourcing
Tech-Enabled Managed Services: Not Your Average Outsourcing
 
2023 Mid-Year CPT/HCPCS Code Set Updates
2023 Mid-Year CPT/HCPCS Code Set Updates2023 Mid-Year CPT/HCPCS Code Set Updates
2023 Mid-Year CPT/HCPCS Code Set Updates
 
How Managing Chronic Conditions Is Streamlined with Digital Technology
How Managing Chronic Conditions Is Streamlined with Digital TechnologyHow Managing Chronic Conditions Is Streamlined with Digital Technology
How Managing Chronic Conditions Is Streamlined with Digital Technology
 
COVID-19: After the Public Health Emergency Ends
COVID-19: After the Public Health Emergency EndsCOVID-19: After the Public Health Emergency Ends
COVID-19: After the Public Health Emergency Ends
 
Automated Medication Compliance Tools for the Provider and Patient
Automated Medication Compliance Tools for the Provider and PatientAutomated Medication Compliance Tools for the Provider and Patient
Automated Medication Compliance Tools for the Provider and Patient
 
A Facility-Focused Guide to Applying Modifiers Corectly.pptx
A Facility-Focused Guide to Applying Modifiers Corectly.pptxA Facility-Focused Guide to Applying Modifiers Corectly.pptx
A Facility-Focused Guide to Applying Modifiers Corectly.pptx
 
Self-Service Analytics: How to Use Healthcare Business Intelligence
Self-Service Analytics: How to Use Healthcare Business IntelligenceSelf-Service Analytics: How to Use Healthcare Business Intelligence
Self-Service Analytics: How to Use Healthcare Business Intelligence
 
Optimize Your Labor Management with Health Catalyst PowerLabor™
Optimize Your Labor Management with Health Catalyst PowerLabor™Optimize Your Labor Management with Health Catalyst PowerLabor™
Optimize Your Labor Management with Health Catalyst PowerLabor™
 

Recently uploaded

Call Girls Ghaziabad 9999965857 Cheap and Best with original Photos
Call Girls Ghaziabad 9999965857 Cheap and Best with original PhotosCall Girls Ghaziabad 9999965857 Cheap and Best with original Photos
Call Girls Ghaziabad 9999965857 Cheap and Best with original Photosparshadkalavatidevi7
 
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Booking
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment BookingRussian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Booking
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Russian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availableRussian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availablesandeepkumar69420
 
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...sandeepkumar69420
 
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 Hire
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 HireCall Girls Delhi 9873940964 Elite Escort Service Available 24/7 Hire
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 HireCall Girls Delhi
 
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...narwatsonia7
 
2025 Inpatient Prospective Payment System (IPPS) Proposed Rule
2025 Inpatient Prospective Payment System (IPPS) Proposed Rule2025 Inpatient Prospective Payment System (IPPS) Proposed Rule
2025 Inpatient Prospective Payment System (IPPS) Proposed RuleShelby Lewis
 
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdf
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdfSARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdf
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdfDolisha Warbi
 
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Deliverymarshasaifi
 
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort Service
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort ServiceCall Girls Hsr Layout Whatsapp 7001305949 Independent Escort Service
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort Servicenarwatsonia7
 
Globalny raport: „Prawdziwe piękno 2024" od Dove
Globalny raport: „Prawdziwe piękno 2024" od DoveGlobalny raport: „Prawdziwe piękno 2024" od Dove
Globalny raport: „Prawdziwe piękno 2024" od Doveagatadrynko
 
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...satishsharma69855
 
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...ddev2574
 
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...narwatsonia7
 
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOW
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOWRussian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOW
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOWsangeevkumar5478
 
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Booking
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment BookingModels Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Booking
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Biology class 12 assignment neet level practise chapter wise
Biology class 12 assignment neet level practise chapter wiseBiology class 12 assignment neet level practise chapter wise
Biology class 12 assignment neet level practise chapter wiseNAGKINGRAPELLY
 
EMS and Extrication: Coordinating Critical Care
EMS and Extrication: Coordinating Critical CareEMS and Extrication: Coordinating Critical Care
EMS and Extrication: Coordinating Critical CareRommie Duckworth
 
Call Girls Sawda 9999965857 Cheap and Best with original Photos
Call Girls Sawda 9999965857 Cheap and Best with original PhotosCall Girls Sawda 9999965857 Cheap and Best with original Photos
Call Girls Sawda 9999965857 Cheap and Best with original Photoskartikkumark7k7
 

Recently uploaded (20)

Call Girls Ghaziabad 9999965857 Cheap and Best with original Photos
Call Girls Ghaziabad 9999965857 Cheap and Best with original PhotosCall Girls Ghaziabad 9999965857 Cheap and Best with original Photos
Call Girls Ghaziabad 9999965857 Cheap and Best with original Photos
 
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Booking
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment BookingRussian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Booking
Russian Call Girls Sadashivanagar | 7001305949 At Low Cost Cash Payment Booking
 
Russian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service availableRussian Escorts Delhi | 9711199171 | all area service available
Russian Escorts Delhi | 9711199171 | all area service available
 
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Mohan Nagar | 9711199171 | High Profile -New Model -Availa...
 
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 Hire
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 HireCall Girls Delhi 9873940964 Elite Escort Service Available 24/7 Hire
Call Girls Delhi 9873940964 Elite Escort Service Available 24/7 Hire
 
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...
Russian Call Girl Chandapura Dommasandra Road - 7001305949 Escorts Service 50...
 
2025 Inpatient Prospective Payment System (IPPS) Proposed Rule
2025 Inpatient Prospective Payment System (IPPS) Proposed Rule2025 Inpatient Prospective Payment System (IPPS) Proposed Rule
2025 Inpatient Prospective Payment System (IPPS) Proposed Rule
 
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdf
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdfSARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdf
SARS (SEVERE ACUTE RESPIRATORY SYNDROME).pdf
 
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery
9711199012 Najafgarh Call Girls ₹5.5k With COD Free Home Delivery
 
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort Service
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort ServiceCall Girls Hsr Layout Whatsapp 7001305949 Independent Escort Service
Call Girls Hsr Layout Whatsapp 7001305949 Independent Escort Service
 
Globalny raport: „Prawdziwe piękno 2024" od Dove
Globalny raport: „Prawdziwe piękno 2024" od DoveGlobalny raport: „Prawdziwe piękno 2024" od Dove
Globalny raport: „Prawdziwe piękno 2024" od Dove
 
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...
Russian Call Girls Delhi Cantt | 9711199171 | High Profile -New Model -Availa...
 
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Rohini Sector 30 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
 
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...
Hi,Fi Call Girl In Whitefield - [ Cash on Delivery ] Contact 7001305949 Escor...
 
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOW
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOWRussian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOW
Russian Escorts Service Delhi 9711199171 SONI VIP & HOT BOOK NOW
 
Russian Call Girls South Delhi 9711199171 discount on your booking
Russian Call Girls South Delhi 9711199171 discount on your bookingRussian Call Girls South Delhi 9711199171 discount on your booking
Russian Call Girls South Delhi 9711199171 discount on your booking
 
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Booking
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment BookingModels Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Booking
Models Call Girls Electronic City | 7001305949 At Low Cost Cash Payment Booking
 
Biology class 12 assignment neet level practise chapter wise
Biology class 12 assignment neet level practise chapter wiseBiology class 12 assignment neet level practise chapter wise
Biology class 12 assignment neet level practise chapter wise
 
EMS and Extrication: Coordinating Critical Care
EMS and Extrication: Coordinating Critical CareEMS and Extrication: Coordinating Critical Care
EMS and Extrication: Coordinating Critical Care
 
Call Girls Sawda 9999965857 Cheap and Best with original Photos
Call Girls Sawda 9999965857 Cheap and Best with original PhotosCall Girls Sawda 9999965857 Cheap and Best with original Photos
Call Girls Sawda 9999965857 Cheap and Best with original Photos
 

How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need More than NLP

  • 1. How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need More than NLP Eric Just
  • 2. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Analytics in Healthcare Eighty percent of clinical data is locked away in unstructured physician notes that can’t be read by an EHR and so can’t be accessed by advanced decision support and quality improvement applications.” Peter J. Embi, MD, MS, President and CEO Unfortunately, more than 95 percent of health systems can’t utilize this valuable clinical data because the analytics needed to access it is difficult, expensive, and requires an advanced technical skillset and infrastructure.
  • 3. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Analytics in Healthcare For example, a team of analysts in Indiana set out to identify peripheral arterial disease (PAD) patients across two health systems. The team hit a roadblock when they identified the structured EMR and claims data failed to identify over 75 percent of patients with PAD. To better understand high-risk populations, such as PAD patients, health systems must leverage text analytics, including text search refined with natural language processing (NLP).
  • 4. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Analytics in Healthcare So far most health systems’ use of text analytics has been limited to research departments within academic medical centers. This presentation explains why text analytics in healthcare is important in all areas of the industry and demonstrates how health systems can leverage it. It describes the four critical components of text analytics, from optimizing text search to pragmatically integrating text analytics system-wide.
  • 5. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare Health systems simply aren’t leveraging text analytics—Gartner estimates less than 5 percent do as of July 2016. Most systems manually curate patient registries and rely only on coded data— an approach that significantly limits their understanding of patient populations.
  • 6. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare Picking back up on the search for patients with PAD from above, let’s understand why the analytics team was looking at this high-risk cohort. PAD is a condition in which narrowed arteries reduce blood flow to limbs; it affects more than 3 million patients every year in the United States. Patients with PAD are high-risk for coronary heart disease, heart attack, and stroke, and translates to both sicker patients with higher financial cost.
  • 7. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare At first, the Indiana team identified less than 10,000 PAD patients using a traditional approach (ICD and CPTcodes). Hoping for a more complete patient list, the team tasked a sophisticated NLP group to write algorithms to identify more PAD patients. Integrating text analytics led to the discovery of over 41,000 PAD patients. There is a clear need to leverage unstructured text data in healthcare analytics despite the limited use today. Identifying PAD Patients Traditional Methods NLP Algorithms >300% increase
  • 8. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare In the typical text analytics scenario, a data scientist writes an NLP algorithm to determine the number of PAD patients, validates the algorithm, and returns the result to the investigator. In this scenario, only two people in the health system understand the PAD patient population. Systems need more than just a handful of specialized algorithms serving small groups of stakeholders.
  • 9. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare As a healthcare system administrator, I want to understand my high- risk population better. I want to find all patients with peripheral arterial disease (PAD). I know there are more patients than I was able to find by simply querying diagnosis and procedure codes.” Data scientist develops PAD text mining algorithm Algorithm validated The patient cohort is defined Results returned to investigator Typical Scenario
  • 10. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare In a much better text analytics scenario, the data scientist does more than create and validate an algorithm. The scientist deploys the algorithm to the system’s analytics environment to be run on a nightly basis, allowing any authorized analytics user to make use of the results. In this improved scenario, the health system knows how many PAD patients it has, how many new patients came into the system, and what the care gaps are.
  • 11. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare As a healthcare system administrator, I want to understand my high- risk population better. I want to find all patients with peripheral arterial disease (PAD). I know there are more patients than I was able to find by simply querying diagnosis and procedure codes.” Data scientist develops PAD text mining algorithm Algorithm validated The patient cohort is defined PAD algorithm run nightly and stored in data warehouse Better Scenario PAD algorithm output combined with coded data to create PAD registry
  • 12. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Google: The Text Search Gold Standard When it comes to text search, Google is the gold standard. People overwhelmingly believe that Google is easy to use, fast, and accurate. It finishes our sentences for us and, most of the time, gives us exactly what we’re looking for. Even though Google search is simple and straight forward, it’s a lot more complicated than it seems.
  • 13. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Google: The Text Search Gold Standard Google analyzes every web page clicked, looks at other web pages that link to those pages, and examines synonyms too. Google seems simple but is a sophisticated text analytics machine. Google is an incredible text analytics tool for finding information in web pages online. What if finding healthcare data in medical records were as simple as using Google? Let’s look at what it would take to build “Google for the Medical Record.”
  • 14. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How Text Search Works At the core of how text search works is an inverted index—an index similar to one at the end of a book that lists words in the text and where they appear. Search engines index documents. They read each document, break them into all the different words that appear in the documents, and create a sorted list of all the words.
  • 15. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How Text Search Works For example the following slide features three simple clinical documents, starting with document 0. • Document 0 states that the patient is a 67-year-old female with NIDDM (noninsulin-dependent diabetes mellitus) and hypertension. • Document 1 states that the patient has no diabetes or hypertension. • Document 2 states that the patient’s mother and sister are diabetic.
  • 16. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. The Current State of Text Analytics in Healthcare The Basis of Text Search: The Inverted Index Patient is a 67 year old female with NIDDM and hypertension. Document 0 The patient has no diabetes or hypertension. Document 1 Patient’s mother is diabetic. Patient’s sister is diabetic. Document 2 Words Document Inverted Index 67 0 {(0,3)} diabet 1,2 {(1,4),(2,3),(2,7)} female 0 {(0,6)} hyperten 0,1 {(0,8),(1,6)} mother 2 {(2,1)} niddm 0 {(0,8)} no 1 {(1,3)} old 0 {(0,5)} patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)} sister 2 {(2,5)} year 0 {(0,4)}
  • 17. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How Text Search Works The second column in the inverted index (“document”) indicates in which document each word appears. The third column (“inverted index”) indicates the position of the word in the document. For example, the word “old” appears in document 0 in the fifth position. Words Document Inverted Index 67 0 {(0,3)} diabet 1,2 {(1,4),(2,3),(2,7)} female 0 {(0,6)} hyperten 0,1 {(0,8),(1,6)} mother 2 {(2,1)} niddm 0 {(0,8)} no 1 {(1,3)} old 0 {(0,5)} patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)} sister 2 {(2,5)} year 0 {(0,4)}
  • 18. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How Text Search Works The term “diabet” also appears, which is a word stem. Words that tend to have high level of variance are broken into word stems—a simpler version of the word—to broaden search capacity. In this example, “diabet” is mapped to documents 1 and 2. Words Document Inverted Index 67 0 {(0,3)} diabet 1,2 {(1,4),(2,3),(2,7)} female 0 {(0,6)} hyperten 0,1 {(0,8),(1,6)} mother 2 {(2,1)} niddm 0 {(0,8)} no 1 {(1,3)} old 0 {(0,5)} patient 0,1,2 {(0,0),(1,1), (2,0),(2,4)} sister 2 {(2,5)} year 0 {(0,4)}
  • 19. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Search Tools Three Great Text Search Tools for Indexing Text and Providing Search Capability • Tool #1 – Lucene • Tool #2 – Solr • Tool #3 – Elasticsearch
  • 20. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Search Tools Written in 1999, Lucene is a Java- based API that provides the foundation for more advanced search engine capabilities. Lucene creates and maintains the search index and does hit ranking and result sorting. Most users access Lucene through one of two technologies, Solr and Elastsicsearch.
  • 21. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Search Tools Solr is an open source, Java-based enterprise search tool which pioneered advances on top of Lucene starting in 2004. Health systems can use any programming language to develop with Solr, which makes it easy to create an inverted index, simple web page, and search interface like Google. Many leading technology organizations use Solr, such as Apple and NASA.
  • 22. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Text Search Tools Elasticsearch is a newer open source enterprise search tool, written in 2010. Elasticsearch, like Solr, uses Lucene as a foundation. Elasticsearch has built its own set of APIs and functionality over the inverted index. Netflix and Facebook use Elasticsearch.
  • 23. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context As demonstrated on the next slide, the search for diabetes surfaced two indexed documents. Using word stemming, the search found both diabetes and diabetic. But there are two problems with this search: it missed the mention of NIDDM (a synonym) and neither result is relevant to a medical cohort query for diabetics.
  • 24. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context diabetes Go Results: 2 records, 0.0 ms Document 2: Patient’s mother is diabetic. Patient’s sister is diabetic. Document 1: The patient has no diabetes or hypertension. Found both diabetes and diabetic (word stemming) Missed mention of NIDDM (synonyms) Neither result is relevant to a medical cohort query for diabetics (context) Although the simple, familiar interface and fast results (generated by the inverted index) worked well, three text search must-haves for medical searches—display, medical terminologies, and context—are missing.
  • 25. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #1: Optimize Results Display for Use Cases We must create a solution that matches the needs of those using the data. For example, users of data may not have permission to view PHI. If we aggregate the results of the text analysis, then we provide users with access to the key results they need without compromising privacy.
  • 26. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #2: Expand Search with Medical Terminologies If you recall, the diabetes search didn’t include NIDDM, a synonym of diabetes. Health systems will produce more sophisticated analyses by leveraging medical terminologies to automatically expand synonym lists. Providing a quick and easy way for users to identify synonyms valuable to their search will increase the quantity and accuracy of the results.
  • 27. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #3: Refine Results with Context Context matters. To be useful for clinical applications (e.g., searching for genotype/phenotype correlations), retrieving patients eligible for a clinical trial, or identifying disease outbreaks, simply identifying clinical conditions in the text is insufficient. Information described in the context of the clinical condition is critical for understanding the patient’s state.
  • 28. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #3: Refine Results with Context If we are searching for patients with pneumonia, then searching for “pneumonia” without considering the context would result in identifying the following types of phrases: • “ruled out pneumonia” • “history of pneumonia” • “family history of pneumonia” None of these indicate the patient currently has pneumonia.
  • 29. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #3: Refine Results with Context ConText, an NLP pattern matching algorithm published in 2009, solves for these scenarios through enhancing text search and refining results by detecting conditions and determining if they are negated, historical, or experienced by someone else.
  • 30. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Must-Have #3: Refine Results with Context The following slide shows how the ConText algorithm analyzes the sentence: “No history of chest tightness but family history of CHF” It looks for several things, including: • Conditions (e.g., chest tightness) • Negation triggers (e.g., the word “no”) • Historical triggers (e.g., the word “history”) • Termination (e.g., the word “but”) • Experiencer triggers (e.g., the word “family”)
  • 31. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Wendy W. Chapman, David Chu, John N. Dowling J Biomed Inform. 2009 Oct; 42(5): 839–851. Chest tightness Negation: affirmed Experiencer: patient Temporality: recent ”No history of chest tightness but family history of CHF.” ConText CHF Negation: affirmed Experiencer: patient Temporality: recent Chest tightness Negation: negated Experiencer: patient Temporality: historical CHF Negation: affirmed Experiencer: other Temporality: historical Negation trigger Historical trigger Termination Termination Historical triggerCondition Condition Other experiencer trigger
  • 32. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Applying Context and Extracting Values with an NLP Pipeline NLP pipelines can be run using one of several Java-based, open source tools, such as Apache Unstructured Information Management Architecture (UIMA) and General Architecture for Text Engineering (GATE). After starting a query with a clinical search and expanding that search to include meaningful terms, the next step is passing the search onto an NLP pipeline.
  • 33. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Applying Context and Extracting Values with an NLP Pipeline In this example, a series of three analyses performed on the text that relate to and build on each other: 1. Sentence detection. 2. Entity recognition (e.g., diabetes). 3. Context algorithm.
  • 34. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Applying Context and Extracting Values with an NLP Pipeline • Analysis of context uses a sentence as an operand • Identifying sentences in clinical text is not straightforward ̶ Have you ever seen punctuation in a clinical note? • An NLP analysis pipeline ties it all together Search results Sentence detection Entity recognition (i.e. diabetes) Context Algorithm Present user with additional filters NLP Pipeline Frameworks • Apache Unstructured Information Management Architecture (UIMA) • General Architecture for Text Engineering (GATE) • Natural Language Toolkit (NLTK)
  • 35. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context After the third and final step in this NLP pipeline (context algorithm), users are presented with additional filters in which they can choose their desired context (next slide). When it comes to text analytics frameworks, flexibility to run algorithms based on a user’s specific question is key. The NLP pipeline should create this flexibility by expanding search capacity to refine results. Applying Context and Extracting Values with an NLP Pipeline
  • 36. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Filter Diabetes Results Context: Apply to Search Results
  • 37. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context The NLP pipeline should also extract values as demanded by each use case. Many health systems are burdened by regulatory reporting, especially when certain measures like ejection fraction are not stored as discrete values. In lieu of automated reporting, health systems are obligated to engage team members to be “chart abstractors” to manually open thousands of patient charts to find ejection fraction values found in clinical notes. Applying Context and Extracting Values with an NLP Pipeline
  • 38. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context An NLP pipeline should not only identify when ejection fraction is documented in a note, but also save each value in such a way that the organization’s analytics platform can utilize the discrete value in their automated reporting. Other common extraction projects include aortic root size, PHQ depression scores, and ankle brachial index.. Applying Context and Extracting Values with an NLP Pipeline
  • 39. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Three Text Search Must-Haves: Display, Medical Terminologies, and Context Other Pieces to the NLP Pipeline: Extract Values ef_phrase qualifiers ef_low ef_high ef_mid ef_word ejection fraction is at least 70-75 is at least 70 75 72.5 NULL ejection fraction of about 20 of about 20 20 20 NULL ejection fraction of 60 of 60 60 60 NULL ejection fraction of greater than 65 of greater than 65 65 65 NULL ejection fraction of 55 of 55 55 55 NULL ejection fraction by visual inspection is 65 by visual inspection is 65 65 65 NULL LVEF is normal is NULL NULL NULL normal b(((LV)?EF)|(Ejections+Fraction))s+(?<qualifiers>([^sd]+s+){0,5})(?(((?<ef_low>d+)- (?<ef_high>d+))|(?<ef_mid_txt>d+)|(?<ef_word>([^s]*?normal)|(moderate)|(severe)))
  • 40. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Always Validate the Algorithm Quantifying algorithm accuracy is critical. Every algorithm needs a validation workflow, which typically includes four steps: 1. Build studies to review query results. 2. Assign team members to review results. 3. Randomly select records to represent study. 4. Highlight key words for easy chart review. When evaluating NLP tools or devising an NLP strategy, health systems should ensure validation is either built into the tool or very easy to do.
  • 41. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Focus on Interoperability and Pragmatic Integration Health systems need to combine text analytics with discrete data, and an enterprise data warehouse (EDW) is a great place to do that. A siloed approach to text analytics won’t work; make sure to integrate validated text analytics with all other analytics within the organization. For example, a health system would put its diabetes and PAD cohorts into the EDW and make that data available system-wide. Health systems need a platform that can pull in data on a regular, timely basis to ensure it’s current.
  • 42. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Focus on Interoperability and Pragmatic Integration Text Analytics Must Be Interoperable! Diabetes Cohort PAD Cohort Ejection Fraction Tumor Sizes Validated Text Analytics … Data Warehouse • Population Analytics • Care Improvement • Operational Improvement • Financial Improvement • Research
  • 43. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Focus on Interoperability and Pragmatic Integration Text analytics will open a new world of data analysis to health systems. It’s unlikely that anyone can identify the exact data uses and needs for text data in the next two, three, or five years. Instead of trying to make lasting decisions on a data model up front, we recommend focusing on what’s needed to perform timely, relevant analytics; healthcare analytics that quickly adapt to new questions and use cases.
  • 44. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Focus on Interoperability and Pragmatic Integration This approach follows the Health Catalyst® Late-Binding™ architecture that allows for the flexibility that’s so critical for the constantly evolving world of text analytics. Health Catalyst Data Warehouse eBook. FREE DOWNLOAD
  • 45. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How to Use Text Analytics to Improve Outcomes The power of text analytics in healthcare to create precise patient registries is undeniable. Rather than manually curating patient registries and relying solely on coded data, health systems should leverage data from coded data sets, NLP, and extraction projects to create precise patient registries. Rather than limiting the use of text analytics to data scientists, architects, and analysts, health systems should make these tools widely accessible and empower the entire organization to leverage clinical text analytics.
  • 46. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How to Use Text Analytics to Improve Outcomes Health systems can start using text analytics to improve outcomes today by focusing on four key components: #1: Optimize Text Search (Display, Medical Terminologies, and Context) #2: Enhance Context and Extract Values with an NLP Pipeline #3: Always Validate the Algorithm #4: Focus on Interoperability and Integration Using a Late-Binding Approach
  • 47. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. How to Use Text Analytics to Improve Outcomes This broad, all-encompassing approach to text analytics in healthcare will position health systems for clinical and financial success as they strive to create precise patient registries, enhance their understanding of high-risk patient populations, and improve outcomes. Health systems should invest in a solution that not only solves today’s problems, but can also be expanded to solve future use cases.
  • 48. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. For more information: “This book is a fantastic piece of work” – Robert Lindeman MD, FAAP, Chief Physician Quality Officer
  • 49. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. More about this topic Link to original article for a more in-depth discussion. How to Use Text Analytics in Healthcare to Improve Outcomes—Why You Need More than NLP Regenstrief Institute and Health Catalyst Team to Reveal Hidden Meaning in Clinical Data for Better Patient Care – Health Catalyst News Healthcare Analytics Adoption Model: A Framework and Roadmap (white paper) Dale Sanders, Executive VP of Software Healthcare Analytics: Realizing the Value of Health IT Brian Ahier – Guest Blogger 3 Reasons Why Comparative Analytics, Predictive Analytics, and NLP Won’t Solve Healthcare’s Problems ̶ Dale Sanders, Executive VP of Software Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going Doug Adamson, Chief Technology Officer, VP
  • 50. © 2016 Health Catalyst Proprietary. Feel free to share but we would appreciate a Health Catalyst citation. Eric Just joined the Health Catalyst family in August of 2011 as Vice President of Technology, bringing over 10 years of biomedical informatics experience. Prior to Health Catalyst, he managed the research arm of the Northwestern Medical Data Warehouse at Northwestern University's Feinberg School of Medicine. In this role, he led the development of technology, processes, and teams to leverage the clinical data warehouse. Previously, as a senior data architect, he helped create the data warehouse technical foundation and innovated new ways to extract and load medical data. In addition, he led the development effort for a genome database. Eric holds a Master of Science in Chemistry from Northwestern University and a Bachelors of Science in Chemistry from the College of William and Mary. Other Clinical Quality Improvement Resources Click to read additional information at www.healthcatalyst.com