SlideShare a Scribd company logo
1 of 33
Download to read offline
Building a search engine to find
environmental and phenotypic factors
associated with disease and health
Chirag J Patel

Northeast Big Data Innovation Hub Workshop

02/24/17
chirag@hms.harvard.edu
@chiragjp
www.chiragjpgroup.org
P = G + EType 2 Diabetes

Cancer

Alzheimer’s

Gene expression
Phenotype Genome
Variants
Environment
Infectious agents

Nutrients

Pollutants

Drugs
We are great at G investigation!
over 2400 

Genome-wide Association Studies (GWAS)

https://www.ebi.ac.uk/gwas/
G
Nothing comparable to elucidate E influence!
E: ???
We lack high-throughput methods
and data to discover new E in P…
until now!
σ2
G
σ2P
H2 =
Heritability (H2) is the range of phenotypic
variability attributed to genetic variability in a
population
Indicator of the proportion of phenotypic
differences attributed to G.
Eye color
Hair curliness
Type-1 diabetes
Height
Schizophrenia
Epilepsy
Graves' disease
Celiac disease
Polycystic ovary syndrome
Attention deficit hyperactivity disorder
Bipolar disorder
Obesity
Alzheimer's disease
Anorexia nervosa
Psoriasis
Bone mineral density
Menarche, age at
Nicotine dependence
Sexual orientation
Alcoholism
Lupus
Rheumatoid arthritis
Crohn's disease
Migraine
Thyroid cancer
Autism
Blood pressure, diastolic
Body mass index
Depression
Coronary artery disease
Insomnia
Menopause, age at
Heart disease
Prostate cancer
QT interval
Breast cancer
Ovarian cancer
Hangover
Stroke
Asthma
Blood pressure, systolic
Hypertension
Osteoarthritis
Parkinson's disease
Longevity
Type-2 diabetes
Gallstone disease
Testicular cancer
Cervical cancer
Sciatica
Bladder cancer
Colon cancer
Lung cancer
Leukemia
Stomach cancer
0 25 50 75 100
Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com
G estimates for burdensome diseases are low and variable:
massive opportunity for E discovery
Type 2 Diabetes
Heart Disease
Asthma
Eye color
Hair curliness
Type-1 diabetes
Height
Schizophrenia
Epilepsy
Graves' disease
Celiac disease
Polycystic ovary syndrome
Attention deficit hyperactivity disorder
Bipolar disorder
Obesity
Alzheimer's disease
Anorexia nervosa
Psoriasis
Bone mineral density
Menarche, age at
Nicotine dependence
Sexual orientation
Alcoholism
Lupus
Rheumatoid arthritis
Crohn's disease
Migraine
Thyroid cancer
Autism
Blood pressure, diastolic
Body mass index
Depression
Coronary artery disease
Insomnia
Menopause, age at
Heart disease
Prostate cancer
QT interval
Breast cancer
Ovarian cancer
Hangover
Stroke
Asthma
Blood pressure, systolic
Hypertension
Osteoarthritis
Parkinson's disease
Longevity
Type-2 diabetes
Gallstone disease
Testicular cancer
Cervical cancer
Sciatica
Bladder cancer
Colon cancer
Lung cancer
Leukemia
Stomach cancer
0 25 50 75 100
Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com
G estimates for complex traits are low and variable:
massive opportunity for high-throughput E discovery
σ2
E : Exposome!
Enhance accessibility of clinical and environmental data,
and analytic artificial intelligence tools!
How can we drive discovery of 

environmental factors (E) in disease phenotypes (P)?
Enhance accessibility of large open data and tools to 

drive discovery of 

environmental factors (E) in disease phenotypes (P)
Greg Cooper, MD, PhD
Pittsburgh
Vasant Honavar, PhD
Penn State
Noémie Elhadad, PhD
Columbia
Chirag Patel, PhD
Harvard
Where do we get disease (P) data?
wearable.com
Where do we get disease P data?

Health record data from your doctor!
• Longitudinal data on millions of patients
• diagnoses, prescriptions, lab reports, notes
• Sitting there in institutional IT infrastructure
• OHDSI provides a unified model to access data across
institutions, enhancing the scientific process!
Noémie Elhadad, PhD
Columbia
Capitalize on digitalized health record data 

(from around the world)!
High-powered dataset(s) for discovery.
Where do we get environmental (E) data?
Examples of sources of disparate external exposome datasets
available in the Exposome Data Warehouse
• Geological
• NASA - Cloud and Atmosphere Profiles

• NOAA Climate Data

• Pollution
• EPA Air Quality Surveillance Data Mart, or AirData,

• Socio-Economic
• US Census American Community Survey (ACS)

• Epidemiological
• CDC Wonder, USDA Food Atlas
A key challenge: 

mashing up Exposome Data Warehouse with patient
data from OHDSI
f(location, time)
PM2.5
income
Pollen count
EPA AirData
American Community
Survey
NOAA Climate
home zipcode
encounter time
0.50
gini index
0.20
0.1
hh income
(K)
100
50
3030
15
Wind
0
individualn
…
..
..
..
..
3/5/1998yes
10
E?age
no
95376
20
sex
M
Temp
individual2
PM2.5
individual1
75 55
02215
1/1/2016 23
Pb
individual3
21
70
F
M
zip
35
Time(E)
2312/11/2015
0
yes
50
302124
OHDSI EPA NOAA Census
millionsofpatients
Will it work? yep!
Does temperature (and weather) influence
asthma-related pediatric ER visits?
• Children <= 17 y/o with >=1
ICD9 code corresponding to 493.*

• N=56K, >84K ER visits
• Weather station data 

• (daily temperature, wind,
humidity)

• Case-crossover design (only
investigated cases)
Yeran Li, PhD (MS, HSPH)
Chirag Lakhani, PhD (HMS)
Yun Wang, PhD (Post-doc, HSPH)
Prevalence of asthma attack varies across the US
Temperature(F)
20
40
60
80
Lag
0
1
2
3
4
5
6
Relativerisk
0.95
1.00
1.05
20 40 60 80
0.91.11.3
Overall temperature effect
Mean temperature (F)
Relativerisk
●
●
●
●
● ● ●
0 1 2 3 4 5 6
0.901.001.10
Lag effect at T=60F
Lag(day)
RelativeRisk
20 40 60 80
0.901.001.10
Temperature effect at Lag=0
Mean temperature (F)
RelativeRisk
Does temperature influence asthma ER visits?: yes!
Relative risk of asthma attack by mean temperature
Rates of asthma attacks depend on season?: yes!
0.9
1.0
1.1
20 40 60 80
season Fall Spring Summer Winter
Temperature)effects)vary)at)different)regions)
(use)NOAA)defini9on,)ER)kids))
h@ps://www.ncdc.noaa.gov/monitoringEreferences/maps/usEclimateEregions.php)
south
northeast
southeast
central
southwest
wn_central
en_central
west
northwest
Region counts by NOAA
Numofobs
02000060000
51052
78126
57238
68719
15256
18050
44551
22923
13895
south northeast southeast central southwest wn_central en_central west northwest
0.5
1.0
1.5
2.0
20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80
Temperature
Risk
weather effect: different weather zones by NOAA
Rates of asthma attacks dependent on region?: yes!
Does temperature (and weather) influence asthma-
related ER visits in kids?: the tip of the iceberg!
Lag
0
1
2
20 40 60 80
0.91.11.3
Overall temperature effect
Mean temperature (F)
Relativerisk
● ●
5 6
0F
20 40 60 80
0.901.001.10
Temperature effect at Lag=0
Mean temperature (F)
RelativeRisk
What other scientific questions?
•what is influence of pollen?

•what is the influence of air pollution?

•what about adults?
Can we replicate the analysis?
•different populations

•using different data

•with different analysts
ExposomeDB
Environmental Protection Agency
AirData
US Census
Socioeconomic data
National Oceanic and Atmospheric Administration (NOAA)
Climate Data
Observational Health Data
Sciences and Informatics
(OHDSI)
Harvard Medical School Columbia University P&S
Geotemporal database Network of observational data
Causal Analytics
A2
A3
A1
AX Aim X
Analytics Workshop
Hands-on workshops
for leveraging work in
Aims 1-3
Training Resources
Jupyter notebooks
Student internships at Pitt,
Harvard, PSU, and
Columbia P&S
OHDSI Node
A4
University of Pittsburgh Penn State University
Integrating the ExposomeData Warehouse with OHDSI
and causal modeling tools to drive and demonstrate
discovery.
Many hypotheses are possible to address: useful for
public health & planning!
What is the effect of air pollution levels in disease?
Do adverse weather conditions influence hospital use?
What pharmaceutical drugs lead to adverse health
outcomes?
How does socioeconomic context influence hospital use,
disease rates, and recovery?
We will harness tools in machine learning
extract signal from noise!
Greg Cooper, MD, PhD
Pittsburgh
Vasant Honavar, PhD
Penn State
SES
PM2.5
asthma
oF
bayesian networks
socio-economic
case-crossover
weather
air pollution
Sci Rep 2016
http://www.ccd.pitt.edu/tools/
Integrating the ExposomeDB with OHDSI and analytics
to drive and demonstrate discovery by the community!
(trainees especially welcome)
• 2-day hands-on workshop in New York or Boston

• remote “exchange” internship program and 2-week
immersion

• dissemination of electronic training resources
Many hypotheses are possible to address: useful for can
we build a machine learning predictor to estimate E?
Eye color
Hair curliness
Type-1 diabetes
Height
Schizophrenia
Epilepsy
Graves' disease
Celiac disease
Polycystic ovary syndrome
Attention deficit hyperactivity disorder
Bipolar disorder
Obesity
Alzheimer's disease
Anorexia nervosa
Psoriasis
Bone mineral density
Menarche, age at
Nicotine dependence
Sexual orientation
Alcoholism
Lupus
Rheumatoid arthritis
Crohn's disease
Migraine
Thyroid cancer
Autism
Blood pressure, diastolic
Body mass index
Depression
Coronary artery disease
Insomnia
Menopause, age at
Heart disease
Prostate cancer
QT interval
Breast cancer
Ovarian cancer
Hangover
Stroke
Asthma
Blood pressure, systolic
Hypertension
Osteoarthritis
Parkinson's disease
Longevity
Type-2 diabetes
Gallstone disease
Testicular cancer
Cervical cancer
Sciatica
Bladder cancer
Colon cancer
Lung cancer
Leukemia
Stomach cancer
0 25 50 75 100
Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com
σ2
E : Exposome!
Harvard DBMI
Isaac Kohane

Susanne Churchill

Nathan Palmer

Sunny Alvear

Michal Preminger
Chirag J Patel

chirag@hms.harvard.edu

@chiragjp

www.chiragjpgroup.org
Thanks
RagGroup
Chirag Lakhani
Yeran Li
Shreyas Bhave

Rolando Acosta
Noémie Elhadad (Columbia)
Vasant Honavar (PSU)
Greg Cooper (Pitt)
George Hripcsak (Columbia)
René Baston
Katie Naum
Kathleen McKeown
NIH Common Fund

Big Data to Knowledge

More Related Content

What's hot

Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Chirag Patel
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseChirag Patel
 
AACR 041616 digital exposomes
AACR 041616 digital exposomesAACR 041616 digital exposomes
AACR 041616 digital exposomesChirag Patel
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Chirag Patel
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slidesChirag Patel
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EChirag Patel
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Chirag Patel
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...ExternalEvents
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm CrawfordSean Paul
 
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-Cattle
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-CattleFrequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-Cattle
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-CattleUniversitasGadjahMada
 
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...John Blue
 
Placental gene expression mediates the interaction between obstetrical histor...
Placental gene expression mediates the interaction between obstetrical histor...Placental gene expression mediates the interaction between obstetrical histor...
Placental gene expression mediates the interaction between obstetrical histor...BARRY STANLEY 2 fasd
 
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...LPE Learning Center
 
Wildlife-livestock-human interface: recognising drivers of disease
Wildlife-livestock-human interface: recognising drivers of diseaseWildlife-livestock-human interface: recognising drivers of disease
Wildlife-livestock-human interface: recognising drivers of diseaseILRI
 
Pioneer dehradun-english-edition-2021-06-05
Pioneer dehradun-english-edition-2021-06-05Pioneer dehradun-english-edition-2021-06-05
Pioneer dehradun-english-edition-2021-06-05DunEditorial
 

What's hot (20)

Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in disease
 
AACR 041616 digital exposomes
AACR 041616 digital exposomesAACR 041616 digital exposomes
AACR 041616 digital exposomes
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven E
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-Cattle
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-CattleFrequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-Cattle
Frequency and Risk-Factors Analysis of Escherichia coli O157:H7 in Bali-Cattle
 
Parent of origin effect
Parent of origin effectParent of origin effect
Parent of origin effect
 
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
 
Placental gene expression mediates the interaction between obstetrical histor...
Placental gene expression mediates the interaction between obstetrical histor...Placental gene expression mediates the interaction between obstetrical histor...
Placental gene expression mediates the interaction between obstetrical histor...
 
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...
Manure Irrigation: Airborne Pathogen Transport and Assessment of Technology U...
 
SEMINAR SLIDE
SEMINAR SLIDESEMINAR SLIDE
SEMINAR SLIDE
 
BUi_Final Presentation_SDZSP
BUi_Final Presentation_SDZSPBUi_Final Presentation_SDZSP
BUi_Final Presentation_SDZSP
 
Wildlife-livestock-human interface: recognising drivers of disease
Wildlife-livestock-human interface: recognising drivers of diseaseWildlife-livestock-human interface: recognising drivers of disease
Wildlife-livestock-human interface: recognising drivers of disease
 
Pioneer dehradun-english-edition-2021-06-05
Pioneer dehradun-english-edition-2021-06-05Pioneer dehradun-english-edition-2021-06-05
Pioneer dehradun-english-edition-2021-06-05
 

Similar to NSF Northeast Hub Big Data Workshop

Eysenbach: Infodemiology and Infoveillance
Eysenbach: Infodemiology and InfoveillanceEysenbach: Infodemiology and Infoveillance
Eysenbach: Infodemiology and InfoveillanceGunther Eysenbach
 
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...butest
 
Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Athula Herath
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationTHL
 
Introduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceIntroduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceGeorge Moulton
 
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...ExternalEvents
 
Oral presentation Slides
Oral presentation SlidesOral presentation Slides
Oral presentation SlidesJessTuck
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014muink
 
Role of standards, consumer education and media literacy
Role of standards, consumer education and media literacyRole of standards, consumer education and media literacy
Role of standards, consumer education and media literacyFrancisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 

Similar to NSF Northeast Hub Big Data Workshop (20)

Eysenbach: Infodemiology and Infoveillance
Eysenbach: Infodemiology and InfoveillanceEysenbach: Infodemiology and Infoveillance
Eysenbach: Infodemiology and Infoveillance
 
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...
pptx - Preventing Sepsis: Artificial Intelligence, Knowledge ...
 
Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
 
Introduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceIntroduction to Epidemiology and Surveillance
Introduction to Epidemiology and Surveillance
 
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
 
Oral presentation Slides
Oral presentation SlidesOral presentation Slides
Oral presentation Slides
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
Role of standards, consumer education and media literacy
Role of standards, consumer education and media literacyRole of standards, consumer education and media literacy
Role of standards, consumer education and media literacy
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 

Recently uploaded

Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...narwatsonia7
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...Taniya Sharma
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiSuhani Kapoor
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...Neha Kaur
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...narwatsonia7
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatorenarwatsonia7
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Servicevidya singh
 
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableDipal Arora
 

Recently uploaded (20)

Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
Top Rated Bangalore Call Girls Mg Road ⟟ 8250192130 ⟟ Call Me For Genuine Sex...
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
 
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Panvel Mumbai📲 9833363713 💞 Full Night Enjoy
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD available
 

NSF Northeast Hub Big Data Workshop

  • 1. Building a search engine to find environmental and phenotypic factors associated with disease and health Chirag J Patel Northeast Big Data Innovation Hub Workshop 02/24/17 chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org
  • 2. P = G + EType 2 Diabetes Cancer Alzheimer’s Gene expression Phenotype Genome Variants Environment Infectious agents Nutrients Pollutants Drugs
  • 3. We are great at G investigation! over 2400 Genome-wide Association Studies (GWAS) https://www.ebi.ac.uk/gwas/ G
  • 4. Nothing comparable to elucidate E influence! E: ??? We lack high-throughput methods and data to discover new E in P… until now!
  • 5. σ2 G σ2P H2 = Heritability (H2) is the range of phenotypic variability attributed to genetic variability in a population Indicator of the proportion of phenotypic differences attributed to G.
  • 6. Eye color Hair curliness Type-1 diabetes Height Schizophrenia Epilepsy Graves' disease Celiac disease Polycystic ovary syndrome Attention deficit hyperactivity disorder Bipolar disorder Obesity Alzheimer's disease Anorexia nervosa Psoriasis Bone mineral density Menarche, age at Nicotine dependence Sexual orientation Alcoholism Lupus Rheumatoid arthritis Crohn's disease Migraine Thyroid cancer Autism Blood pressure, diastolic Body mass index Depression Coronary artery disease Insomnia Menopause, age at Heart disease Prostate cancer QT interval Breast cancer Ovarian cancer Hangover Stroke Asthma Blood pressure, systolic Hypertension Osteoarthritis Parkinson's disease Longevity Type-2 diabetes Gallstone disease Testicular cancer Cervical cancer Sciatica Bladder cancer Colon cancer Lung cancer Leukemia Stomach cancer 0 25 50 75 100 Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com G estimates for burdensome diseases are low and variable: massive opportunity for E discovery Type 2 Diabetes Heart Disease Asthma
  • 7. Eye color Hair curliness Type-1 diabetes Height Schizophrenia Epilepsy Graves' disease Celiac disease Polycystic ovary syndrome Attention deficit hyperactivity disorder Bipolar disorder Obesity Alzheimer's disease Anorexia nervosa Psoriasis Bone mineral density Menarche, age at Nicotine dependence Sexual orientation Alcoholism Lupus Rheumatoid arthritis Crohn's disease Migraine Thyroid cancer Autism Blood pressure, diastolic Body mass index Depression Coronary artery disease Insomnia Menopause, age at Heart disease Prostate cancer QT interval Breast cancer Ovarian cancer Hangover Stroke Asthma Blood pressure, systolic Hypertension Osteoarthritis Parkinson's disease Longevity Type-2 diabetes Gallstone disease Testicular cancer Cervical cancer Sciatica Bladder cancer Colon cancer Lung cancer Leukemia Stomach cancer 0 25 50 75 100 Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com G estimates for complex traits are low and variable: massive opportunity for high-throughput E discovery σ2 E : Exposome!
  • 8. Enhance accessibility of clinical and environmental data, and analytic artificial intelligence tools! How can we drive discovery of environmental factors (E) in disease phenotypes (P)?
  • 9. Enhance accessibility of large open data and tools to drive discovery of environmental factors (E) in disease phenotypes (P) Greg Cooper, MD, PhD Pittsburgh Vasant Honavar, PhD Penn State Noémie Elhadad, PhD Columbia Chirag Patel, PhD Harvard
  • 10. Where do we get disease (P) data?
  • 12. Where do we get disease P data? Health record data from your doctor! • Longitudinal data on millions of patients • diagnoses, prescriptions, lab reports, notes • Sitting there in institutional IT infrastructure • OHDSI provides a unified model to access data across institutions, enhancing the scientific process!
  • 14. Capitalize on digitalized health record data (from around the world)! High-powered dataset(s) for discovery.
  • 15. Where do we get environmental (E) data?
  • 16. Examples of sources of disparate external exposome datasets available in the Exposome Data Warehouse • Geological • NASA - Cloud and Atmosphere Profiles • NOAA Climate Data • Pollution • EPA Air Quality Surveillance Data Mart, or AirData, • Socio-Economic • US Census American Community Survey (ACS) • Epidemiological • CDC Wonder, USDA Food Atlas
  • 17. A key challenge: mashing up Exposome Data Warehouse with patient data from OHDSI f(location, time) PM2.5 income Pollen count EPA AirData American Community Survey NOAA Climate home zipcode encounter time
  • 18. 0.50 gini index 0.20 0.1 hh income (K) 100 50 3030 15 Wind 0 individualn … .. .. .. .. 3/5/1998yes 10 E?age no 95376 20 sex M Temp individual2 PM2.5 individual1 75 55 02215 1/1/2016 23 Pb individual3 21 70 F M zip 35 Time(E) 2312/11/2015 0 yes 50 302124 OHDSI EPA NOAA Census millionsofpatients
  • 20. Does temperature (and weather) influence asthma-related pediatric ER visits? • Children <= 17 y/o with >=1 ICD9 code corresponding to 493.* • N=56K, >84K ER visits • Weather station data • (daily temperature, wind, humidity) • Case-crossover design (only investigated cases) Yeran Li, PhD (MS, HSPH) Chirag Lakhani, PhD (HMS) Yun Wang, PhD (Post-doc, HSPH)
  • 21. Prevalence of asthma attack varies across the US
  • 22. Temperature(F) 20 40 60 80 Lag 0 1 2 3 4 5 6 Relativerisk 0.95 1.00 1.05 20 40 60 80 0.91.11.3 Overall temperature effect Mean temperature (F) Relativerisk ● ● ● ● ● ● ● 0 1 2 3 4 5 6 0.901.001.10 Lag effect at T=60F Lag(day) RelativeRisk 20 40 60 80 0.901.001.10 Temperature effect at Lag=0 Mean temperature (F) RelativeRisk Does temperature influence asthma ER visits?: yes! Relative risk of asthma attack by mean temperature
  • 23. Rates of asthma attacks depend on season?: yes! 0.9 1.0 1.1 20 40 60 80 season Fall Spring Summer Winter
  • 24. Temperature)effects)vary)at)different)regions) (use)NOAA)defini9on,)ER)kids)) h@ps://www.ncdc.noaa.gov/monitoringEreferences/maps/usEclimateEregions.php) south northeast southeast central southwest wn_central en_central west northwest Region counts by NOAA Numofobs 02000060000 51052 78126 57238 68719 15256 18050 44551 22923 13895 south northeast southeast central southwest wn_central en_central west northwest 0.5 1.0 1.5 2.0 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 20 40 60 80 Temperature Risk weather effect: different weather zones by NOAA Rates of asthma attacks dependent on region?: yes!
  • 25.
  • 26. Does temperature (and weather) influence asthma- related ER visits in kids?: the tip of the iceberg! Lag 0 1 2 20 40 60 80 0.91.11.3 Overall temperature effect Mean temperature (F) Relativerisk ● ● 5 6 0F 20 40 60 80 0.901.001.10 Temperature effect at Lag=0 Mean temperature (F) RelativeRisk What other scientific questions? •what is influence of pollen? •what is the influence of air pollution? •what about adults? Can we replicate the analysis? •different populations •using different data •with different analysts
  • 27. ExposomeDB Environmental Protection Agency AirData US Census Socioeconomic data National Oceanic and Atmospheric Administration (NOAA) Climate Data Observational Health Data Sciences and Informatics (OHDSI) Harvard Medical School Columbia University P&S Geotemporal database Network of observational data Causal Analytics A2 A3 A1 AX Aim X Analytics Workshop Hands-on workshops for leveraging work in Aims 1-3 Training Resources Jupyter notebooks Student internships at Pitt, Harvard, PSU, and Columbia P&S OHDSI Node A4 University of Pittsburgh Penn State University Integrating the ExposomeData Warehouse with OHDSI and causal modeling tools to drive and demonstrate discovery.
  • 28. Many hypotheses are possible to address: useful for public health & planning! What is the effect of air pollution levels in disease? Do adverse weather conditions influence hospital use? What pharmaceutical drugs lead to adverse health outcomes? How does socioeconomic context influence hospital use, disease rates, and recovery?
  • 29. We will harness tools in machine learning extract signal from noise! Greg Cooper, MD, PhD Pittsburgh Vasant Honavar, PhD Penn State SES PM2.5 asthma oF bayesian networks socio-economic case-crossover weather air pollution Sci Rep 2016
  • 31. Integrating the ExposomeDB with OHDSI and analytics to drive and demonstrate discovery by the community! (trainees especially welcome) • 2-day hands-on workshop in New York or Boston • remote “exchange” internship program and 2-week immersion • dissemination of electronic training resources
  • 32. Many hypotheses are possible to address: useful for can we build a machine learning predictor to estimate E? Eye color Hair curliness Type-1 diabetes Height Schizophrenia Epilepsy Graves' disease Celiac disease Polycystic ovary syndrome Attention deficit hyperactivity disorder Bipolar disorder Obesity Alzheimer's disease Anorexia nervosa Psoriasis Bone mineral density Menarche, age at Nicotine dependence Sexual orientation Alcoholism Lupus Rheumatoid arthritis Crohn's disease Migraine Thyroid cancer Autism Blood pressure, diastolic Body mass index Depression Coronary artery disease Insomnia Menopause, age at Heart disease Prostate cancer QT interval Breast cancer Ovarian cancer Hangover Stroke Asthma Blood pressure, systolic Hypertension Osteoarthritis Parkinson's disease Longevity Type-2 diabetes Gallstone disease Testicular cancer Cervical cancer Sciatica Bladder cancer Colon cancer Lung cancer Leukemia Stomach cancer 0 25 50 75 100 Heritability: Var(G)/Var(Phenotype) Source: SNPedia.com σ2 E : Exposome!
  • 33. Harvard DBMI Isaac Kohane Susanne Churchill Nathan Palmer Sunny Alvear Michal Preminger Chirag J Patel chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org Thanks RagGroup Chirag Lakhani Yeran Li Shreyas Bhave Rolando Acosta Noémie Elhadad (Columbia) Vasant Honavar (PSU) Greg Cooper (Pitt) George Hripcsak (Columbia) René Baston Katie Naum Kathleen McKeown NIH Common Fund Big Data to Knowledge