SlideShare a Scribd company logo
1 of 32
Translating a Trillion Points of Data into
Diagnostics, Therapies and New Insights
in Health and Disease
atul.butte@ucsf.edu
@atulbutte
Atul Butte, MD, PhD
Director, Institute for Computational
Health Sciences
University of California, San Francisco
Conflicts of Interest
• Scientific founder and
advisory board membership
– Genstruct
– NuMedii
– Personalis
– Carmenta
• Honoraria for talks
– Lilly
– Pfizer
– Siemens
– Bristol Myers Squibb
– AstraZeneca
– Roche
– Genentech
– Warburg Pincus
• Past or present consultancy
– Lilly
– Johnson and Johnson
– Roche
– NuMedii
– Genstruct
– Tercica
– Ecoeos
– Helix
– Ansh Labs
– Prevendia
– Samsung
– Assay Depot
– Regeneron
– Verinata
– Pathway Diagnostics
– Geisinger Health
– Covance
– Wilson Sonsini Goodrich & Rosati
– Orrick
– 10X Genomics
– Medgenics
– GNS Healthcare
– Gerson Lehman Group
– Coatue Management
• Corporate Relationships
– Northrop Grumman
– Aptalis
– Allergan
– Astellas
– Thomson Reuters
– Intel
– SAP
– SV Angel
– Progenity
– Illumina
• Speakers’ bureau
– None
• Companies started by students
– Carmenta
– Serendipity
– Stimulomics
– NunaHealth
– Praedicat
– MyTime
– Flipora
– Tumbl.in
@affymetrix
bit.ly/genedata
The Cancer Genome Atlas
• 14 thousand cases
• 39 types of cancers
• 13 types of data: molecular, clinical, sequencing
227 million substances x
1.3 million assays
More than a billion measurements
within a grid of 300 trillion cells
71 million meet Lipinski 5
1.2 million active substances
http://www.nap.edu/catalog.php?record_id=13284
Marina Sirota
Preeclampsia: large cause of maternal and fetal death
• Incidence
• 5-8% of all pregnancies in the U.S. and worldwide
• 4.1 million births in the U.S. in 2009
• Up to 300K cases of preeclampsia annually in the U.S.
• Mortality
• Responsible for 18% of all maternal deaths in the U.S.
• Maternal death in 56 out of every 100,000 live births in US
• Neonatal death in 71 out of every 100,000 live births in US
• Cost
• $20 billion in direct costs in the U.S. annually
• Average hospital stay of 3.5 days
Linda Liu
Bruce Ling
Matt Cooper
New blood markers for preeclampsia
Linda Liu
Bruce Ling
Matt Cooper
@MarchofDimes
bit.ly/preeclamp
Need a
diagnostic for
preeclampsia
Public big data
available
March of Dimes
Center for
Prematurity
Research
Data analyzed,
diagnostic
designed
SPARK grant
($50k)
Life Science
Angels, other
seed investors
($2 million)
@CarmentaBio
progenity.com
bit.ly/carm_prog
@MatthewHerper
bit.ly/newdrug1
Cancer Discovery 2013, 3:1.
Psychiatric Drug Imipramine Shows Significant Activity
Against Small Cell Lung Cancer
Vehicle control Imipramine
p53/Rb/p130
triple knockout
model of SCLC
Mice dosed after
tumor formation
Joel Dudley
Nadine Jahchan
Julien Sage
Alejandro Sweet-Cordero
Joel Neal
@NuMedii
Bin Chen
Wei Wei
Li Ma
Bin Yang
Mei-Sze Chua
Samuel So
Gastroenterology, 2017
Need more drugs
for more diseases
Public big data
available
NIH funding
Data analyzed,
method designed
Company launched,
ARRA, StartX,
Stanford license,
first deal
Claremont Creek,
Lightspeed ($3.5
million)
@NuMedii
The next big open data: clinical trials
Download 100+ studies today
Drug repositioning, new patient subsets,
digital comparative effectiveness, more!
immport.org
Sanchita Bhattacharya
Elizabeth Thomson
22
Clinical Data Warehouse
A Big UC Healthcare Data Analytics Platform
Combining healthcare data from across the
six University of California medical schools and systems
The next big data: clinical data
ML lessons I’ve learned over 20 years
• Get the question right; solve the problems that health care professionals need solved
– Solve the problems that health care professionals need solved: Don't just guess
• And verify good questions and good unmet needs with more than one doc
– Build a great diagnostic vs. understanding the biology
• Perfectly lassoed variables may miss the big picture biology
– Biologists and medical professionals really love explanations over black boxes
• Watch out for input limiting models
– Patients might not type in the right codes for their symptoms
– They barely enter their own race/ethnicity
– And docs?
• Learn what IRB, HIPAA, BAA are. Learn what ICD-10 and CPT codes are. CLIA and CAP.
– And learn patience.
– Not all of us are cloud allergic.
• Not everything needs deep learning
• Having all data on everyone is super rare: genomics, images, and longitudinal EHR data?
• Health care inefficiency is not about friction
• Data integration and harmonization can happen if there is a business reason for it
• Platforms and their companies are seemingly commoditized
– Come to us with more medical knowledge and background. Convince us you care about this vertical.
– Show us that we are going to learn more from you, than we are going to have to teach you.
Open challenges
• Can’t teach a computer with half the game. Need the start and finish of
medical “stories”
– In medical care, need primary and tertiary care in the same database
– Compound data might be available, but trials data is not
– Means data integration and harmonization is a rate limiter
• Need the right diversity in data
– Otherwise might be extrapolating beyond what was learned
– Need enough data, big amounts of data, true-positive cases
• Need methods to handle complicated multi-modal data
• What does validation mean?
– Drug discovery: pre-clinical success? Or Phase 2 success?
– Clinical accuracy? Or clinical utility?
• Career fear uncertainty and doubt (FUD) in some circles
– Tech recruitment
– Too many startups?
UC Clinical Data Warehouse Team
Executive Team
• Atul Butte
• Joe Bengfort
• Michael Pfeffer
• Tom Andriola
• Chris Longhurst
Steering Committee
• Irfan Chaudhry
• Mohammed Mahbouba
• Lisa Dahm
• David Dobbs
• Kent Andersen
• Ralph James
• Jennifer Holland
• Eugene Lee
ETL Team
• Albert Dugan
• Tony Choe
• Michael Sweeney
• Timothy Satterwhite
• Ayan Patel
• Niranjan Wagle
• Ralph James
• Joseph Dalton
Data Harmonization
• Dana Ludwig
• Daniella Meeker
Data Quality
• Momeena Ali
• Jodie Nygaard
Epic
• Kevin Ames
• Ben Jenkins
• Steve Gesualdo
Business Analyst
• Ankeeta Shukla
Hardware
• Sandeep Chandra
• Jeff Love
• Scott Bailey
• Kwong Law
• Pallav Saxena
Support
• Jack Stobo
• Michael Blum
• Sam Hawgood
Support
Admin and Tech Staff
• Mary Lyall
• Mounira Kenaani
• Kevin Kaier
• Boris Oskotsky
• Mae Moredo
• Ada Chen
• University of California, San Francisco
• Pricilla Chan and Mark Zuckerberg
• NIH: NIAID, NLM, NIGMS, NCI, NHLBI, OD; NIDDK, NHGRI, NIA, NCATS
• March of Dimes
• Juvenile Diabetes Research Foundation
• Hewlett Packard
• Howard Hughes Medical Institute
• California Institute for Regenerative Medicine
• Luke Evnin and Deann Wright (Scleroderma Research Foundation)
• Clayville Research Fund
• PhRMA Foundation
• Stanford Cancer Center, Bio-X, SPARK
• Tarangini Deshpande
• Kimayani Butte
• Sam Hawgood and Keith Yamamoto
• Isaac Kohane

More Related Content

What's hot

2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...University of California, San Francisco
 
Atul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatAtul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatUniversity of California, San Francisco
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineIda Sim
 
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...University of California, San Francisco
 
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...The Hive
 
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit 2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit University of California, San Francisco
 

What's hot (20)

2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
 
Atul Butte's presentation at the Milken Institute Public Health Summit
Atul Butte's presentation at the Milken Institute Public Health SummitAtul Butte's presentation at the Milken Institute Public Health Summit
Atul Butte's presentation at the Milken Institute Public Health Summit
 
2013 09 atul butte mahajani symposium
2013 09 atul butte mahajani symposium2013 09 atul butte mahajani symposium
2013 09 atul butte mahajani symposium
 
Atul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatAtul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at Westat
 
2014 simr presentation
2014 simr presentation2014 simr presentation
2014 simr presentation
 
Atul Butte's AAPS big data workshop presentation 6/2015
Atul Butte's AAPS big data workshop presentation 6/2015Atul Butte's AAPS big data workshop presentation 6/2015
Atul Butte's AAPS big data workshop presentation 6/2015
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 
2015-11 Atul Butte's Presentation at Exponential Medicine
2015-11 Atul Butte's Presentation at Exponential Medicine2015-11 Atul Butte's Presentation at Exponential Medicine
2015-11 Atul Butte's Presentation at Exponential Medicine
 
Atul Butte's AAPS keynote presentation 6/2015
Atul Butte's AAPS keynote presentation 6/2015Atul Butte's AAPS keynote presentation 6/2015
Atul Butte's AAPS keynote presentation 6/2015
 
Atul Butte's presentation at JGI March 2015
Atul Butte's presentation at JGI March 2015Atul Butte's presentation at JGI March 2015
Atul Butte's presentation at JGI March 2015
 
Intro: California Initiative to Advance Precision Medicine Workshop
Intro: California Initiative to Advance Precision Medicine WorkshopIntro: California Initiative to Advance Precision Medicine Workshop
Intro: California Initiative to Advance Precision Medicine Workshop
 
2013 05 society for clinical trials
2013 05 society for clinical trials2013 05 society for clinical trials
2013 05 society for clinical trials
 
Atul Butte's presentation at LINCS 2013
Atul Butte's presentation at LINCS 2013Atul Butte's presentation at LINCS 2013
Atul Butte's presentation at LINCS 2013
 
2014 farr institute presentation
2014 farr institute presentation2014 farr institute presentation
2014 farr institute presentation
 
Presentation at ISMB NIH Office of Data Science Strategy Panel
Presentation at ISMB NIH Office of Data Science Strategy PanelPresentation at ISMB NIH Office of Data Science Strategy Panel
Presentation at ISMB NIH Office of Data Science Strategy Panel
 
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
 
2014 07 ismb personalized medicine
2014 07 ismb personalized medicine2014 07 ismb personalized medicine
2014 07 ismb personalized medicine
 
2013 01 pmwc atul butte scrubbed
2013 01 pmwc atul butte scrubbed2013 01 pmwc atul butte scrubbed
2013 01 pmwc atul butte scrubbed
 
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
 
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit 2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
 

Similar to Translating Trillion Points of Data into New Health Insights

The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesPhilip Payne
 
AI in Healthcare
AI in HealthcareAI in Healthcare
AI in HealthcarePaul Agapow
 
bootcamp 2023.pptx
bootcamp 2023.pptxbootcamp 2023.pptx
bootcamp 2023.pptxHVCClibrary
 
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...University of California, San Francisco
 
Sophia Zilber - Mito research and data webinar - June 3, 2021
Sophia Zilber - Mito research and data webinar - June 3, 2021Sophia Zilber - Mito research and data webinar - June 3, 2021
Sophia Zilber - Mito research and data webinar - June 3, 2021SophiaZilber
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?Paul Agapow
 
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015Louis Cady, MD
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDavid Lee Scher, MD
 
ML & AI in pharma: an overview
ML & AI in pharma: an overviewML & AI in pharma: an overview
ML & AI in pharma: an overviewPaul Agapow
 
Beyond Proofs of Concept for Biomedical AI
Beyond Proofs of Concept for Biomedical AIBeyond Proofs of Concept for Biomedical AI
Beyond Proofs of Concept for Biomedical AIPaul Agapow
 
6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril6-005-1430-Keeppanasseril
6-005-1430-Keeppanasserilmed20su
 
Health IT: The Big Picture (October 4, 2018)
Health IT: The Big Picture (October 4, 2018)Health IT: The Big Picture (October 4, 2018)
Health IT: The Big Picture (October 4, 2018)Nawanan Theera-Ampornpunt
 
From “Big Data” to Digital Medicine--PYA Explores Innovations in Healthcare
From “Big Data” to Digital Medicine--PYA Explores Innovations in HealthcareFrom “Big Data” to Digital Medicine--PYA Explores Innovations in Healthcare
From “Big Data” to Digital Medicine--PYA Explores Innovations in HealthcarePYA, P.C.
 

Similar to Translating Trillion Points of Data into New Health Insights (20)

The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
 
Atul Butte's presentation at CTIC 2020
Atul Butte's presentation at CTIC 2020Atul Butte's presentation at CTIC 2020
Atul Butte's presentation at CTIC 2020
 
Overview of Health IT
Overview of Health ITOverview of Health IT
Overview of Health IT
 
AI in Healthcare
AI in HealthcareAI in Healthcare
AI in Healthcare
 
bootcamp 2023.pptx
bootcamp 2023.pptxbootcamp 2023.pptx
bootcamp 2023.pptx
 
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
 
Sophia Zilber - Mito research and data webinar - June 3, 2021
Sophia Zilber - Mito research and data webinar - June 3, 2021Sophia Zilber - Mito research and data webinar - June 3, 2021
Sophia Zilber - Mito research and data webinar - June 3, 2021
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?
 
eReferrals - it's about communication
eReferrals - it's about communicationeReferrals - it's about communication
eReferrals - it's about communication
 
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015
How to Transition from Allopathic to Integrated Practice - IMM Brazil 2015
 
Health IT: The Big Picture (July 2, 2015)
Health IT: The Big Picture (July 2, 2015)Health IT: The Big Picture (July 2, 2015)
Health IT: The Big Picture (July 2, 2015)
 
Overview of Health IT
Overview of Health ITOverview of Health IT
Overview of Health IT
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient Advocate
 
Overview of Health IT (October 2, 2016)
Overview of Health IT (October 2, 2016)Overview of Health IT (October 2, 2016)
Overview of Health IT (October 2, 2016)
 
ML & AI in pharma: an overview
ML & AI in pharma: an overviewML & AI in pharma: an overview
ML & AI in pharma: an overview
 
Beyond Proofs of Concept for Biomedical AI
Beyond Proofs of Concept for Biomedical AIBeyond Proofs of Concept for Biomedical AI
Beyond Proofs of Concept for Biomedical AI
 
6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril6-005-1430-Keeppanasseril
6-005-1430-Keeppanasseril
 
The Attending Model Of Medicine
The Attending Model Of MedicineThe Attending Model Of Medicine
The Attending Model Of Medicine
 
Health IT: The Big Picture (October 4, 2018)
Health IT: The Big Picture (October 4, 2018)Health IT: The Big Picture (October 4, 2018)
Health IT: The Big Picture (October 4, 2018)
 
From “Big Data” to Digital Medicine--PYA Explores Innovations in Healthcare
From “Big Data” to Digital Medicine--PYA Explores Innovations in HealthcareFrom “Big Data” to Digital Medicine--PYA Explores Innovations in Healthcare
From “Big Data” to Digital Medicine--PYA Explores Innovations in Healthcare
 

Recently uploaded

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 

Recently uploaded (20)

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 

Translating Trillion Points of Data into New Health Insights

  • 1. Translating a Trillion Points of Data into Diagnostics, Therapies and New Insights in Health and Disease atul.butte@ucsf.edu @atulbutte Atul Butte, MD, PhD Director, Institute for Computational Health Sciences University of California, San Francisco
  • 2. Conflicts of Interest • Scientific founder and advisory board membership – Genstruct – NuMedii – Personalis – Carmenta • Honoraria for talks – Lilly – Pfizer – Siemens – Bristol Myers Squibb – AstraZeneca – Roche – Genentech – Warburg Pincus • Past or present consultancy – Lilly – Johnson and Johnson – Roche – NuMedii – Genstruct – Tercica – Ecoeos – Helix – Ansh Labs – Prevendia – Samsung – Assay Depot – Regeneron – Verinata – Pathway Diagnostics – Geisinger Health – Covance – Wilson Sonsini Goodrich & Rosati – Orrick – 10X Genomics – Medgenics – GNS Healthcare – Gerson Lehman Group – Coatue Management • Corporate Relationships – Northrop Grumman – Aptalis – Allergan – Astellas – Thomson Reuters – Intel – SAP – SV Angel – Progenity – Illumina • Speakers’ bureau – None • Companies started by students – Carmenta – Serendipity – Stimulomics – NunaHealth – Praedicat – MyTime – Flipora – Tumbl.in
  • 5. The Cancer Genome Atlas • 14 thousand cases • 39 types of cancers • 13 types of data: molecular, clinical, sequencing
  • 6. 227 million substances x 1.3 million assays More than a billion measurements within a grid of 300 trillion cells 71 million meet Lipinski 5 1.2 million active substances
  • 9.
  • 10. Preeclampsia: large cause of maternal and fetal death • Incidence • 5-8% of all pregnancies in the U.S. and worldwide • 4.1 million births in the U.S. in 2009 • Up to 300K cases of preeclampsia annually in the U.S. • Mortality • Responsible for 18% of all maternal deaths in the U.S. • Maternal death in 56 out of every 100,000 live births in US • Neonatal death in 71 out of every 100,000 live births in US • Cost • $20 billion in direct costs in the U.S. annually • Average hospital stay of 3.5 days Linda Liu Bruce Ling Matt Cooper
  • 11.
  • 12. New blood markers for preeclampsia Linda Liu Bruce Ling Matt Cooper @MarchofDimes bit.ly/preeclamp
  • 13. Need a diagnostic for preeclampsia Public big data available March of Dimes Center for Prematurity Research Data analyzed, diagnostic designed SPARK grant ($50k) Life Science Angels, other seed investors ($2 million) @CarmentaBio progenity.com bit.ly/carm_prog
  • 15.
  • 16. Cancer Discovery 2013, 3:1. Psychiatric Drug Imipramine Shows Significant Activity Against Small Cell Lung Cancer Vehicle control Imipramine p53/Rb/p130 triple knockout model of SCLC Mice dosed after tumor formation Joel Dudley Nadine Jahchan Julien Sage Alejandro Sweet-Cordero Joel Neal @NuMedii
  • 17. Bin Chen Wei Wei Li Ma Bin Yang Mei-Sze Chua Samuel So Gastroenterology, 2017
  • 18. Need more drugs for more diseases Public big data available NIH funding Data analyzed, method designed Company launched, ARRA, StartX, Stanford license, first deal Claremont Creek, Lightspeed ($3.5 million) @NuMedii
  • 19. The next big open data: clinical trials Download 100+ studies today Drug repositioning, new patient subsets, digital comparative effectiveness, more! immport.org Sanchita Bhattacharya Elizabeth Thomson
  • 20.
  • 21.
  • 22. 22
  • 23.
  • 24.
  • 25. Clinical Data Warehouse A Big UC Healthcare Data Analytics Platform Combining healthcare data from across the six University of California medical schools and systems
  • 26. The next big data: clinical data
  • 27.
  • 28.
  • 29. ML lessons I’ve learned over 20 years • Get the question right; solve the problems that health care professionals need solved – Solve the problems that health care professionals need solved: Don't just guess • And verify good questions and good unmet needs with more than one doc – Build a great diagnostic vs. understanding the biology • Perfectly lassoed variables may miss the big picture biology – Biologists and medical professionals really love explanations over black boxes • Watch out for input limiting models – Patients might not type in the right codes for their symptoms – They barely enter their own race/ethnicity – And docs? • Learn what IRB, HIPAA, BAA are. Learn what ICD-10 and CPT codes are. CLIA and CAP. – And learn patience. – Not all of us are cloud allergic. • Not everything needs deep learning • Having all data on everyone is super rare: genomics, images, and longitudinal EHR data? • Health care inefficiency is not about friction • Data integration and harmonization can happen if there is a business reason for it • Platforms and their companies are seemingly commoditized – Come to us with more medical knowledge and background. Convince us you care about this vertical. – Show us that we are going to learn more from you, than we are going to have to teach you.
  • 30. Open challenges • Can’t teach a computer with half the game. Need the start and finish of medical “stories” – In medical care, need primary and tertiary care in the same database – Compound data might be available, but trials data is not – Means data integration and harmonization is a rate limiter • Need the right diversity in data – Otherwise might be extrapolating beyond what was learned – Need enough data, big amounts of data, true-positive cases • Need methods to handle complicated multi-modal data • What does validation mean? – Drug discovery: pre-clinical success? Or Phase 2 success? – Clinical accuracy? Or clinical utility? • Career fear uncertainty and doubt (FUD) in some circles – Tech recruitment – Too many startups?
  • 31. UC Clinical Data Warehouse Team Executive Team • Atul Butte • Joe Bengfort • Michael Pfeffer • Tom Andriola • Chris Longhurst Steering Committee • Irfan Chaudhry • Mohammed Mahbouba • Lisa Dahm • David Dobbs • Kent Andersen • Ralph James • Jennifer Holland • Eugene Lee ETL Team • Albert Dugan • Tony Choe • Michael Sweeney • Timothy Satterwhite • Ayan Patel • Niranjan Wagle • Ralph James • Joseph Dalton Data Harmonization • Dana Ludwig • Daniella Meeker Data Quality • Momeena Ali • Jodie Nygaard Epic • Kevin Ames • Ben Jenkins • Steve Gesualdo Business Analyst • Ankeeta Shukla Hardware • Sandeep Chandra • Jeff Love • Scott Bailey • Kwong Law • Pallav Saxena Support • Jack Stobo • Michael Blum • Sam Hawgood
  • 32. Support Admin and Tech Staff • Mary Lyall • Mounira Kenaani • Kevin Kaier • Boris Oskotsky • Mae Moredo • Ada Chen • University of California, San Francisco • Pricilla Chan and Mark Zuckerberg • NIH: NIAID, NLM, NIGMS, NCI, NHLBI, OD; NIDDK, NHGRI, NIA, NCATS • March of Dimes • Juvenile Diabetes Research Foundation • Hewlett Packard • Howard Hughes Medical Institute • California Institute for Regenerative Medicine • Luke Evnin and Deann Wright (Scleroderma Research Foundation) • Clayville Research Fund • PhRMA Foundation • Stanford Cancer Center, Bio-X, SPARK • Tarangini Deshpande • Kimayani Butte • Sam Hawgood and Keith Yamamoto • Isaac Kohane