Insights from Genomics and
Computational Biology that inform
Precision Medicine
Warren Kibbe, PhD
warren.kibbe@nih.gov
@wakibbe
March 6th, 2017
1. Background
2. Data in Biomedicine
3. Data Sharing
4. Data Commons
5. Genomics and
Computation
Thanks to many folks for slides, but especially Jerry Lee
In 2016 there were an estimated
1,700,000 new cancer cases
and
600,000 cancer deaths
- American Cancer Society
Cancer remains the second most common cause of
death in the U.S.
- Centers for Disease Control and Prevention
In 2016 there were an estimated
15,500,000
cancer survivors in the U.S.
Understanding Cancer
 Precision medicine will lead to fundamental
understanding of the complex interplay between
genetics, epigenetics, nutrition, environment and clinical
presentation and direct effective, evidence-based
prevention and treatment.
(10,000+ patient tumors and increasing)
Courtesy of P. Kuhn (USC)
2006-2015:
A Decade of Illuminating the
Underlying Causes of Primary
Untreated Tumors Omics
Characterization
Cancer is a grand challenge
Requires:
Deep biological understanding
Advances in scientific methods
Advances in instrumentation
Advances in technology
Data and computation
Cancer research and care generate
detailed data that are critical to
creating a learning health system for cancer
The primary goal of TCGA is to identify disease subtypes and generate a
comprehensive catalog of all cancer genes and pathways
Pilot phase started in 2006 -- 3 tumor types 500 T/N pairs each
Extended in 2009 -- 20 x 500 T/N + 15 x (50-150) T/N
Freely available genomic data enable researchers anywhere
around the world to make and validate important discoveries
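The TCGA sample targets above can be tallied directly. The sketch below only multiplies out the figures as written on the slide (T/N meaning tumor/normal pairs); the variable names are illustrative and not part of any TCGA tooling.

```python
# Planned tumor/normal (T/N) pair counts, read directly from the slide text.
pilot_pairs = 3 * 500                      # 2006 pilot: 3 tumor types x 500 pairs

# 2009 extension: 20 tumor types x 500 pairs, plus 15 rarer types x 50-150 pairs.
extension_low = 20 * 500 + 15 * 50
extension_high = 20 * 500 + 15 * 150

print(f"Pilot: {pilot_pairs:,} T/N pairs")
print(f"Extension: {extension_low:,} to {extension_high:,} T/N pairs")
```

The extension range is consistent with the "10,000+ patient tumors" figure quoted earlier in the deck.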
Biological Scales
Tumor, Cancer, and Metastasis:
(Length-scale and Time-scale Matter)
“…>90% of deaths are caused by disseminated disease or metastasis…”
Gupta et al., Cell, 2006 and Siegel et al., CA Cancer J Clin, Jan/Feb 2016
5 year Relative Survival Rates (2016 report of 2005-2011 data)
Figure: timeline of consumer digital technology, 1997-2015. Labeled milestones: WinAMP (mp3), 4/21/1997; 802.11b WiFi, 9/16/1999; iPod (10 GB max), 10/23/2001; iPhone (EDGE, 16 GB max), 1/9/2007; iPhone 3G (16 GB max), 7/11/2008; iPad (EDGE, 64 GB max), 4/3/2010; Google Drive, 4/24/2012; iPhone 5 (LTE, 128 GB max), 9/12/2012; Google Baseline, 7/14/2014. Other marked dates: 4/23/2005, 7/15/2006, 9/26/2006, 2/7/2007, 3/9/2015.
Digital technology is changing rapidly
$640M
(FY74)
$5.21 B
(FY16)
Cancer therapy is changing
Application of Cancer Genomics is changing
Biology and medicine are now data-intensive
enterprises
Scale is rapidly changing
Technology, data, computing and IT are
pervasive in the lab, the clinic, in the
home, and across the population
“...an advantage of machine learning
is that it can be used even in cases
where it is infeasible or difficult to
write down explicit rules to solve a
problem...”
https://www.whitehouse.gov/sites/default/files/whitehouse_files/microsites/ostp/NSTC/preparing_for_the_future_of_ai.pdf
“there is great potential for new insights to come
from the combined analysis of cancer proteomic
and genomic data, as proteomic data can now
reproducibly provide information about protein
levels and activities that are difficult or impossible
to infer from genomic data alone”
Douglas R. Lowy, MD
Acting Director of the National Cancer Institute, National Institutes of Health
5/25/2016
Genomics is the beginning of
precision medicine, not the end!
Plumbing, architecture, genomics
Cancer Research Data Ecosystem – Cancer Moonshot BRP
Well characterized
research data sets Cancer cohorts Patient data
EHR, Lab Data, Imaging,
PROs, Smart Devices,
Decision Support
Learning from every
cancer patient
Active research
participation
Research information
donor
Clinical Research
Observational studies
Proteogenomics
Imaging data
Clinical trials
Discovery
Patient engaged
Research
Surveillance
Big Data
Implementation research
SEER
GDC
Data Commons Framework
Data Commons Structure
DICOM, AIM
Amazon
Google
IBM
Imaging
Validator
Q/A
Proteomic
Validator
Q/A
Clinical Phenotype
Validator
Q/A
MOD Phenotype
Validator
Q/A
Pathology
Radiology
Mass
Spectrometry
Array
Data
Commons
Security
Visualization
Authentication
& Authorization
Genomic
Validator
Q/A Germline Pipelines
DNA, RNA Pipelines
EMRs, Clinical
Trials
Azure
Data Contributors and Consumers
Researchers, Patients, Clinicians, Institutions
NCI Thesaurus
caDSR
NLM UMLS
RxNorm
LOINC
SNOMED
Cancer Data Sharing
& Data Commons
• Support open science
• Support data reusability
• Aligned with Cancer Moonshot
• Part of Precision Medicine
• Reduce Health Disparities
• Improve patient access to clinical
trials
• Work toward a learning National
Cancer Data Ecosystem
Reduce the risk, improve early detection, outcomes and survivorship in cancer
Cancer Data Sharing Efforts
Signature Efforts Data
BRCA Challenge
Somatic variant sharing
Isolated genetic variants
No raw sequencing data
Precision medicine questions
Somatic variant sharing
Panel gene resequencing
Clinical response
Clinical trial
Public-private partnerships
Comprehensive genomics
Detailed clinical
phenotype data
Clinical trial access
Clinical/genomic data
aggregation
EHR data
Clinical sequencing
Clinical oncology standards
EHR data
Clinical sequencing
Changing the conversation around data sharing
 How do we find data, software, standards?
 How can we make data, annotations, software, metadata accessible?
 How do we reuse data standards?
 How do we make more data machine readable?
NIH Data Commons
NCI Genomic Data Commons
National Cancer Data Ecosystem
Data Commons co-locate data, storage and computing infrastructure, and
frequently used tools for analyzing and sharing data to create an
interoperable resource for the research community.
*Robert L. Grossman, Allison Heath, Mark Murphy, Maria Patterson, A Case for Data Commons: Towards Data Science as a
Service, to appear. Source of image: interior of one of Google’s data centers, www.google.com/about/datacenters/.
Basic Ingredients of a Cancer Research Data Ecosystem
• Open Science. Supporting Open Access, Open Data, Open Source, and
Data Liquidity for the cancer community
• Standardization through CDEs and Case Report Forms
• Interoperability by exposing existing knowledge through appropriate
integration of ontologies, vocabularies and taxonomies
• Consistency of curation and capture of evidence (tissue types, recurrence,
therapy – including genomic, epigenomic, etc context)
• Sustainable models for informatics infrastructure, services, data, metadata
GDC as an example of a new
architecture for storing and sharing
cancer data
The Cancer Genomic Data Commons
(GDC) is an existing effort to standardize
and simplify submission of genomic data
to NCI and follow the principles of FAIR
– Findable, Accessible, Attributable,
Interoperable, Reusable, and Provide
Recognition.
The GDC is part of the NIH Big Data to
Knowledge (BD2K) initiative and an
example of the NIH Commons
Genomic Data Commons
Microattribution, nanopublications, tracking the use of
data, annotation of data, use of algorithms, supports
the data /software /metadata life cycle to provide
credit and analyze impact of data, software, analytics,
algorithm, curation and knowledge sharing
Force11 white paper
https://www.force11.org/group/fairgroup/fairprinciples
The NCI Genomic Data Commons
• Unify fragmentary repositories
• Support the receipt, quality control, integration, storage, and
redistribution of standardized genomic data sets derived from cancer
research studies
• Harmonization of raw sequence data from both existing and new cancer research
programs
• Application of state-of-the-art methods of generating derived genomic data
• Provide the foundation for:
• Identification of high- and low-frequency cancer drivers
• Defining genomic determinants of response to therapy
• Clinical trial cohorts sharing targeted genetic lesions
NCI Genomic Data Commons
 The GDC went live on June 6, 2016 with approximately 4.1 PB of data.
 577,878 files about 14194 cases (patients), in 42 cancer types, across 29 primary
disease sites, 400 clinical data elements
 10 major data types, ranging from Raw Sequencing Data, Raw Microarray Data, to
Copy Number Variation, Simple Nucleotide Variation and Gene Expression.
 Data are derived from 17 different experimental strategies, with the major ones
being RNA-Seq, WXS, WGS, miRNA-Seq, Genotyping Array and Expression Array.
 Foundation Medicine announced the release of 18,000 genomic profiles to the GDC
at the Cancer Moonshot Summit, June 29th, 2016
 The Multiple Myeloma Research Foundation announced it would be releasing its
CoMMpass study of more than 1000 cases of Multiple Myeloma on Sept 29, 2016.
Content in the Genomic Data Commons
Current
• TCGA 11,353 cases
• TARGET 3,178 cases
Coming soon
• Foundation Medicine 18,000 cases
• Cancer studies in dbGaP ~4,000 cases
• Multiple Myeloma RF ~1,000 cases
Planned (1-3 years)
• NCI-MATCH ~3,000 cases
• Clinical Trial Sequencing Program ~3,000 cases
• Cancer Driver Discovery Program ~5,000 cases
• Human Cancer Model Initiative ~1,000 cases
• APOLLO – NCI, VA and DoD ~8,000 cases
Total ~58,000 cases
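As a quick check, the per-program case counts listed above can be summed and compared with the quoted total of roughly 58,000. This is only a tally of the slide's own figures, with the values marked "~" taken at face value; the dictionary name is illustrative.

```python
# Case counts as listed on the slide; entries marked "~" there are approximate.
gdc_cases = {
    "TCGA": 11_353,
    "TARGET": 3_178,
    "Foundation Medicine": 18_000,
    "Cancer studies in dbGaP": 4_000,
    "Multiple Myeloma RF": 1_000,
    "NCI-MATCH": 3_000,
    "Clinical Trial Sequencing Program": 3_000,
    "Cancer Driver Discovery Program": 5_000,
    "Human Cancer Model Initiative": 1_000,
    "APOLLO": 8_000,
}

total_cases = sum(gdc_cases.values())
print(f"{total_cases:,} cases in total")
```

The sum rounds to the ~58,000 cases shown on the slide.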
GDC Data Harmonization
Multiple data types and levels of processing
Data type: 1° processing → 2° processing → 3° processing
Exome-seq: Genome alignment → Mutations → Oncogene vs. tumor suppressor
Whole genome-seq: Genome alignment → Mutations + structural variants → Translocations
RNA-seq: Genome alignment → Digital gene expression → Relative RNA levels, alternative splicing
Copy number: Data segmentation → Copy number calls → Gene amplification/deletion
Mutect2
pipeline
GDC Data Harmonization
Open Source, Dockerized Pipelines
Recovery
rate
(% true
positives) A0F0
SomaticSniper 81.1% 76.5%
VarScan 93.9% 84.3%
MuSE 93.1% 87.3%
All Three 96.4% 91.2%
GDC variant calling
pipelines
Wash U
Baylor
Broad
GDC Data Harmonization
Multiple pipelines needed to recover all variants
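The point that several pipelines are needed can be illustrated with a toy example: each caller misses a different subset of true variants, so the union of their call sets recovers more than any single caller. The variant identifiers and call sets below are invented for illustration; this is not GDC pipeline code and does not reproduce the percentages in the table.

```python
# Hypothetical ground-truth somatic variants (identifiers are made up).
truth = {f"v{i}" for i in range(1, 11)}

# Hypothetical call sets; "x1" stands for a false positive.
calls = {
    "SomaticSniper": {"v1", "v2", "v3", "v4", "v5", "v6", "v7", "x1"},
    "VarScan": {"v1", "v2", "v3", "v4", "v5", "v6", "v8", "v9"},
    "MuSE": {"v2", "v3", "v4", "v5", "v6", "v7", "v8"},
}


def recovery_rate(called, truth):
    """Fraction of true variants present in a call set."""
    return len(called & truth) / len(truth)


rates = {name: recovery_rate(called, truth) for name, called in calls.items()}

# Union of all callers: a variant counts as recovered if any caller reports it.
union_calls = set().union(*calls.values())
union_rate = recovery_rate(union_calls, truth)

for name, rate in rates.items():
    print(f"{name}: {rate:.0%}")
print(f"All three: {union_rate:.0%}")
```

Taking the union also pools each caller's false positives, which is one reason real pipelines add filtering after merging.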
Development of the NCI Genomic Data Commons (GDC)
To Foster the Molecular Diagnosis and Treatment of Cancer
GDC
Bob Grossman PI
Univ. of Chicago
Ontario Inst. Cancer Res.
Leidos
Institute of Medicine
Towards Precision Medicine
2011
Discovery of Cancer Drivers With 2% Prevalence
Lung adeno.
+ 2,900
Colorectal
+ 1,200
Ovarian
+ 500
Lawrence et al, Nature 2014
Power Calculation for Cancer Driver Discovery
Need to resequence >100,000 tumors to
identify all cancer drivers at >2% prevalence
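To give a feel for why low-prevalence drivers demand large cohorts, here is a deliberately simplified binomial sketch. It asks how many tumors must be sequenced so that a gene mutated in a fraction p of tumors is seen in at least k of them with a given probability. It is not the method of Lawrence et al., which also models background mutation rates and tests many genes across many tumor types; the threshold k = 10 and the 90% power target are assumptions chosen only for illustration.

```python
from math import comb


def prob_at_least(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return 1.0 - sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


def min_samples(k, p, power=0.9):
    """Smallest cohort size n with P(at least k mutated tumors) >= power."""
    n = k
    while prob_at_least(k, n, p) < power:
        n += 1
    return n


# Assumed illustration: require 10 mutated tumors, 90% power.
for prevalence in (0.10, 0.05, 0.02):
    n = min_samples(10, prevalence)
    print(f"prevalence {prevalence:.0%}: about {n} tumors of one type")
```

Even under these generous assumptions the required cohort grows quickly as prevalence falls, and the requirement applies per tumor type, which is the direction of the argument on the slide.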
The NCI Cancer Genomics
Cloud Pilots
Understanding how to meet the
research community’s need to
analyze large-scale cancer
genomic and clinical data
NCI Cancer Genomics Cloud Pilots
Democratize access to
NCI-generated genomic
and related data, and to
create a cost-effective
way to provide scalable
computational capacity
to the cancer research
community.
Cloud Pilots provide:
• Access to large genomic data sets without need to download
• Access to popular pipelines and visualization tools
• Ability for researchers to bring their own tools and pipelines to the data
• Ability for researchers to bring their own data and analyze in combination with existing genomic
data
• Workspaces, for researchers to save and share their data and results of analyses
Three NCI Genomics Cloud Pilots
Broad Institute
• PI: Gad Getz
• Google Cloud
• Firehose in the cloud including Broad best practices workflows
• http://firecloud.org
Institute for Systems Biology
• PI: Ilya Shmulevich
• Google Cloud
• Leverage Google infrastructure; novel query and visualization
• http://cgc.systemsbiology.net/
Seven Bridges Genomics
• PI: Deniz Kural
• Amazon Web Services
• Interactive data exploration; > 30 public pipelines
• http://www.cancergenomicscloud.org
Timeline phases: Selection, Design/Build I, Design/Build II, Evaluation, Extension
Timeline dates: Jan 2014, Sept 2014, April 2015, Jan 2016, Sept 2016
Broad Institute Cloud Pilot
• Targeted at users performing
analyses at scale
• Modeled after their Firehose
analysis infrastructure
developed for the TCGA
program
• Users can upload their own data
and tools and/or run the Broad’s
best practice tools and pipelines
on pre-loaded data
Institute for Systems Biology Cloud Pilot
PI / Biologist
web access
Computational
Research Scientist
Python, R, SQL
Algorithm Developer
ssh, programmatic
access
ISB-CGC Web App Google Cloud Console
Google APIs
ISB-CGC APIs
Compute
Engine VMs
Cloud
Storage
BigQuery Genomics
Local
Storage
ISB-CGC
Hosted Data
Controlled-Access Data
Open-Access Data User Data
• Closely tied with Google Cloud Platform tools including BigQuery, App Engine, Cloud
Datalab, Google Genomics, and Compute Engine
• Aggregated TCGA data in BigQuery allows fast SQL-like queries across the entire dataset
• Web interface allows scientists to interactively compare and define cohorts
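The kind of SQL-style cohort query described above can be sketched with Python's built-in sqlite3 module standing in for BigQuery. The table name, column names, and rows are hypothetical and do not reflect the actual ISB-CGC schema; only the shape of the query is the point.

```python
import sqlite3

# In-memory database as a stand-in for a hosted, aggregated mutation table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE somatic_mutations (case_id TEXT, project TEXT, gene TEXT)"
)

# Hypothetical rows: one row per reported mutation in a case.
rows = [
    ("case-001", "LUAD", "TP53"),
    ("case-001", "LUAD", "KRAS"),
    ("case-002", "LUAD", "TP53"),
    ("case-003", "OV", "TP53"),
    ("case-003", "OV", "TP53"),  # second TP53 mutation in the same case
    ("case-004", "COAD", "APC"),
]
conn.executemany("INSERT INTO somatic_mutations VALUES (?, ?, ?)", rows)

# Define a cohort: cases with a mutation in a given gene, counted per project.
query = """
    SELECT project, COUNT(DISTINCT case_id) AS n_cases
    FROM somatic_mutations
    WHERE gene = ?
    GROUP BY project
    ORDER BY n_cases DESC, project
"""
cohort_sizes = conn.execute(query, ("TP53",)).fetchall()

for project, n_cases in cohort_sizes:
    print(f"{project}: {n_cases} cases with a TP53 mutation")
```

Counting distinct case identifiers rather than rows keeps a patient with several mutations in the same gene from being counted twice.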
Seven Bridges Genomics Cloud Pilot
• Built upon the SBG commercial
cloud-based genomics platform
• Graphical query interface to
identify hosted data of interest
• Includes a native
implementation of the Common
Workflow Language
specification and graphical
interface for creating user-
defined workflows
Workspace –
isolated environment for collaborative analysis
Data + Methods → Results
sample data and
metadata (e.g.
BAMs, tissue type)
algorithms
(e.g. mutation
calling)
Wiring logic
(e.g. use the exome
capture BAM)
executions and results
(e.g. run mutation caller v41
on this exact bam and track
results)
Slide courtesy of Broad Institute
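The "Data + Methods → Results" idea, running a specific method version on an exact input and tracking the result, can be sketched as a small provenance record. The class names, fields, and toy method below are hypothetical and are not taken from FireCloud; the sketch only shows the bookkeeping.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Execution:
    """One tracked run: which method version ran on which exact input."""
    method: str
    version: str
    input_sha256: str
    result: str


@dataclass
class Workspace:
    """Collects executions so results can be traced back to data and method."""
    executions: list = field(default_factory=list)

    def run(self, method, version, func, data: bytes) -> Execution:
        # Hash the input bytes so the record pins down this exact file content.
        digest = hashlib.sha256(data).hexdigest()
        record = Execution(method, version, digest, func(data))
        self.executions.append(record)
        return record


def toy_caller(data: bytes) -> str:
    """Stand-in for a real analysis: counts ambiguous 'N' bases."""
    return f"{data.count(b'N')} ambiguous bases"


workspace = Workspace()
record = workspace.run("toy_caller", "v41", toy_caller, b"ACGTNNA")
print(record)
```

Storing a content hash instead of a file name means a later run on a modified file is recorded as a different input, even if the name is unchanged.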
GDC Acknowledgements
NCI Center for Cancer Genomics
Lou Staudt
Zhining Wang
Martin Ferguson
JC Zenklusen
Daniela Gerhard
Deb Steverson
Univ. of Chicago
Bob Grossman
Allison Heath
Mike Ford
Zhenyu Zhang
Ontario Institute for Cancer Research
Vincent Ferretti
Francois Gerthoffert
JunJun Zhang
Leidos Biomedical Research
Mark Jensen
Sharon Gaheen
Himanso Sahni
NCI CBIIT
Tony Kerlavage
Tanja Davidsen
CGC Pilot Team Principal Investigators
• Gad Getz, Ph.D - Broad Institute - http://firecloud.org
• Ilya Shmulevich, Ph.D - ISB - http://cgc.systemsbiology.net/
• Deniz Kural, Ph.D - Seven Bridges – http://www.cancergenomicscloud.org
NCI Project Officer & CORs
• Anthony Kerlavage, Ph.D –Project Officer
• Juli Klemm, Ph.D – COR, Broad Institute
• Tanja Davidsen, Ph.D – COR, Institute for Systems Biology
• Ishwar Chandramouliswaran, MS, MBA – COR, Seven Bridges Genomics
GDC Principal Investigator
• Robert Grossman, Ph.D - University of Chicago
• Allison Heath, Ph.D - University of Chicago
• Vincent Ferretti, Ph.D - Ontario Institute for Cancer Research
Cancer Genomics Project Teams
NCI Leadership Team
• Doug Lowy, M.D.
• Lou Staudt, M.D., Ph.D.
• Stephen Chanock, M.D.
• George Komatsoulis, Ph.D.
• Warren Kibbe, Ph.D.
Center for Cancer Genomics Partners
• JC Zenklusen, Ph.D.
• Daniela Gerhard, Ph.D.
• Zhining Wang, Ph.D.
• Liming Yang, Ph.D.
• Martin Ferguson, Ph.D.
Integrated data sets, interoperable
resources, harmonized data are
necessary for and enable
biologically informed cancer
predictive models
Questions?
Warren Kibbe, Ph.D.
Warren.kibbe@nih.gov
@wakibbe
www.cancer.gov www.cancer.gov/espanol
NIH Genomic Data Sharing Policy
https://gds.nih.gov/
Went into effect January 25, 2015
NCI guidance:
http://www.cancer.gov/grants-training/grants-management/nci-policies/genomic-data
Requires public sharing of genomic data sets

More Related Content

What's hot

Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsPragya Pai
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function predictionLars Juhl Jensen
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelLevi Shapiro
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicineDeakin University
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphismBipul Das
 
Single Nucleotide Polymorphism
Single Nucleotide PolymorphismSingle Nucleotide Polymorphism
Single Nucleotide Polymorphismsajal shrivastav
 
Introduction to systems biology
Introduction to systems biologyIntroduction to systems biology
Introduction to systems biologylemberger
 
Introduction cancer genetics and genomics
Introduction cancer genetics and genomicsIntroduction cancer genetics and genomics
Introduction cancer genetics and genomicsNuria Lopez-Bigas
 
Machine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug DiscoveryMachine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug DiscoveryDeakin University
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyMrinal Vashisth
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentationEmmanuel Aguon
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcareData Science Thailand
 
Deep learning for genomics: Present and future
Deep learning for genomics: Present and futureDeep learning for genomics: Present and future
Deep learning for genomics: Present and futureDeakin University
 

What's hot (20)

Artificial Intelligence and Diagnostics
Artificial Intelligence and DiagnosticsArtificial Intelligence and Diagnostics
Artificial Intelligence and Diagnostics
 
Genomics
GenomicsGenomics
Genomics
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth Israel
 
Protein Predictinon
Protein PredictinonProtein Predictinon
Protein Predictinon
 
Ml in genomics
Ml in genomicsMl in genomics
Ml in genomics
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicine
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
 
Single Nucleotide Polymorphism
Single Nucleotide PolymorphismSingle Nucleotide Polymorphism
Single Nucleotide Polymorphism
 
Genomics
GenomicsGenomics
Genomics
 
Whole exome sequencing(wes)
Whole exome sequencing(wes)Whole exome sequencing(wes)
Whole exome sequencing(wes)
 
Introduction to systems biology
Introduction to systems biologyIntroduction to systems biology
Introduction to systems biology
 
Introduction cancer genetics and genomics
Introduction cancer genetics and genomicsIntroduction cancer genetics and genomics
Introduction cancer genetics and genomics
 
Machine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug DiscoveryMachine Learning and Reasoning for Drug Discovery
Machine Learning and Reasoning for Drug Discovery
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
 
Lecture 7 gwas full
Lecture 7 gwas fullLecture 7 gwas full
Lecture 7 gwas full
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
Precision Medicine - The Future of Healthcare
Precision Medicine - The Future of HealthcarePrecision Medicine - The Future of Healthcare
Precision Medicine - The Future of Healthcare
 
Deep learning for genomics: Present and future
Deep learning for genomics: Present and futureDeep learning for genomics: Present and future
Deep learning for genomics: Present and future
 

Viewers also liked

National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingWarren Kibbe
 
What is A Cloud Stack in 2017
What is A Cloud Stack in 2017What is A Cloud Stack in 2017
What is A Cloud Stack in 2017Gaurav Roy
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemWarren Kibbe
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsWarren Kibbe
 
Design in Tech Report 2017
Design in Tech Report 2017Design in Tech Report 2017
Design in Tech Report 2017John Maeda
 
SuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineSuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineWarren Kibbe
 
From Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineFrom Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineKent State University
 
Europe ai scaleups report 2016
Europe ai scaleups report 2016Europe ai scaleups report 2016
Europe ai scaleups report 2016Omar Mohout
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017Drift
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsWarren Kibbe
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsWarren Kibbe
 
Genomic Medicine: Personalized Care for Just Pennies
Genomic Medicine: Personalized Care for Just PenniesGenomic Medicine: Personalized Care for Just Pennies
Genomic Medicine: Personalized Care for Just PenniesHealth Catalyst
 
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...Principled Technologies
 
Precision medicine the future of_healthcare
Precision medicine  the future of_healthcarePrecision medicine  the future of_healthcare
Precision medicine the future of_healthcareThomas Wilckens
 
Intel precision medicine apr 2015
Intel precision medicine apr 2015Intel precision medicine apr 2015
Intel precision medicine apr 2015Ketan Paranjape
 
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016Warren Kibbe
 
Precision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsPrecision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsMedpace
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemWarren Kibbe
 
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsC-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsWarren Kibbe
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingWarren Kibbe
 

Viewers also liked (20)

National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data Sharing
 
What is A Cloud Stack in 2017
What is A Cloud Stack in 2017What is A Cloud Stack in 2017
What is A Cloud Stack in 2017
 
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data EcosystemNCI Cancer Imaging Program - Cancer Research Data Ecosystem
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
 
Design in Tech Report 2017
Design in Tech Report 2017Design in Tech Report 2017
Design in Tech Report 2017
 
SuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision MedicineSuperComputing 16 HPC Matters Panel on Precision Medicine
SuperComputing 16 HPC Matters Panel on Precision Medicine
 
From Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision MedicineFrom Clinical Decision Support to Precision Medicine
From Clinical Decision Support to Precision Medicine
 
Europe ai scaleups report 2016
Europe ai scaleups report 2016Europe ai scaleups report 2016
Europe ai scaleups report 2016
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology Informatics
 
Precision Medicine in Oncology Informatics
Precision Medicine in Oncology InformaticsPrecision Medicine in Oncology Informatics
Precision Medicine in Oncology Informatics
 
Genomic Medicine: Personalized Care for Just Pennies
Genomic Medicine: Personalized Care for Just PenniesGenomic Medicine: Personalized Care for Just Pennies
Genomic Medicine: Personalized Care for Just Pennies
 
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...
Comparing approaches: Running database workloads on Dell EMC and Microsoft hy...
 
Precision medicine the future of_healthcare
Precision medicine  the future of_healthcarePrecision medicine  the future of_healthcare
Precision medicine the future of_healthcare
 
Intel precision medicine apr 2015
Intel precision medicine apr 2015Intel precision medicine apr 2015
Intel precision medicine apr 2015
 
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
NCI TEAG Cancer Moonshot Blue Ribbon Panel Presentation Oct 2016
 
Precision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical TrialsPrecision Medicine: Opportunities and Challenges for Clinical Trials
Precision Medicine: Opportunities and Challenges for Clinical Trials
 
A Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge SystemA Vision for a Cancer Research Knowledge System
A Vision for a Cancer Research Knowledge System
 
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud PilotsC-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
 
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
 

Similar to Genomics and Computation in Precision Medicine March 2017

Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Warren Kibbe
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016Warren Kibbe
 
Converged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingConverged IT Summit - NCI Data Sharing
Converged IT Summit - NCI Data SharingWarren Kibbe
 
Cancer Research Data Ecosystem - Dr. Warren Kibbe
Cancer Research Data Ecosystem - Dr. Warren KibbeCancer Research Data Ecosystem - Dr. Warren Kibbe



Genomics and Computation in Precision Medicine March 2017

  • 1. Insights from Genomics and Computational Biology that inform Precision Medicine. Warren Kibbe, PhD; warren.kibbe@nih.gov; @wakibbe; March 6th, 2017
  • 2. 1. Background; 2. Data in Biomedicine; 3. Data Sharing; 4. Data Commons; 5. Genomics and Computation. Thanks to many folks for slides, but especially Jerry Lee.
  • 3. In 2016 there were an estimated 1,700,000 new cancer cases and 600,000 cancer deaths (American Cancer Society). Cancer remains the second most common cause of death in the U.S. (Centers for Disease Control and Prevention).
  • 4. In 2016 there are an estimated 15,500,000 cancer survivors in the U.S.
  • 5. Understanding Cancer: Precision medicine will lead to fundamental understanding of the complex interplay between genetics, epigenetics, nutrition, environment and clinical presentation and direct effective, evidence-based prevention and treatment.
  • 6. 2006-2015: A Decade of Illuminating the Underlying Causes of Primary Untreated Tumors: Omics Characterization (10,000+ patient tumors and increasing). Courtesy of P. Kuhn (USC). Cancer is a grand challenge. Requires: deep biological understanding; advances in scientific methods; advances in instrumentation; advances in technology; data and computation. Cancer research and care generate detailed data that are critical to creating a learning health system for cancer.
  • 7. 2006-2015: A Decade of Illuminating the Underlying Causes of Primary Untreated Tumors: Omics Characterization (10,000+ patient tumors and increasing). Courtesy of P. Kuhn (USC).
  • 8. The primary goal of TCGA is to identify disease subtypes and generate a comprehensive catalog of all cancer genes and pathways. Pilot phase started in 2006: 3 tumor types, 500 tumor/normal (T/N) pairs each. Extended in 2009: 20 x 500 T/N + 15 x (50-150) T/N. Freely available genomic data enable researchers anywhere around the world to make and validate important discoveries.
  • 13. Tumor, Cancer, and Metastasis (Length-scale and Time-scale Matter): “…>90% of deaths are caused by disseminated disease or metastasis…” Gupta et al., Cell, 2006 and Siegel et al., CA Cancer J Clin, Jan/Feb 2016. 5-year Relative Survival Rates (2016 report of 2005-2011 data).
  • 14. Digital technology is changing rapidly. [Timeline figure, 1997-2015: WinAMP (mp3), 4/21/1997; 802.11b WiFi, 9/16/1999 (~3 yrs old); iPod (10 GB max), 10/23/2001 (~5 yrs old); iPhone (EDGE, 16 GB max), 1/9/2007 (~10 yrs old); iPhone 3G (16 GB max), 7/11/2008 (~11 yrs old); iPad (EDGE, 64 GB max), 4/3/2010 (~13 yrs old); Google Drive, 4/24/2012 (~15 yrs old); iPhone5 (LTE, 128 GB max), 9/12/2012 (~15 yrs old); Google Baseline, 7/14/2014 (~17 yrs old). Dates whose labels did not survive extraction: 4/23/2005 (~8 yrs old), 7/15/2006, 9/26/2006 (~9 yrs old), 2/7/2007, 3/9/2015 (~18 yrs old).]
  • 16. Application of Cancer Genomics is changing
  • 19. Biology and Medicine are now data intensive enterprises. Scale is rapidly changing. Technology, data, computing and IT are pervasive in the lab, the clinic, in the home, and across the population.
  • 22. “...an advantage of machine learning is that it can be used even in cases where it is infeasible or difficult to write down explicit rules to solve a problem...” https://www.whitehouse.gov/sites/default/files/whitehouse_files/microsites/ostp/NSTC/preparing_for_the_future_of_ai.pdf
  • 25. “There is great potential for new insights to come from the combined analysis of cancer proteomic and genomic data, as proteomic data can now reproducibly provide information about protein levels and activities that are difficult or impossible to infer from genomic data alone.” Douglas R. Lowy, MD, Acting Director of the National Cancer Institute, National Institutes of Health, 5/25/2016
  • 26. Genomics is the beginning of precision medicine, not the end!
  • 28. Cancer Research Data Ecosystem: Cancer Moonshot BRP. [Diagram labels: Well characterized research data sets; Cancer cohorts; Patient data; EHR, Lab Data, Imaging, PROs, Smart Devices, Decision Support; Learning from every cancer patient; Active research participation; Research information donor; Clinical Research; Observational studies; Proteogenomics; Imaging data; Clinical trials; Discovery; Patient engaged Research; Surveillance; Big Data; Implementation research; SEER; GDC]
  • 30. Data Commons Structure. [Diagram labels: DICOM, AIM; Amazon; Google; IBM; Imaging Validator Q/A; Proteomic Validator Q/A; Clinical Phenotype Validator Q/A; MOD Phenotype Validator Q/A; Pathology; Radiology; Mass Spectrometry; Array; Data Commons; Security; Visualization; Authentication & Authorization; Genomic Validator Q/A; Germline Pipelines; DNA, RNA Pipelines; EMRs, Clinical Trials; Azure; Data Contributors and Consumers: Researchers, Patients, Clinicians, Institutions; NCI Thesaurus; caDSR; NLM UMLS; RxNorm; LOINC; SNOMED]
  • 31. Cancer Data Sharing & Data Commons: • Support open science • Support data reusability • Aligned with Cancer Moonshot • Part of Precision Medicine • Reduce Health Disparities • Improve patient access to clinical trials • Work toward a learning National Cancer Data Ecosystem. Reduce the risk, improve early detection, outcomes and survivorship in cancer.
  • 32. Cancer Data Sharing Efforts. [Flattened table; columns: Signature Efforts, Data. Cell text in extraction order: BRCA Challenge; Somatic variant sharing; Isolated genetic variants; No raw sequencing data; Precision medicine questions; Somatic variant sharing; Panel gene resequencing; Clinical response; Clinical trial; Public-private partnerships; Comprehensive genomics; Detailed clinical phenotype data; Clinical trial access; Clinical/genomic data aggregation; EHR data; Clinical sequencing; Clinical oncology standards; EHR data; Clinical sequencing]
  • 33. Changing the conversation around data sharing: How do we find data, software, standards? How can we make data, annotations, software, metadata accessible? How do we reuse data standards? How do we make more data machine readable? NIH Data Commons; NCI Genomic Data Commons; National Cancer Data Ecosystem. Data Commons co-locate data, storage and computing infrastructure, and frequently used tools for analyzing and sharing data to create an interoperable resource for the research community.* *Robert L. Grossman, Allison Heath, Mark Murphy, Maria Patterson, A Case for Data Commons Towards Data Science as a Service, to appear. Source of image: Interior of one of Google’s Data Centers, www.google.com/about/datacenters/.
  • 34. Basic Ingredients of a Cancer Research Data Ecosystem: • Open Science. Supporting Open Access, Open Data, Open Source, and Data Liquidity for the cancer community • Standardization through CDEs and Case Report Forms • Interoperability by exposing existing knowledge through appropriate integration of ontologies, vocabularies and taxonomies • Consistency of curation and capture of evidence (tissue types, recurrence, therapy, including genomic, epigenomic, etc. context) • Sustainable models for informatics infrastructure, services, data, metadata
  • 35. GDC as an example of a new architecture for storing and sharing cancer data
  • 36. The Cancer Genomic Data Commons (GDC) is an existing effort to standardize and simplify submission of genomic data to NCI and follow the principles of FAIR: Findable, Accessible, Attributable, Interoperable, Reusable, and Provide Recognition. The GDC is part of the NIH Big Data to Knowledge (BD2K) initiative and an example of the NIH Commons. Genomic Data Commons: microattribution, nanopublications, tracking the use of data, annotation of data, use of algorithms; supports the data/software/metadata life cycle to provide credit and analyze impact of data, software, analytics, algorithm, curation and knowledge sharing. Force11 white paper: https://www.force11.org/group/fairgroup/fairprinciples
  • 37. 37 The NCI Genomic Data Commons • Unify fragmentary repositories • Support the receipt, quality control, integration, storage, and redistribution of standardized genomic data sets derived from cancer research studies • Harmonization of raw sequence data from both existing and new cancer research programs • Application of state-of-the-art methods of generating derived genomic data • Provide the foundation for: • Identification of high- and low-frequency cancer drivers • Defining genomic determinants of response to therapy • Clinical trial cohorts sharing targeted genetic lesions
  • 38. NCI Genomic Data Commons • The GDC went live on June 6, 2016 with approximately 4.1 PB of data. • 577,878 files about 14,194 cases (patients), in 42 cancer types, across 29 primary disease sites, with 400 clinical data elements • 10 major data types, ranging from Raw Sequencing Data and Raw Microarray Data to Copy Number Variation, Simple Nucleotide Variation and Gene Expression. • Data are derived from 17 different experimental strategies, with the major ones being RNA-Seq, WXS, WGS, miRNA-Seq, Genotyping Array and Expression Array. • Foundation Medicine announced the release of 18,000 genomic profiles to the GDC at the Cancer Moonshot Summit, June 29th, 2016 • The Multiple Myeloma Research Foundation announced it would be releasing its CoMMpass study of more than 1,000 cases of Multiple Myeloma on Sept 29, 2016.
  • 39. 39 Content in the Genomic Data Commons. Current: TCGA 11,353 cases; TARGET 3,178 cases. Coming soon: Foundation Medicine 18,000 cases; Cancer studies in dbGaP ~4,000 cases; Multiple Myeloma RF ~1,000 cases. Planned (1-3 years): NCI-MATCH ~3,000 cases; Clinical Trial Sequencing Program ~3,000 cases; Cancer Driver Discovery Program ~5,000 cases; Human Cancer Model Initiative ~1,000 cases; APOLLO – NCI, VA and DoD ~8,000 cases. Total: ~58,000 cases
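As a quick arithmetic check on the slide's "~58,000 cases" total, the per-program counts can simply be summed. This is a minimal sketch: the dictionary name `gdc_cases` is illustrative, and the counts marked "~" on the slide are treated as exact for the purpose of the sum.

```python
# Case counts copied from the slide; entries shown with "~" there are
# approximate and are treated as exact here only to check the total.
gdc_cases = {
    "TCGA": 11_353,
    "TARGET": 3_178,
    "Foundation Medicine": 18_000,
    "Cancer studies in dbGaP": 4_000,
    "Multiple Myeloma RF": 1_000,
    "NCI-MATCH": 3_000,
    "Clinical Trial Sequencing Program": 3_000,
    "Cancer Driver Discovery Program": 5_000,
    "Human Cancer Model Initiative": 1_000,
    "APOLLO (NCI, VA and DoD)": 8_000,
}

total_cases = sum(gdc_cases.values())
# Round to the nearest thousand, matching the precision used on the slide.
rounded_total = round(total_cases, -3)

print(f"sum of listed programs: {total_cases:,}")
print(f"rounded to thousands:   {rounded_total:,}")
```

The listed programs add up to 57,531 cases, which rounds to the slide's figure of roughly 58,000.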
  • 40. GDC Data Harmonization: Multiple data types and levels of processing. Exome-seq: genome alignment (1° processing) → mutations (2° processing) → oncogene vs. tumor suppressor (3° processing). Whole genome-seq: genome alignment → mutations + structural variants → translocations. RNA-seq: genome alignment → digital gene expression → relative RNA levels, alternative splicing. Copy number: data segmentation → copy number calls → gene amplification/deletion.
  • 41. GDC Data Harmonization: Open Source, Dockerized Pipelines (figure: Mutect2 pipeline)
  • 42. GDC Data Harmonization: Multiple pipelines needed to recover all variants. GDC variant calling pipelines (Wash U, Baylor, Broad). Recovery rate (% true positives) A0F0: SomaticSniper 81.1%, 76.5%; VarScan 93.9%, 84.3%; MuSE 93.1%, 87.3%; All Three 96.4%, 91.2%.
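The point that several callers together recover more true variants than any one alone can be illustrated with set unions. This is a toy sketch, not the GDC pipelines themselves: the caller names, the integer variant identifiers and the truth set below are all made up for illustration.

```python
def recovery_rate(called, truth):
    """Fraction of the known true variants that appear in a call set."""
    return len(called & truth) / len(truth)


# Hypothetical truth set of 10 validated variants, identified by integers.
truth = set(range(10))

# Hypothetical call sets; each caller misses different variants,
# and caller_b also reports one false positive (42).
calls = {
    "caller_a": {0, 1, 2, 3, 4, 5, 6},
    "caller_b": {2, 3, 4, 5, 6, 7, 8, 42},
    "caller_c": {0, 1, 5, 6, 7},
}

# Combining callers means taking the union of their call sets.
combined = set().union(*calls.values())

for name, called in calls.items():
    print(f"{name}: {recovery_rate(called, truth):.0%}")
print(f"all three: {recovery_rate(combined, truth):.0%}")
```

Because the union contains every call made by any caller, its recovery rate can never be lower than that of the best single caller. The trade-off, not modeled here beyond the single false positive, is that the union also accumulates each caller's false positives.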
  • 43. Development of the NCI Genomic Data Commons (GDC) To Foster the Molecular Diagnosis and Treatment of Cancer. GDC: Bob Grossman, PI, Univ. of Chicago; Ontario Inst. Cancer Res.; Leidos. Institute of Medicine, Towards Precision Medicine, 2011
  • 46. Power Calculation for Cancer Driver Discovery (Lawrence et al, Nature 2014). Discovery of Cancer Drivers With 2% Prevalence: Lung adeno. + 2,900; Colorectal + 1,200; Ovarian + 500. Need to resequence >100,000 tumors to identify all cancer drivers at >2% prevalence
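To give a feel for why low-prevalence drivers demand large cohorts, here is a deliberately simplified binomial sketch. It is not the calculation behind the slide's numbers: it only asks how likely a cohort is to contain at least `k` tumors carrying a mutation of a given prevalence, and it ignores background mutation rates, gene length and multiple testing, all of which a real driver-discovery analysis has to account for. The function name `prob_at_least` and the threshold of 10 mutated tumors are assumptions made for illustration.

```python
from math import comb


def prob_at_least(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), via the complement of the lower tail."""
    lower_tail = sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))
    return 1.0 - lower_tail


prevalence = 0.02    # a driver mutated in 2% of tumors of one type
min_mutated = 10     # hypothetical number of mutated tumors needed for a signal

for cohort_size in (200, 500, 1000, 2000):
    p_seen = prob_at_least(min_mutated, cohort_size, prevalence)
    print(f"n={cohort_size:5d}: P(at least {min_mutated} mutated tumors) = {p_seen:.3f}")
```

Even in this idealized setting, the probability of collecting enough mutated tumors rises steeply with cohort size, and the requirement applies separately to each tumor type.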
  • 47. The NCI Cancer Genomics Cloud Pilots Understanding how to meet the research community’s need to analyze large-scale cancer genomic and clinical data
  • 48. 48 NCI Cancer Genomics Cloud Pilots Democratize access to NCI-generated genomic and related data, and create a cost-effective way to provide scalable computational capacity to the cancer research community. Cloud Pilots provide: • Access to large genomic data sets without the need to download • Access to popular pipelines and visualization tools • Ability for researchers to bring their own tools and pipelines to the data • Ability for researchers to bring their own data and analyze it in combination with existing genomic data • Workspaces, for researchers to save and share their data and results of analyses
  • 49. 49 Three NCI Genomics Cloud Pilots. Broad Institute (http://firecloud.org): PI Gad Getz; Google Cloud; Firehose in the cloud including Broad best practices workflows. Institute for Systems Biology (http://cgc.systemsbiology.net/): PI Ilya Shmulevich; Google Cloud; leverage Google infrastructure; novel query and visualization. Seven Bridges Genomics (http://www.cancergenomicscloud.org): PI Deniz Kural; Amazon Web Services; interactive data exploration; > 30 public pipelines. Timeline phases: Selection, Design/Build I, Design/Build II, Evaluation, Extension. Timeline dates: Jan 2014, Sept 2014, April 2015, Jan 2016, Sept 2016
  • 50. Broad Institute Cloud Pilot • Targeted at users performing analyses at scale • Modeled after their Firehose analysis infrastructure developed for the TCGA program • Users can upload their own data and tools and/or run the Broad’s best practice tools and pipelines on pre-loaded data
  • 51. 51 Institute for Systems Biology Cloud Pilot • Closely tied with Google Cloud Platform tools including BigQuery, App Engine, Cloud Datalab, Google Genomics, and Compute Engine • Aggregated TCGA data in BigQuery allows fast SQL-like queries across the entire dataset • Web interface allows scientists to interactively compare and define cohorts. Diagram labels: users: PI / Biologist (web access), Computational Research Scientist (Python, R, SQL), Algorithm Developer (ssh, programmatic access); access points: ISB-CGC Web App, Google Cloud Console, Google APIs, ISB-CGC APIs; resources: Compute Engine VMs, Cloud Storage, BigQuery, Genomics, Local Storage; data: ISB-CGC Hosted Data, Controlled-Access Data, Open-Access Data, User Data
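The idea of defining a cohort with a SQL-like query over aggregated clinical data can be sketched locally. This example uses Python's built-in `sqlite3` as a stand-in for BigQuery, and the `clinical` table, its columns and its rows are all hypothetical; none of them reflect the actual ISB-CGC table layout.

```python
import sqlite3

# In-memory database standing in for a hosted table of aggregated clinical data.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE clinical (case_id TEXT, disease TEXT, age_at_diagnosis INTEGER)"
)
conn.executemany(
    "INSERT INTO clinical VALUES (?, ?, ?)",
    [
        ("case-1", "LUAD", 64),
        ("case-2", "LUAD", 47),
        ("case-3", "OV", 58),
        ("case-4", "COAD", 71),
        ("case-5", "OV", 52),
    ],
)

# Define a cohort (patients diagnosed at age 50 or older) and count it per disease.
cohort_counts = conn.execute(
    """
    SELECT disease, COUNT(*) AS n_cases
    FROM clinical
    WHERE age_at_diagnosis >= ?
    GROUP BY disease
    ORDER BY disease
    """,
    (50,),
).fetchall()
conn.close()

for disease, n_cases in cohort_counts:
    print(disease, n_cases)
```

The appeal of this model is that a cohort is just a filter expression, so it can be saved, shared and re-run against the full dataset without downloading any files.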
  • 52. Seven Bridges Genomics Cloud Pilot • Built upon the SBG commercial cloud-based genomics platform • Graphical query interface to identify hosted data of interest • Includes a native implementation of the Common Workflow Language specification and graphical interface for creating user-defined workflows
  • 53. Workspace – isolated environment for collaborative analysis. Data + Methods → Results. Data: sample data and metadata (e.g. BAMs, tissue type). Methods: algorithms (e.g. mutation calling). Wiring logic (e.g. use the exome capture BAM). Results: executions and results (e.g. run mutation caller v41 on this exact BAM and track results). Slide courtesy of Broad Institute
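The workspace idea of "Data + Methods → Results" can be sketched as a small data model. This is an illustrative sketch only, not the Broad's implementation: the class names, fields, file path and sample identifier are all hypothetical, and running a method merely records what would be executed.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Sample:
    """Sample data plus metadata (e.g. a BAM and its tissue type)."""
    sample_id: str
    exome_bam: str
    tissue_type: str


@dataclass(frozen=True)
class Method:
    """A versioned algorithm (e.g. a mutation caller)."""
    name: str
    version: str


@dataclass
class Workspace:
    """Isolated container tying samples and methods to tracked executions."""
    samples: dict = field(default_factory=dict)
    methods: dict = field(default_factory=dict)
    executions: list = field(default_factory=list)

    def run(self, method_name, sample_id):
        method = self.methods[method_name]
        sample = self.samples[sample_id]
        # Wiring logic: this method consumes the sample's exome capture BAM.
        record = {
            "method": method.name,
            "version": method.version,
            "input": sample.exome_bam,
            "status": "submitted",
        }
        # Track exactly which method version ran on which input.
        self.executions.append(record)
        return record


ws = Workspace()
ws.samples["S1"] = Sample("S1", "data/S1.exome.bam", "tumor")
ws.methods["mutation_caller"] = Method("mutation_caller", "v41")
result = ws.run("mutation_caller", "S1")
print(result)
```

Recording the method version and the exact input with every execution is what makes results in a shared workspace reproducible and attributable.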
  • 54. GDC Acknowledgements. NCI Center for Cancer Genomics: Lou Staudt, Zhining Wang, Martin Ferguson, JC Zenklusen, Daniela Gerhard, Deb Steverson. Univ. of Chicago: Bob Grossman, Allison Heath, Mike Ford, Zhenyu Zhang. Ontario Institute for Cancer Research: Vincent Ferretti, Francois Gerthoffert, JunJun Zhang. Leidos Biomedical Research: Mark Jensen, Sharon Gaheen, Himanso Sahni. NCI CBIIT: Tony Kerlavage, Tanja Davidsen
  • 55. Cancer Genomics Project Teams. CGC Pilot Team Principal Investigators • Gad Getz, Ph.D. - Broad Institute - http://firecloud.org • Ilya Shmulevich, Ph.D. - ISB - http://cgc.systemsbiology.net/ • Deniz Kural, Ph.D. - Seven Bridges - http://www.cancergenomicscloud.org NCI Project Officer & CORs • Anthony Kerlavage, Ph.D. – Project Officer • Juli Klemm, Ph.D. – COR, Broad Institute • Tanja Davidsen, Ph.D. – COR, Institute for Systems Biology • Ishwar Chandramouliswaran, MS, MBA – COR, Seven Bridges Genomics GDC Principal Investigator • Robert Grossman, Ph.D. - University of Chicago • Allison Heath, Ph.D. - University of Chicago • Vincent Ferretti, Ph.D. - Ontario Institute for Cancer Research NCI Leadership Team • Doug Lowy, M.D. • Lou Staudt, M.D., Ph.D. • Stephen Chanock, M.D. • George Komatsoulis, Ph.D. • Warren Kibbe, Ph.D. Center for Cancer Genomics Partners • JC Zenklusen, Ph.D. • Daniela Gerhard, Ph.D. • Zhining Wang, Ph.D. • Liming Yang, Ph.D. • Martin Ferguson, Ph.D.
  • 56. 56 Integrated data sets, interoperable resources, and harmonized data are necessary for, and enable, biologically informed cancer predictive models
  • 59. 59 NIH Genomic Data Sharing Policy: https://gds.nih.gov/ Went into effect January 25, 2015. NCI guidance: http://www.cancer.gov/grants-training/grants-management/nci-policies/genomic-data Requires public sharing of genomic data sets