SlideShare a Scribd company logo
1 of 9
A view from science-driven “big industry”
Duncan Irving, Oil and Gas Consulting Practice Lead, Teradata
Fiona Murphy, Earth Science Journals Publisher, Wiley
PARTNERSHIPS, TRUST, QUALITY
@duncanirving
2
The pace of science-based industry
what is an acceptable provenance latency if you cannot
make a decision until trust has been established?
seconds minutes hours days weeks
“How do I know that a ‘fact’ has altered in my
view of the world and when did it happen?”
Leading Advisor (Global Subsurface Data Management), Statoil
Facts Decision
• hypothesis
• experiment
• model
• interpretation
• context
3
now: we publish knowledge + data
Hypothesise Model Test Contextualise Publish
Subject Area
Drivers
Experimental
Methodologies
Technical
Approaches
Direct
Comparison
Broader
Context
Relevance
Publishing Categories or
Degrees of Freedom?
Hypothesise Model
Contextualise Test
Publish
future: knowledge will be continuously updated*
* with more
attention to its
intended, and
unintended, use
4
well
logs
How data moves through upstream Oil and Gas
Seismic surveys
Permanent seismic
Production sensors
Logging
seismic
imagery
metadata
event
location
well logs
sensor
streams
seismic and survey
data store
data sorting and
conditioning
QC/QA tools
seismic imaging
on HPC
• Data
processing
• CEP
• DSP
subsampled
data
fracture location
well
logs
hr-day
assimilation
sensor data store
model
building
and
testing
reservoir
modelling
ops
control
inter-
domain
analytics
subsurface
modelling
Well log
store
seismic
seismic
Bathymetry, Geospatial, Geology, Well completions, Historical data, Prediction, Maintenance,
Contractors, Logistics, Costs, External feeds, Human resources, HSE
production
modelling
5
MS
How data moves through upstream Oil and Gas
Seismic surveys
Permanent seismic
Production sensors
Logging
trial data
protocls
mapping
Raw MS
sensor
streams
structure and recipe store
data sorting and
conditioning
QC/QA tools
proteome
matching on
HPC
• Data
processing
• CEP
• DSP
subsampled
data
fracture location
MS
hr-day
assimilation
sensor data store
intra-
domain
analytics
intra-
domain
analytics
intra-
domain
analytics
intra-
domain
analytics
inter-
domain
analytics
chemical
modelling
MS
store
recipes
Patient Records, Drug Trials, Blind Studies, Historical data, Prediction, Maintenance, Contractors,
Logistics, Costs, External feeds, Human resources, HSE
Biopharma
6
Who maintains trust for us?
The Community Experts Rules Engines
• Provenance
• Versioning
• Sources
• Unique ID
Most big organisations can
afford teams who understand
the technical and scientific
domains and care enough to
“fight the good data fight”
The Data Guardians
7
The Architecture of Partnerships
Access Layer
User Layer
Us Them Knowledge
Data
• IP and legal departments manage parameters of knowledge sharing
extension of intra-organisational processes
licensing and sharing can be driven by data value (societal or economic)
• Technical challenge is in the physical and logical connectivity
Provenance and Quality are human-guaranteed
Semantic framework needs to describe data AND infrastructure
Source Layer
8
But what about using the data at
the time of querying?
• too voluminous
• needs API
• who pays for the clock cycles?
• relational v. non-relational
What can technology do for data publishing?
Access Layer
Query Layer
Us Them Knowledge
Data
Source Layer
Relational Databases allow:
• searching/filtering on metadata
• auditing and logging
• query recording
New ontologies
support “metadata”
discovery
“push” and
synchronisation
services
Massively Parallel Processing platforms
enable:
• scalable data processing at query time
• RESTful encapsulation of results
• caching of results summary for re-use
Provenance info locked
into proprietary
application formats
difficult to link internal
and external data
sources (IHS, Elsevier
Geofacets achieve this
to some extent)
9
• Who owns the data?
> Read the contract!
• What value does the community place on trust and what
cost are they prepared to pay?
> It is such a new area that value will outstrip cost for some time
> The challenge in the public sector is articulating the value and spreading
the cost when there are so many stakeholders
• What part do publishers play?
> Filter / Enabler
> Content aggregation
> Minimise provenance latency - Timeliness of usable knowledge
> Move from knowledge reporter to value enabler
• Robust data publishing in science-driven industries is
emerging as a massive channel opportunity to link:
Scientists
Decision makers
Equipment manufacturers
Technology vendors
The future

More Related Content

What's hot

Myths about data science and big data analytics
Myths about data science and big data analyticsMyths about data science and big data analytics
Myths about data science and big data analyticsChulalongkorn University
 
Generating actionable consumer insights from analytics
Generating actionable consumer insights from analyticsGenerating actionable consumer insights from analytics
Generating actionable consumer insights from analyticsMohammad Shazri Shahrir
 
Neo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j
 
Threat Hunting with Elastic at SpectorOps: Welcome to HELK
Threat Hunting with Elastic at SpectorOps: Welcome to HELKThreat Hunting with Elastic at SpectorOps: Welcome to HELK
Threat Hunting with Elastic at SpectorOps: Welcome to HELKElasticsearch
 
2016 Scope david cocker
2016 Scope david cocker2016 Scope david cocker
2016 Scope david cockerDavid Cocker
 
Derilinx - Supporting Open Data Publication
Derilinx - Supporting Open Data PublicationDerilinx - Supporting Open Data Publication
Derilinx - Supporting Open Data PublicationDerilinx
 
Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...John Kinmonth
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016Mark Yunger
 
CTO Perspectives: What's Next for Data Management and Healthcare?
CTO Perspectives: What's Next for Data Management and Healthcare?CTO Perspectives: What's Next for Data Management and Healthcare?
CTO Perspectives: What's Next for Data Management and Healthcare?Health Catalyst
 
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...PerkinElmer Informatics
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)Matt Barnes
 
Big Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit MehraBig Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit MehraData Con LA
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowskCura_Relativity
 
Genomics Applications in the Cloud with the DNAnexus Platform
Genomics Applications in the Cloud with the DNAnexus PlatformGenomics Applications in the Cloud with the DNAnexus Platform
Genomics Applications in the Cloud with the DNAnexus Platformkislyuk
 
Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j GraphDay Munich - Life & Health Sciences Intro to GraphsNeo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j GraphDay Munich - Life & Health Sciences Intro to GraphsNeo4j
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overviewjkvr
 
Enabling Better Clinical Operations through a Clinical Operations Store
Enabling Better Clinical Operations through a Clinical Operations StoreEnabling Better Clinical Operations through a Clinical Operations Store
Enabling Better Clinical Operations through a Clinical Operations StoreSaama
 
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...ICRISAT
 

What's hot (19)

Myths about data science and big data analytics
Myths about data science and big data analyticsMyths about data science and big data analytics
Myths about data science and big data analytics
 
Generating actionable consumer insights from analytics
Generating actionable consumer insights from analyticsGenerating actionable consumer insights from analytics
Generating actionable consumer insights from analytics
 
Neo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life Sciences
 
Threat Hunting with Elastic at SpectorOps: Welcome to HELK
Threat Hunting with Elastic at SpectorOps: Welcome to HELKThreat Hunting with Elastic at SpectorOps: Welcome to HELK
Threat Hunting with Elastic at SpectorOps: Welcome to HELK
 
White Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance AccessWhite Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance Access
 
2016 Scope david cocker
2016 Scope david cocker2016 Scope david cocker
2016 Scope david cocker
 
Derilinx - Supporting Open Data Publication
Derilinx - Supporting Open Data PublicationDerilinx - Supporting Open Data Publication
Derilinx - Supporting Open Data Publication
 
Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016
 
CTO Perspectives: What's Next for Data Management and Healthcare?
CTO Perspectives: What's Next for Data Management and Healthcare?CTO Perspectives: What's Next for Data Management and Healthcare?
CTO Perspectives: What's Next for Data Management and Healthcare?
 
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...
Democratizing Data Science: Balancing Flexibility and Usability for Scientifi...
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)
 
Big Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit MehraBig Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit Mehra
 
Building Information Governance Policies and Workflows
Building Information Governance Policies and WorkflowsBuilding Information Governance Policies and Workflows
Building Information Governance Policies and Workflows
 
Genomics Applications in the Cloud with the DNAnexus Platform
Genomics Applications in the Cloud with the DNAnexus PlatformGenomics Applications in the Cloud with the DNAnexus Platform
Genomics Applications in the Cloud with the DNAnexus Platform
 
Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j GraphDay Munich - Life & Health Sciences Intro to GraphsNeo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
Neo4j GraphDay Munich - Life & Health Sciences Intro to Graphs
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
 
Enabling Better Clinical Operations through a Clinical Operations Store
Enabling Better Clinical Operations through a Clinical Operations StoreEnabling Better Clinical Operations through a Clinical Operations Store
Enabling Better Clinical Operations through a Clinical Operations Store
 
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...
ICRISAT Global Planning Meeting 2019: Research Data Management by Abhishek Ra...
 

Viewers also liked

Hole-data journal-nfdp13
Hole-data journal-nfdp13Hole-data journal-nfdp13
Hole-data journal-nfdp13DataDryad
 
Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13DataDryad
 
Shotton force11-nfdp13
Shotton force11-nfdp13Shotton force11-nfdp13
Shotton force11-nfdp13DataDryad
 
Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13DataDryad
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13DataDryad
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
 
Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013DataDryad
 

Viewers also liked (7)

Hole-data journal-nfdp13
Hole-data journal-nfdp13Hole-data journal-nfdp13
Hole-data journal-nfdp13
 
Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13
 
Shotton force11-nfdp13
Shotton force11-nfdp13Shotton force11-nfdp13
Shotton force11-nfdp13
 
Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013
 

Similar to Irving-TeraData: data and science driven big industry-nfdp13

Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...Denodo
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Denodo
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it worldChris Dwan
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxGautamPopli1
 
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...Big Data Value Association
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Predictive Human Capital Analytics (1).pptx
Predictive Human Capital Analytics (1).pptxPredictive Human Capital Analytics (1).pptx
Predictive Human Capital Analytics (1).pptxSaminaNawaz14
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Starting From Scratch - the ELN Reality
Starting From Scratch - the ELN RealityStarting From Scratch - the ELN Reality
Starting From Scratch - the ELN RealityJohn Trigg
 
Noise to Signal - The Biggest Problem in Data
Noise to Signal - The Biggest Problem in DataNoise to Signal - The Biggest Problem in Data
Noise to Signal - The Biggest Problem in DataDATAVERSITY
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data SciencesSaama
 
Regulatory Intelligence
Regulatory IntelligenceRegulatory Intelligence
Regulatory IntelligenceArmin Torres
 
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphActivate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphDATAVERSITY
 
Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27Brian Ahier
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amirydatastack
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 

Similar to Irving-TeraData: data and science driven big industry-nfdp13 (20)

Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it world
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
 
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...
BDVe Webinar Series - Ocean Protocol – Why you need to care about how you sha...
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Predictive Human Capital Analytics (1).pptx
Predictive Human Capital Analytics (1).pptxPredictive Human Capital Analytics (1).pptx
Predictive Human Capital Analytics (1).pptx
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Starting From Scratch - the ELN Reality
Starting From Scratch - the ELN RealityStarting From Scratch - the ELN Reality
Starting From Scratch - the ELN Reality
 
Noise to Signal - The Biggest Problem in Data
Noise to Signal - The Biggest Problem in DataNoise to Signal - The Biggest Problem in Data
Noise to Signal - The Biggest Problem in Data
 
Next Gen Clinical Data Sciences
Next Gen Clinical Data SciencesNext Gen Clinical Data Sciences
Next Gen Clinical Data Sciences
 
Regulatory Intelligence
Regulatory IntelligenceRegulatory Intelligence
Regulatory Intelligence
 
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphActivate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
 
Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiry
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 

More from DataDryad

Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13DataDryad
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13DataDryad
 
Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13DataDryad
 
Mounce-Herding Cats
Mounce-Herding CatsMounce-Herding Cats
Mounce-Herding CatsDataDryad
 
Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13DataDryad
 
Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13DataDryad
 
Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13DataDryad
 
Costas-data metrics-nfdp13
Costas-data metrics-nfdp13Costas-data metrics-nfdp13
Costas-data metrics-nfdp13DataDryad
 
Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13DataDryad
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13DataDryad
 
Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13DataDryad
 
Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13DataDryad
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13DataDryad
 
Hodson-Introduction-nfdp13
Hodson-Introduction-nfdp13Hodson-Introduction-nfdp13
Hodson-Introduction-nfdp13DataDryad
 

More from DataDryad (14)

Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13
 
Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13
 
Mounce-Herding Cats
Mounce-Herding CatsMounce-Herding Cats
Mounce-Herding Cats
 
Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13
 
Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13
 
Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13
 
Costas-data metrics-nfdp13
Costas-data metrics-nfdp13Costas-data metrics-nfdp13
Costas-data metrics-nfdp13
 
Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13
 
Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13
 
Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13
 
Hodson-Introduction-nfdp13
Hodson-Introduction-nfdp13Hodson-Introduction-nfdp13
Hodson-Introduction-nfdp13
 

Recently uploaded

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 

Recently uploaded (20)

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 

Irving-TeraData: data and science driven big industry-nfdp13

  • 1. A view from science-driven “big industry” Duncan Irving, Oil and Gas Consulting Practice Lead, Teradata Fiona Murphy, Earth Science Journals Publisher, Wiley PARTNERSHIPS, TRUST, QUALITY @duncanirving
  • 2. 2 The pace of science-based industry what is an acceptable provenance latency if you cannot make a decision until trust has been established? seconds minutes hours days weeks “How do I know that a ‘fact’ has altered in my view of the world and when did it happen?” Leading Advisor (Global Subsurface Data Management), Statoil Facts Decision • hypothesis • experiment • model • interpretation • context
  • 3. 3 now: we publish knowledge + data Hypothesise Model Test Contextualise Publish Subject Area Drivers Experimental Methodologies Technical Approaches Direct Comparison Broader Context Relevance Publishing Categories or Degrees of Freedom? Hypothesise Model Contextualise Test Publish future: knowledge will be continuously updated* * with more attention to its intended, and unintended, use
  • 4. 4 well logs How data moves through upstream Oil and Gas Seismic surveys Permanent seismic Production sensors Logging seismic imagery metadata event location well logs sensor streams seismic and survey data store data sorting and conditioning QC/QA tools seismic imaging on HPC • Data processing • CEP • DSP subsampled data fracture location well logs hr-day assimilation sensor data store model building and testing reservoir modelling ops control inter- domain analytics subsurface modelling Well log store seismic seismic Bathymetry, Geospatial, Geology, Well completions, Historical data, Prediction, Maintenance, Contractors, Logistics, Costs, External feeds, Human resources, HSE production modelling
  • 5. 5 MS How data moves through upstream Oil and Gas Seismic surveys Permanent seismic Production sensors Logging trial data protocls mapping Raw MS sensor streams structure and recipe store data sorting and conditioning QC/QA tools proteome matching on HPC • Data processing • CEP • DSP subsampled data fracture location MS hr-day assimilation sensor data store intra- domain analytics intra- domain analytics intra- domain analytics intra- domain analytics inter- domain analytics chemical modelling MS store recipes Patient Records, Drug Trials, Blind Studies, Historical data, Prediction, Maintenance, Contractors, Logistics, Costs, External feeds, Human resources, HSE Biopharma
  • 6. 6 Who maintains trust for us? The Community Experts Rules Engines • Provenance • Versioning • Sources • Unique ID Most big organisations can afford teams who understand the technical and scientific domains and care enough to “fight the good data fight” The Data Guardians
  • 7. 7 The Architecture of Partnerships Access Layer User Layer Us Them Knowledge Data • IP and legal departments manage parameters of knowledge sharing extension of intra-organisational processes licensing and sharing can be driven by data value (societal or economic) • Technical challenge is in the physical and logical connectivity Provenance and Quality are human-guaranteed Semantic framework needs to describe data AND infrastructure Source Layer
  • 8. 8 But what about using the data at the time of querying? • too voluminous • needs API • who pays for the clock cycles? • relational v. non-relational What can technology do for data publishing? Access Layer Query Layer Us Them Knowledge Data Source Layer Relational Databases allow: • searching/filtering on metadata • auditing and logging • query recording New ontologies support “metadata” discovery “push” and synchronisation services Massively Parallel Processing platforms enable: • scalable data processing at query time • RESTful encapsulation of results • caching of results summary for re-use Provenance info locked into proprietary application formats difficult to link internal and external data sources (IHS, Elsevier Geofacets achieve this to some extent)
  • 9. 9 • Who owns the data? > Read the contract! • What value does the community place on trust and what cost are they prepared to pay? > It is such a new area that value will outstrip cost for some time > The challenge in the public sector is articulating the value and spreading the cost when there are so many stakeholders • What part do publishers play? > Filter / Enabler > Content aggregation > Minimise provenance latency - Timeliness of usable knowledge > Move from knowledge reporter to value enabler • Robust data publishing in science-driven industries is emerging as a massive channel opportunity to link: Scientists Decision makers Equipment manufacturers Technology vendors The future