SlideShare a Scribd company logo
1 of 54
Paul Groth  |  Vrije Universiteit Amsterdam | pgroth@few.vu.nl Image: http://www.flickr.com/photos/tomk32/2988993409 / All images are under a creative commons license
Image: http://www.flickr.com/photos/lyza/2487848260/sizes/l/
Image: http://www.flickr.com/photos/gigi_murru/2757085392/sizes/l
[object Object],[object Object],[object Object],[object Object],3. http://www.flickr.com/photos/oskay/1364146497/sizes/m/ 2. http://www.flickr.com/photos/cwalker71/1041784395/sizes/l / 1. http://www.flickr.com/photos/restlessglobetrotter/448362507/sizes/m/ 1 2 3
[object Object],[object Object],[object Object]
Image: http://www.flickr.com/photos/davestfu/2157396025/sizes/l/ Image: http://www.flickr.com/photos/danielleblue/170497153/sizes/o/
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object]
1. Records 2. Turntables and mixers 3. Recording equipment
Image: http://www.flickr.com/photos/melodramababs/2446537799/sizes/l/
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],No more conversion components
Shared Techniques [WIKIAI’09 @IJCAI]
Open Task Repository
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Image: http://www.flickr.com/photos/danielleblue/170497153/sizes/o/
[object Object],[object Object],[object Object],[object Object],[object Object]
7/10/09 SWF 2009
[object Object],[object Object],[object Object],[object Object],[object Object]
Title: BLASTP with simplified results returned   Description: This workflow Performs a blastp search on protein sequence, extracts sequence id within the blast report and retrives the corresponding seuqences.[sic]  ≅
- myexperiment.org - 2300 users - 750 workflows - 160 groups
 
[IUI’09] [AAAI SS 09] [SWF 2009]
[object Object],[object Object],[object Object],[e-science 09]
Data (triples) How were they produced? Which ones should I trust? Who’s responsible? From Chris Bizer From pipes.deri.org
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],http://www.ifixit.com/Teardown/iPod-touch-3rd-Generation/1158/1
Image: http://www.flickr.com/photos/seidsvag/122718624/sizes/l/
[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[IEEE TPDS Groth 08]
[object Object],[object Object],[object Object],[object Object],[object Object],[ACM Toit 08: Groth, Moreau, Miles]
[e-Science 08]
from esaw09
http://www.flickr.com/photos/newbirth/2834643961/
 
Reputation
http://www.flickr.com/photos/el_ramon/3804532661/
Content http://www.flickr.com/photos/ogcodes/2095054686/
Content Nice Letterhead
Content Nice Letterhead Official Seal
Content Nice Letterhead Official Seal A particular statement is present
Content Nice Letterhead Official Seal ≈ A particular statement is present
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object]
[object Object],[esaw 09]
[object Object],Trust of new workflow components
The  Community http://www.flickr.com/photos/dunechaser/142079357/sizes/o/
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],1. Data and Data Discovery 2. Component exposure and composition 3. Process capture and organization
[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

Viewers also liked

Bleach 386
Bleach 386Bleach 386
Bleach 386
Elfam
 
Article Pengenalan Konsep Xml Web Services
Article Pengenalan Konsep Xml Web ServicesArticle Pengenalan Konsep Xml Web Services
Article Pengenalan Konsep Xml Web Services
Fredy Budimansyah
 
[1041] basileios boulgaroktonos
[1041] basileios boulgaroktonos[1041] basileios boulgaroktonos
[1041] basileios boulgaroktonos
Petros Michailidis
 
Keynote Steve Outram Elluminate 12 Nov
Keynote   Steve Outram   Elluminate 12 NovKeynote   Steve Outram   Elluminate 12 Nov
Keynote Steve Outram Elluminate 12 Nov
JISC SSBR
 
Romanian questionnaire 5 8-2011
Romanian questionnaire 5 8-2011Romanian questionnaire 5 8-2011
Romanian questionnaire 5 8-2011
Petros Michailidis
 

Viewers also liked (16)

Hxh291
Hxh291Hxh291
Hxh291
 
Bleach 386
Bleach 386Bleach 386
Bleach 386
 
9
99
9
 
University Of Michigan
University Of MichiganUniversity Of Michigan
University Of Michigan
 
Salary Exchange
Salary ExchangeSalary Exchange
Salary Exchange
 
mc
mcmc
mc
 
Article Pengenalan Konsep Xml Web Services
Article Pengenalan Konsep Xml Web ServicesArticle Pengenalan Konsep Xml Web Services
Article Pengenalan Konsep Xml Web Services
 
The Socializers - A Thousand True Fans - IMH Communications 2011 Cyprus
The Socializers - A Thousand True Fans - IMH Communications 2011 CyprusThe Socializers - A Thousand True Fans - IMH Communications 2011 Cyprus
The Socializers - A Thousand True Fans - IMH Communications 2011 Cyprus
 
[1041] basileios boulgaroktonos
[1041] basileios boulgaroktonos[1041] basileios boulgaroktonos
[1041] basileios boulgaroktonos
 
Texas S Ta R Chart
Texas S Ta R ChartTexas S Ta R Chart
Texas S Ta R Chart
 
Uk graphs-2011
Uk graphs-2011Uk graphs-2011
Uk graphs-2011
 
Greek 1 4-2011
Greek  1 4-2011Greek  1 4-2011
Greek 1 4-2011
 
Does your brand need a mobile strategy? (Digiday Brand Summit 2012)
Does your brand need a mobile strategy? (Digiday Brand Summit 2012)Does your brand need a mobile strategy? (Digiday Brand Summit 2012)
Does your brand need a mobile strategy? (Digiday Brand Summit 2012)
 
Keynote Steve Outram Elluminate 12 Nov
Keynote   Steve Outram   Elluminate 12 NovKeynote   Steve Outram   Elluminate 12 Nov
Keynote Steve Outram Elluminate 12 Nov
 
Impresii icl2012
Impresii icl2012Impresii icl2012
Impresii icl2012
 
Romanian questionnaire 5 8-2011
Romanian questionnaire 5 8-2011Romanian questionnaire 5 8-2011
Romanian questionnaire 5 8-2011
 

Similar to I want to be a Data DJ!

Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
edgar6wallace88877
 
Rivera_COSC880_Presentation
Rivera_COSC880_PresentationRivera_COSC880_Presentation
Rivera_COSC880_Presentation
Emanuel Rivera
 
Better integrations through open interfaces
Better integrations through open interfacesBetter integrations through open interfaces
Better integrations through open interfaces
Steve Speicher
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 

Similar to I want to be a Data DJ! (20)

Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
 
Rock Overview
Rock OverviewRock Overview
Rock Overview
 
How do I aggregate oers
How do I aggregate oersHow do I aggregate oers
How do I aggregate oers
 
Object oriented software_engg
Object oriented software_enggObject oriented software_engg
Object oriented software_engg
 
Overview of modern software ecosystem for big data analysis
Overview of modern software ecosystem for big data analysisOverview of modern software ecosystem for big data analysis
Overview of modern software ecosystem for big data analysis
 
IoT Open Platforms
IoT Open PlatformsIoT Open Platforms
IoT Open Platforms
 
Cloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injectionCloud Reliability: Decreasing outage frequency using fault injection
Cloud Reliability: Decreasing outage frequency using fault injection
 
OpenTelemetry 101 FTW
OpenTelemetry 101 FTWOpenTelemetry 101 FTW
OpenTelemetry 101 FTW
 
Top10 Characteristics of Awesome Apps
Top10 Characteristics of Awesome AppsTop10 Characteristics of Awesome Apps
Top10 Characteristics of Awesome Apps
 
[WSO2 Summit Sydney 2019] Emerging Architecture Patterns: API-centric and Cel...
[WSO2 Summit Sydney 2019] Emerging Architecture Patterns: API-centric and Cel...[WSO2 Summit Sydney 2019] Emerging Architecture Patterns: API-centric and Cel...
[WSO2 Summit Sydney 2019] Emerging Architecture Patterns: API-centric and Cel...
 
Modeling and Simulation of Parallel and Distributed Computing Systems with Si...
Modeling and Simulation of Parallel and Distributed Computing Systems with Si...Modeling and Simulation of Parallel and Distributed Computing Systems with Si...
Modeling and Simulation of Parallel and Distributed Computing Systems with Si...
 
5 Thomas Magedanz
5  Thomas Magedanz5  Thomas Magedanz
5 Thomas Magedanz
 
Rivera_COSC880_Presentation
Rivera_COSC880_PresentationRivera_COSC880_Presentation
Rivera_COSC880_Presentation
 
Better integrations through open interfaces
Better integrations through open interfacesBetter integrations through open interfaces
Better integrations through open interfaces
 
A Web-­Based Simulator for a Discrete Manufacturing System
A Web-­Based Simulator for a Discrete  Manufacturing SystemA Web-­Based Simulator for a Discrete  Manufacturing System
A Web-­Based Simulator for a Discrete Manufacturing System
 
IRJET - Automation in Python using Speech Recognition
IRJET -  	  Automation in Python using Speech RecognitionIRJET -  	  Automation in Python using Speech Recognition
IRJET - Automation in Python using Speech Recognition
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Monitoring in 2017 - TIAD Camp Docker
Monitoring in 2017 - TIAD Camp DockerMonitoring in 2017 - TIAD Camp Docker
Monitoring in 2017 - TIAD Camp Docker
 
Prometheus: From technical metrics to business observability
Prometheus: From technical metrics to business observabilityPrometheus: From technical metrics to business observability
Prometheus: From technical metrics to business observability
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environments
 

More from Paul Groth

More from Paul Groth (20)

Data Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AIData Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AI
 
Content + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningContent + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learning
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Minimal viable-datareuse-czi
Minimal viable-datareuse-cziMinimal viable-datareuse-czi
Minimal viable-datareuse-czi
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Knowledge Graph Futures
Knowledge Graph FuturesKnowledge Graph Futures
Knowledge Graph Futures
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of Data
 
End-to-End Learning for Answering Structured Queries Directly over Text
End-to-End Learning for  Answering Structured Queries Directly over Text End-to-End Learning for  Answering Structured Queries Directly over Text
End-to-End Learning for Answering Structured Queries Directly over Text
 
From Data Search to Data Showcasing
From Data Search to Data ShowcasingFrom Data Search to Data Showcasing
From Data Search to Data Showcasing
 
Elsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphElsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge Graph
 
The Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for ScienceThe Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for Science
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
 
Diversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsDiversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domains
 
Progressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computationProgressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computation
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge Graphs
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
 
Knowledge graph construction for research & medicine
Knowledge graph construction for research & medicineKnowledge graph construction for research & medicine
Knowledge graph construction for research & medicine
 

Recently uploaded

Recently uploaded (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 

I want to be a Data DJ!

Editor's Notes

  1. Title: I want to be a Data DJ! Abstract: This talk provides an overview of my work towards enabling Data DJs. That is enabling users to create, remix, record, and share their data analyses as easily as DJs make and share mixes. The talk touches on a variety of topics including linked data, scientific workflows, provenance, enterprise mashups and Facebook. It draws these topics into a unified research framework and discusses future research directions.
  2. Because of I want an audience….not really….
  3. Records Simple components (effects, fades) chained together: workflow Whole albums of dj Creativity (through on a new record – backtrack) – fast to novelty You can continually improve because it’s easy to revisit and remix The ability to remix enables combinatorial innovation
  4. Intuitevly….
  5. 1800: Interchangeable parts 1900: Gasoline engine 1960: Integrated circuits 1995-now: Internet
  6. Web services is lower case because not about SOA… Flickr, Google Maps, Twitter,
  7. not easy enough for the user… or developers
  8. Records = data and data discovery Turntables = components and composition Recording = capturing what’s gone on
  9. Data
  10. Common apis= sparql and rdf Things like factual and yql Machine readable data on the web
  11. Common apis= “sparql and api
  12. I see that there is a technique called “drive across country” and I go ahead and import it.
  13. Also if we extract information this is exposed as its own RDF triple. (see the references field)
  14. RDF Query Answering using Evolutionary Algorithms
  15. Fault-tolerance Data movement Provenance tracking Validation Component Discovery Reproduction
  16. A proliferation of boxes and arrow diagrams
  17. Natural instruction…
  18. How do people “naturally describe workflows”? Study with myExperiment workflows
  19. - Workflow for estimating the maximum accuracy of a model for a set of test data
  20. Linked data + mashup (workflow) = a new cool application, but then what? Need for provenance
  21. IPOD has 451 parts provided by 10 suppliers… but apple trusts all of them http://pcic.merage.uci.edu/papers/2007/AppleiPod.pdf http://people.ischool.berkeley.edu/~hal/people/hal/NYTimes/2007-06-28.html The problem is not mixing and matching components the problem is the need for provenance
  22. Get applications to record process documentation! Log data ! But the key here is to structure that data….
  23. Guarantees that documentation will be captured… Attributable, finalizable, process reflecting, You can also just use log4j
  24. Say it’s an
  25. Condor dag…. Number of jobs
  26. How many people have cell phones? How many people understand their cell phone contract?
  27. I trust the contract because people I know have told me the
  28. Mechanism design, trust because of enforcement
  29. Trust based on the artifact itself
  30. Availability of support for example
  31. Trust based on experience… what you’ve seen before
  32. Note that this is not to say these can’t work together