SlideShare a Scribd company logo
1 of 13
ALIGNED Data Curation Methods and
Tools
Rob Brennan, ALIGNED Coordinator
SWIMing VoCamp Workshop,
Dublin, 22 March 2016
3/25/20162
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 644055.
This communication reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains.
Application
Users
Data
Harvesters
Dataset
Domain
Experts
Software
Developers
System
Admins
Data
Architects
Dev.
Managers
Software
Testers Data
Consumers
Software
Analysts
Implementation
Analysis
Planning
Maintenance
Software
Engineering
Lifecycle
Design
Manual
Revision
/ Author
Inter-
linking
/Fusing
Classify/
Enrich
Quality
Analysis
Evolve
/RepairSearch/
Browse/
Explore
Extract
Store/
Query
Data
Engineering
Lifecycle
System
Analysts
Overall Goal:
How can we get these guys to talk?
To improve: Productivity, Agility, Quality?
Data Quality and Data Curation in
ALIGNED
• Building high quality data-intensive systems
requires high quality datasets
• But
– Datasets are now first class citizens with lifecycles
that are independent of the consuming apps
– Quality still problematic
• We observe:
– Rich data models support quality engineering
– Linked Data entering the enterprise
ALIGNED Tools for Data Curation
Productivity, Agility, Quality
Data
Engineering
Data Quality
Validation
Unified
Process
Governance
Data Integrity
Assurance
Data
Integration
Assurance
Semi-
Supervised
Data Curation
See:
http://aligned-project.eu/open-source-tools/
https://www.poolparty.biz/
Linked Data
Extract,
Transform,
Load
Taxonomy
Management
Dataset Release
Automation
ALIGNED Validates in Real-World, Data
Intensive Systems
Global History
Databank
Legal Information
System
Nucleus for the
Web of Data
Semantic
Middleware
Data
Consumers
Community of experts &
Volunteers
Electronic Archives
Example: Seshat Target System
databases
Seshat
Databank
Collective
Intelligence
High
Quality
Open
Data
Feedback
“improve the extraction of collective
intelligence from electronic archives,
research communities and data consumers
to improve the quality of published data”
Seshat Data Web
Wiki
RDF Triple Store
Linked Data
Publication
User
Management
Schema
Management
tool
Wiki Data
Entry/Validati
on Tool
Errors
Data
Visualisations
Data
Transformations
Links to other
Datasets
Seshat Data
Web Pages
Read/query
Enter
Data
Validate
Candidate
Time Series
Analysis
Data Export
Tool
Data Dump
File (TSV )
Candidate
Generation/
Filtering tools
Seshat Editor Seshat AdministratorSeshat Contributors Seshat Analyst
Copy of
Seshat Data
Seshat Schema
Knowledge
Model
Seshat Data
Knowledge
Model
Seshat Reader
FeedbackView
Data
Data Quality
Controls
Read
Data
DBpedia
External candidate
source
Workflow
Management
Wiki
Generation
tool
generate
Global History Databank Pilot Data Curation System
Goal is to minimise
work requirements
from expert users
(domain expert,
architect) and to
ensure data-quality
in different
dimensions at
different steps in
the process.
Dacura: Generic, Quality-Oriented
Data Curation Process
Dacura Data Harvesting Interfaces
• Knowledge and Data Engineering Group/ADAPT Centre,
Trinity College Dublin
• Software Engineering Group,
University of Oxford
• Institute of Cognitive and Evolutionary Anthropology,
University of Oxford
• Agile Knowledge Engineering and Semantic Web Group
Universität Leipzig
• Semantic Web Company GmbH
• Content Strategy and Architecture Department,
Wolters Kluwer Germany,
Wolters Kluwer Poland
• Institute of Prehistory
Adam Mickiewicz University at Poznan
Partners
We want to help you!
The ALIGNED Consultancy Program
• Are you a business?
• Do any of these apply:
– Are you building data-intensive applications?
– Do you want to curate high quality data?
– Need help integrating Linked Data + apps?
– Want to integrate your software and data
engineering teams?
Call on the ALIGNED consultancy program!
http://aligned-project.eu/aligned-consultancy-program-opportunities/
Contact: rob.brennan@cs.tcd.ie
Web: http://www.aligned-project.eu
Twitter: @AlignedProject

More Related Content

What's hot

Streamlining deposit an ojs to repository plugin
Streamlining deposit an ojs to repository pluginStreamlining deposit an ojs to repository plugin
Streamlining deposit an ojs to repository plugin
Jisc
 
WP4 Overview (Standards)
WP4 Overview (Standards)WP4 Overview (Standards)
WP4 Overview (Standards)
vbrant
 

What's hot (20)

Northumbria University case study
Northumbria University case studyNorthumbria University case study
Northumbria University case study
 
20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overview20141030 LinDA Workshop echallenges2014 - LinDA project overview
20141030 LinDA Workshop echallenges2014 - LinDA project overview
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)
 
Adoption and Integration of Persistent Identifiers in European Research Infor...
Adoption and Integration of Persistent Identifiers in European Research Infor...Adoption and Integration of Persistent Identifiers in European Research Infor...
Adoption and Integration of Persistent Identifiers in European Research Infor...
 
The European Open Science Cloud
The European Open Science CloudThe European Open Science Cloud
The European Open Science Cloud
 
PID Services for FAIR data
PID Services for FAIR dataPID Services for FAIR data
PID Services for FAIR data
 
PID services - understandability and findability of data
PID services - understandability and findability of dataPID services - understandability and findability of data
PID services - understandability and findability of data
 
ICIC 2017: Building a Linked Data Knowledge Graph for the Scholarly Publishin...
ICIC 2017: Building a Linked Data Knowledge Graph for the Scholarly Publishin...ICIC 2017: Building a Linked Data Knowledge Graph for the Scholarly Publishin...
ICIC 2017: Building a Linked Data Knowledge Graph for the Scholarly Publishin...
 
Streamlining deposit an ojs to repository plugin
Streamlining deposit an ojs to repository pluginStreamlining deposit an ojs to repository plugin
Streamlining deposit an ojs to repository plugin
 
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
 
General Introduction to the Oxford e-Research Centre
General Introduction to the Oxford e-Research CentreGeneral Introduction to the Oxford e-Research Centre
General Introduction to the Oxford e-Research Centre
 
Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvement...
Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvement...Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvement...
Presentation ADEQUATe Project: Workshop on Quality Assessment and Improvement...
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
Rajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO ProjectRajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO Project
 
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
20140902 LinDa Workshop Semantincs2014 - LinDA Project Overview
 
WP4 Overview (Standards)
WP4 Overview (Standards)WP4 Overview (Standards)
WP4 Overview (Standards)
 
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
 
ODIN: Connecting research and researchers
ODIN: Connecting research and researchersODIN: Connecting research and researchers
ODIN: Connecting research and researchers
 
COUNTER Standards for Open Access: The Value of Measuring/The Measuring of Va...
COUNTER Standards for Open Access: The Value of Measuring/The Measuring of Va...COUNTER Standards for Open Access: The Value of Measuring/The Measuring of Va...
COUNTER Standards for Open Access: The Value of Measuring/The Measuring of Va...
 
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
 

Viewers also liked

Viewers also liked (6)

Data curation at Dryad Digital Repository: A former curator's perspective
Data curation at Dryad Digital Repository: A former curator's perspectiveData curation at Dryad Digital Repository: A former curator's perspective
Data curation at Dryad Digital Repository: A former curator's perspective
 
Wf4Ever: Scientific Workflows and Research Objects as tools for scientific in...
Wf4Ever: Scientific Workflows and Research Objects as tools for scientific in...Wf4Ever: Scientific Workflows and Research Objects as tools for scientific in...
Wf4Ever: Scientific Workflows and Research Objects as tools for scientific in...
 
How much control do you need to dance TANGO?
How much control do you need to dance TANGO?How much control do you need to dance TANGO?
How much control do you need to dance TANGO?
 
Library Data Management Services
Library Data Management ServicesLibrary Data Management Services
Library Data Management Services
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
Citizen Science in the era of the Square Kilometre Array
Citizen Science in the era of the Square Kilometre ArrayCitizen Science in the era of the Square Kilometre Array
Citizen Science in the era of the Square Kilometre Array
 

Similar to ALIGNED Data Curation Methods and Tools

Analyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentationAnalyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentation
AnalytixDataServices
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Wrangling RedCap_An Introduction and Inspiration
Wrangling RedCap_An Introduction and InspirationWrangling RedCap_An Introduction and Inspiration
Wrangling RedCap_An Introduction and Inspiration
Jacqueline Stern
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...
DataWorks Summit
 

Similar to ALIGNED Data Curation Methods and Tools (20)

Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
 
Future.ready().watson dataplatform 01
Future.ready().watson dataplatform 01Future.ready().watson dataplatform 01
Future.ready().watson dataplatform 01
 
Data Quality
Data QualityData Quality
Data Quality
 
Analyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentationAnalyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentation
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Learn Best Practices of a True Hybrid IT Management Approach
Learn Best Practices of a True Hybrid IT Management ApproachLearn Best Practices of a True Hybrid IT Management Approach
Learn Best Practices of a True Hybrid IT Management Approach
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
 
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
 
Wrangling RedCap_An Introduction and Inspiration
Wrangling RedCap_An Introduction and InspirationWrangling RedCap_An Introduction and Inspiration
Wrangling RedCap_An Introduction and Inspiration
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsA BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
 
Innovate2010 jazz keynote
Innovate2010 jazz keynoteInnovate2010 jazz keynote
Innovate2010 jazz keynote
 

Recently uploaded

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Recently uploaded (20)

Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 

ALIGNED Data Curation Methods and Tools

  • 1. ALIGNED Data Curation Methods and Tools Rob Brennan, ALIGNED Coordinator SWIMing VoCamp Workshop, Dublin, 22 March 2016
  • 2. 3/25/20162 This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 644055. This communication reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains.
  • 4. Data Quality and Data Curation in ALIGNED • Building high quality data-intensive systems requires high quality datasets • But – Datasets are now first class citizens with lifecycles that are independent of the consuming apps – Quality still problematic • We observe: – Rich data models support quality engineering – Linked Data entering the enterprise
  • 5. ALIGNED Tools for Data Curation Productivity, Agility, Quality Data Engineering Data Quality Validation Unified Process Governance Data Integrity Assurance Data Integration Assurance Semi- Supervised Data Curation See: http://aligned-project.eu/open-source-tools/ https://www.poolparty.biz/ Linked Data Extract, Transform, Load Taxonomy Management Dataset Release Automation
  • 6. ALIGNED Validates in Real-World, Data Intensive Systems Global History Databank Legal Information System Nucleus for the Web of Data Semantic Middleware
  • 7. Data Consumers Community of experts & Volunteers Electronic Archives Example: Seshat Target System databases Seshat Databank Collective Intelligence High Quality Open Data Feedback “improve the extraction of collective intelligence from electronic archives, research communities and data consumers to improve the quality of published data”
  • 8. Seshat Data Web Wiki RDF Triple Store Linked Data Publication User Management Schema Management tool Wiki Data Entry/Validati on Tool Errors Data Visualisations Data Transformations Links to other Datasets Seshat Data Web Pages Read/query Enter Data Validate Candidate Time Series Analysis Data Export Tool Data Dump File (TSV ) Candidate Generation/ Filtering tools Seshat Editor Seshat AdministratorSeshat Contributors Seshat Analyst Copy of Seshat Data Seshat Schema Knowledge Model Seshat Data Knowledge Model Seshat Reader FeedbackView Data Data Quality Controls Read Data DBpedia External candidate source Workflow Management Wiki Generation tool generate Global History Databank Pilot Data Curation System
  • 9. Goal is to minimise work requirements from expert users (domain expert, architect) and to ensure data-quality in different dimensions at different steps in the process. Dacura: Generic, Quality-Oriented Data Curation Process
  • 11. • Knowledge and Data Engineering Group/ADAPT Centre, Trinity College Dublin • Software Engineering Group, University of Oxford • Institute of Cognitive and Evolutionary Anthropology, University of Oxford • Agile Knowledge Engineering and Semantic Web Group Universität Leipzig • Semantic Web Company GmbH • Content Strategy and Architecture Department, Wolters Kluwer Germany, Wolters Kluwer Poland • Institute of Prehistory Adam Mickiewicz University at Poznan Partners
  • 12. We want to help you! The ALIGNED Consultancy Program • Are you a business? • Do any of these apply: – Are you building data-intensive applications? – Do you want to curate high quality data? – Need help integrating Linked Data + apps? – Want to integrate your software and data engineering teams? Call on the ALIGNED consultancy program! http://aligned-project.eu/aligned-consultancy-program-opportunities/