Process Mining
Data Science in Action
Wil van der Aalst
www.vdaalst.com @wvdaalst
www.processmining.org
… , but data science is here to stay!
Data Science Center Eindhoven
http://www.tue.nl/dsce/
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
DSC/e: Competences and Research Programs
28 groups and 420+ people involved
Data Science Flagship (Philips & DSC/e)
4 Strategic topics
• Data Driven Value Propositions
• Healthcare Smart Maintenance
• Optimizing Healthcare Workflows
• Continuous Personal Health
4 TU/e departments
16 PhD students
30 Data science specialists
“Data Science University” in Den Bosch
Process Mining: On the interface between process science and data science
As generic as a spreadsheet!
Spreadsheet: Killer App for early computers
• VisiCalc (killer app for Apple II, Oct. 1979)
• Lotus 1-2-3 (killer app for IBM PC, 1983)
• Microsoft Excel (1985)
Spreadsheet: Static data
(Figure labels: fact, derived; 31 items sold, total value, average, distribution)
How to analyze operational processes?
Process Mining: Spreadsheet for behavior
• Input: events (“things that have happened”)
• Mandatory per event:
− case identifier
− activity name
− timestamp/date
• Optional:
− resource
− transaction type
− costs
− …
(Figure: event table with columns case identifier, activity name, timestamp, resource; row = event)
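The attribute list above can be sketched as a small record type. This is only an illustration, assuming Python: the names Event, MANDATORY, and event_from_row are hypothetical, and the optional attributes are simply collected in a dictionary.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any, Dict

# The three attributes the slide marks as mandatory (field names are assumptions).
MANDATORY = ("case_id", "activity", "timestamp")


@dataclass
class Event:
    case_id: str
    activity: str
    timestamp: datetime
    # Optional attributes such as resource, transaction type, or costs.
    extra: Dict[str, Any] = field(default_factory=dict)


def event_from_row(row: Dict[str, Any]) -> Event:
    """Turn one spreadsheet row into an Event, rejecting incomplete rows."""
    missing = [key for key in MANDATORY if row.get(key) in (None, "")]
    if missing:
        raise ValueError(f"row lacks mandatory attribute(s): {missing}")
    extra = {k: v for k, v in row.items() if k not in MANDATORY}
    return Event(str(row["case_id"]), str(row["activity"]), row["timestamp"], extra)


example = event_from_row({
    "case_id": "5781",
    "activity": "CT scan",
    "timestamp": datetime(2014, 1, 23, 11, 10),
    "resource": "Dr. Fox",
    "costs": 1200.00,
})
```

A row without a case identifier, activity name, or timestamp raises ValueError instead of entering the log.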
Process Mining: Spreadsheet for behavior
208 cases
5987 events
74 activities
Process Mining: Spreadsheet for behavior
batching for activities “opstellen eindnota” (draw up final invoice) and “archiveren” (archive)
Loesje van der Aalst
desire line
Process Discovery
Process Mining: Spreadsheet for behavior
process discovery
NO modeling needed!
74 activities, 11 activities, 3 activities
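One simple ingredient of process discovery is counting which activity directly follows which. The sketch below, assuming Python, is not the algorithm of any particular tool on these slides. The activity names and the helper functions are hypothetical. Projecting the log onto its most frequent activities is one plausible way to obtain a coarser view with fewer activities.

```python
from collections import Counter


def directly_follows(traces):
    """Count how often activity a is directly followed by activity b."""
    counts = Counter()
    for trace in traces:
        counts.update(zip(trace, trace[1:]))
    return counts


def keep_most_frequent(traces, k):
    """Project every trace onto the k most frequent activities."""
    freq = Counter(act for trace in traces for act in trace)
    keep = {act for act, _ in freq.most_common(k)}
    return [[act for act in trace if act in keep] for trace in traces]


# Hypothetical log: one list of activity names per case.
traces = [
    ["register", "check", "decide", "pay"],
    ["register", "check", "check", "decide", "reject"],
    ["register", "decide", "pay"],
]

dfg = directly_follows(traces)                           # all 5 activities
coarse = directly_follows(keep_most_frequent(traces, 3)) # 3 most frequent only
```

The full graph contains arcs such as ("decide", "pay"); the coarse one keeps only arcs between "register", "check", and "decide".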
event data
process model
Conformance Checking
desire line
very safe system
Conformance Checking
Process Mining: Spreadsheet for behavior
conformance checking
?
discovered or hand-made
Process Mining: Spreadsheet for behavior
conformance checking
fitness of 93.5%
Process Mining: Spreadsheet for behavior
conformance checking
final inspection is skipped 40 times
Process Mining: Spreadsheet for behavior
conformance checking
move on model (something should have happened, but did not)
move on log (something happened that should not happen)
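The two kinds of deviation can be illustrated for the simplest possible case. This sketch assumes Python and a model that allows exactly one sequence of activities. It uses difflib from the standard library, whose matching is a heuristic and not an optimal alignment as computed by conformance checking tools. The activity names and classify_moves are hypothetical.

```python
from difflib import SequenceMatcher


def classify_moves(model, trace):
    """Label each step as sync, move on model, or move on log."""
    moves = []
    matcher = SequenceMatcher(a=model, b=trace, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Model and log agree on these activities.
            moves += [("sync", act) for act in model[i1:i2]]
        else:
            # Expected by the model but absent from the log.
            moves += [("move on model", act) for act in model[i1:i2]]
            # Observed in the log but not allowed by the model.
            moves += [("move on log", act) for act in trace[j1:j2]]
    return moves


model = ["receive order", "produce", "final inspection", "ship"]
trace = ["receive order", "produce", "ship", "send reminder"]
moves = classify_moves(model, trace)
```

Here the skipped "final inspection" shows up as a move on model, and the unexpected "send reminder" as a move on log.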
Process Mining: Spreadsheet for behavior
performance analysis
average flow time is 1.92 months
bottleneck
NO modeling needed!
Process Mining: Spreadsheet for behavior
performance analysis
waiting time of 15.74 days
NO modeling needed!
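Figures like these can be derived from timestamps alone. A minimal sketch, assuming Python and a hypothetical five-event log: flow time is taken as last minus first timestamp per case, and the gap between consecutive events of a case serves as a rough stand-in for waiting time (with one timestamp per event, waiting and service time cannot be separated).

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical events: (case id, activity name, timestamp).
events = [
    ("c1", "register", datetime(2015, 1, 5, 9, 0)),
    ("c1", "decide", datetime(2015, 1, 7, 9, 0)),
    ("c1", "pay", datetime(2015, 1, 8, 9, 0)),
    ("c2", "register", datetime(2015, 1, 6, 9, 0)),
    ("c2", "decide", datetime(2015, 1, 10, 9, 0)),
]

SECONDS_PER_DAY = 86400


def timestamps_per_case(events):
    """Group timestamps by case and sort them chronologically."""
    by_case = defaultdict(list)
    for case_id, _activity, stamp in events:
        by_case[case_id].append(stamp)
    return {case: sorted(stamps) for case, stamps in by_case.items()}


def mean_flow_time_days(events):
    """Average time between the first and last event of each case."""
    cases = timestamps_per_case(events)
    return mean((s[-1] - s[0]).total_seconds() / SECONDS_PER_DAY
                for s in cases.values())


def mean_gap_days(events):
    """Average time between consecutive events within a case."""
    cases = timestamps_per_case(events)
    return mean((b - a).total_seconds() / SECONDS_PER_DAY
                for s in cases.values() for a, b in zip(s, s[1:]))


flow = mean_flow_time_days(events)
gap = mean_gap_days(events)
```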
Process Mining: Spreadsheet for behavior
animating reality
real cases
NO modeling needed!
Process Mining: Spreadsheet for behavior
animating reality
16 cases are queueing
Deviations
Where?
Why?
time
costs
…
How to get started?
• Event Data
• Process Mining Tools
• Data Science Mindset
Starting point for process mining: Event data

patient  activity           timestamp        doctor      age  cost
5781     make X-ray         23-1-2014@10.30  Dr. Jones   45   70.00
5541     blood test         23-1-2014@10.18  Dr. Scott   61   40.00
5833     blood test         23-1-2014@10.27  Dr. Scott   24   40.00
5781     blood test         23-1-2014@10.49  Dr. Scott   45   40.00
5781     CT scan            23-1-2014@11.10  Dr. Fox     45   1200.00
5833     surgery            23-1-2014@12.34  Dr. Scott   24   2300.00
5781     handle payment     23-1-2014@12.41  Carol Hope  45   0.00
5541     radiation therapy  23-1-2014@13.57  Dr. Jones   61   140.00
5541     radiation therapy  23-1-2014@13.08  Dr. Jones   61   140.00
…        …                  …                …           …    …

Column roles: patient = case id, activity = activity name, timestamp = timestamp, doctor = resource, age and cost = other data

Such data is everywhere (databases, ERP/CRM/HIS/… systems, transaction logs, messaging, social media, etc.)
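As a sketch of how rows like these become input for process mining, the Python snippet below groups the nine visible rows by patient and orders each case by timestamp. The timestamp layout day-month-year@hour.minute is an assumption read off the slide, and ROWS and traces_by_case are illustrative names.

```python
from collections import defaultdict
from datetime import datetime

# The nine rows shown on the slide:
# (patient, activity, timestamp, doctor, age, cost).
ROWS = [
    ("5781", "make X-ray", "23-1-2014@10.30", "Dr. Jones", 45, 70.00),
    ("5541", "blood test", "23-1-2014@10.18", "Dr. Scott", 61, 40.00),
    ("5833", "blood test", "23-1-2014@10.27", "Dr. Scott", 24, 40.00),
    ("5781", "blood test", "23-1-2014@10.49", "Dr. Scott", 45, 40.00),
    ("5781", "CT scan", "23-1-2014@11.10", "Dr. Fox", 45, 1200.00),
    ("5833", "surgery", "23-1-2014@12.34", "Dr. Scott", 24, 2300.00),
    ("5781", "handle payment", "23-1-2014@12.41", "Carol Hope", 45, 0.00),
    ("5541", "radiation therapy", "23-1-2014@13.57", "Dr. Jones", 61, 140.00),
    ("5541", "radiation therapy", "23-1-2014@13.08", "Dr. Jones", 61, 140.00),
]

# Assumed layout of the timestamp column: day-month-year@hour.minute.
STAMP_FORMAT = "%d-%m-%Y@%H.%M"


def traces_by_case(rows):
    """Group rows by case id and return each case's activities in time order."""
    by_case = defaultdict(list)
    for patient, activity, stamp, *_other in rows:
        by_case[patient].append((datetime.strptime(stamp, STAMP_FORMAT), activity))
    return {case: [act for _, act in sorted(evts)] for case, evts in by_case.items()}


traces = traces_by_case(ROWS)
```

Sorting within each case matters because rows need not arrive in chronological order: the last two rows of the table are listed with 13.57 before 13.08.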
Process Mining Software
900+ plug-ins available covering the whole process mining spectrum
Process Mining
Data Science in Action
43,000 + 25,000 people joined!
Starts again on October 7th 2015!
Register via https://www.coursera.org/course/procmin
Conclusion
http://www.tue.nl/dsce/
Get started today!
spreadsheet for behavior
data-oriented analysis (data mining, machine learning, business intelligence)
process model analysis (simulation, verification, optimization, gaming, etc.)
performance-oriented questions, problems and solutions
compliance-oriented questions, problems and solutions

 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 

Recently uploaded (17)

The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 

Big Data Expo 2015 - Data Science Center Eindhoven

  • 1. Process Mining Data Science in Action Wil van der Aalst www.vdaalst.com @wvdaalst www.processmining.org
  • 2. … , but data science is here to stay!
  • 3. Data Science Center Eindhoven http://www.tue.nl/dsce/
  • 4. ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) DSC/e: Competences and Research Programs 28 groups and 420+ people involved
  • 5. Data Science Flagship (Philips & DSC/e) 4 Strategic topics •Data Driven Value Propositions •Healthcare Smart Maintenance •Optimizing Healthcare Workflows •Continuous Personal Health 4 TU/e departments 16 PhD students 30 Data science specialists
  • 6. “Data Science University” in Den Bosch
  • 7. Process Mining: On the interface between process science and data science
  • 8. As generic as a spreadsheet!
  • 9. Spreadsheet: Killer App for early computers • VisiCalc (killer app for Apple II, Oct. 1979) • Lotus 1-2-3 (killer app for IBM PC 1983) • Microsoft Excel (1985)
  • 10. Spreadsheet: Static data
  • 11. Spreadsheet: Static data fact derived
  • 12. Spreadsheet: Static data 31 items sold total value average distribution
  • 13. Spreadsheet: Static data How to analyze operational processes?
  • 14. Process Mining: Spreadsheet for behavior • Input: events (“things that have happened”) • Mandatory per event: − case identifier − activity name − timestamp/date • Optional: − resource − transaction type − costs − … Column labels: case identifier, activity name, timestamp, resource; row = event
  • 15. Process Mining: Spreadsheet for behavior 208 cases 5987 events 74 activities
  • 16. Process Mining: Spreadsheet for behavior batching for activities “opstellen eindnota” and “archiveren”
  • 17. Loesje van der Aalst desire line Process Discovery
  • 18. Process Mining: Spreadsheet for behavior process discovery NO modeling needed!
  • 19. Process Mining: Spreadsheet for behavior process discovery NO modeling needed! 74 act. 11 act. 3 act.
  • 20. event data / process model Conformance Checking
  • 21. desire line very safe system Conformance Checking
  • 22. Process Mining: Spreadsheet for behavior conformance checking ? discovered or hand-made
  • 23. Process Mining: Spreadsheet for behavior conformance checking fitness of 93.5%
  • 24. Process Mining: Spreadsheet for behavior conformance checking final inspection is skipped 40 times
  • 25. Process Mining: Spreadsheet for behavior conformance checking move on model (something should have happened, but did not) move on log (something happened that should not happen)
  • 26. Process Mining: Spreadsheet for behavior performance analysis average flowtime is 1.92 months bottleneck NO modeling needed!
  • 27. Process Mining: Spreadsheet for behavior performance analysis waiting time of 15.74 days NO modeling needed!
  • 28. Process Mining: Spreadsheet for behavior animating reality real cases NO modeling needed!
  • 29. Process Mining: Spreadsheet for behavior 16 cases are queueing animating reality
  • 31. How to get started? • Event Data • Process Mining Tools • Data Science Mindset
  • 32. Starting point for process mining: Event data
        patient  activity           timestamp        doctor      age  cost
        5781     make X-ray         23-1-2014@10.30  Dr. Jones   45   70.00
        5541     blood test         23-1-2014@10.18  Dr. Scott   61   40.00
        5833     blood test         23-1-2014@10.27  Dr. Scott   24   40.00
        5781     blood test         23-1-2014@10.49  Dr. Scott   45   40.00
        5781     CT scan            23-1-2014@11.10  Dr. Fox     45   1200.00
        5833     surgery            23-1-2014@12.34  Dr. Scott   24   2300.00
        5781     handle payment     23-1-2014@12.41  Carol Hope  45   0.00
        5541     radiation therapy  23-1-2014@13.57  Dr. Jones   61   140.00
        5541     radiation therapy  23-1-2014@13.08  Dr. Jones   61   140.00
        …        …                  …                …           …    …
        Column roles: patient = case id, activity = activity name, timestamp = timestamp, doctor = resource, age and cost = other data.
        Such data is everywhere (databases, ERP/CRM/HIS/… systems, transaction logs, messaging, social media, etc.)
  • 33. How to get started? • Event Data • Process Mining Tools • Data Science Mindset
  • 34. Process Mining Software
  • 35. 900+ plug-ins available covering the whole process mining spectrum
  • 37. How to get started? • Event Data • Process Mining Tools • Data Science Mindset
  • 38. Process Mining Data Science in Action 43.000+25.000 people joined! Starts again on October 7th 2015! Register via https://www.coursera.org/course/procmin
  • 39. Conclusion http://www.tue.nl/dsce/ Get started today! spreadsheet for behavior • data-oriented analysis (data mining, machine learning, business intelligence) • process model analysis (simulation, verification, optimization, gaming, etc.) • performance-oriented questions, problems and solutions • compliance-oriented questions, problems and solutions
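
The event table on slide 32 and the mandatory fields on slide 14 (case identifier, activity name, timestamp) can be illustrated with a minimal sketch in plain Python. It is not the method of any tool shown in the deck: it only groups the sample rows by case, orders them by timestamp, counts which activity directly follows which (a common first step towards a discovered process map), and computes a per-case flow time. The function names and the choice of minutes as the unit are assumptions made for this illustration.

```python
from collections import Counter, defaultdict
from datetime import datetime

# Sample rows from slide 32: (case id, activity name, timestamp, resource).
EVENTS = [
    ("5781", "make X-ray", "23-1-2014@10.30", "Dr. Jones"),
    ("5541", "blood test", "23-1-2014@10.18", "Dr. Scott"),
    ("5833", "blood test", "23-1-2014@10.27", "Dr. Scott"),
    ("5781", "blood test", "23-1-2014@10.49", "Dr. Scott"),
    ("5781", "CT scan", "23-1-2014@11.10", "Dr. Fox"),
    ("5833", "surgery", "23-1-2014@12.34", "Dr. Scott"),
    ("5781", "handle payment", "23-1-2014@12.41", "Carol Hope"),
    ("5541", "radiation therapy", "23-1-2014@13.57", "Dr. Jones"),
    ("5541", "radiation therapy", "23-1-2014@13.08", "Dr. Jones"),
]


def parse_timestamp(text):
    """Parse the slide's 'day-month-year@hour.minute' notation."""
    return datetime.strptime(text, "%d-%m-%Y@%H.%M")


def build_traces(events):
    """Group events by case id and order each case by timestamp."""
    cases = defaultdict(list)
    for case_id, activity, timestamp, _resource in events:
        cases[case_id].append((parse_timestamp(timestamp), activity))
    return {case_id: sorted(rows) for case_id, rows in cases.items()}


def directly_follows(traces):
    """Count how often activity a is immediately followed by activity b."""
    counts = Counter()
    for rows in traces.values():
        activities = [activity for _, activity in rows]
        counts.update(zip(activities, activities[1:]))
    return counts


def flow_time_minutes(traces):
    """Minutes between the first and the last event of each case."""
    return {
        case_id: (rows[-1][0] - rows[0][0]).total_seconds() / 60
        for case_id, rows in traces.items()
    }


traces = build_traces(EVENTS)
dfg = directly_follows(traces)
flow = flow_time_minutes(traces)

for (source, target), count in sorted(dfg.items()):
    print(f"{source} -> {target}: {count}")
for case_id, minutes in sorted(flow.items()):
    print(f"case {case_id}: {minutes:.0f} minutes")
```

Because every row carries a case identifier and a timestamp, no process model has to be drawn by hand before these counts and durations can be derived; the optional columns (resource, age, cost) are simply ignored here.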