SlideShare a Scribd company logo
1 of 24
CONFIDENTIAL © 2019
Why I love Spark

Jean Georges “JG" Perrin
February 11th 2019
v100
CONFIDENTIAL © 2019
Why I love Spark

(and all it does for IBM
products)
Jean Georges “JG" Perrin
February 11th 2019
v100
CONFIDENTIAL © 2019
JGP • Jean Georges Perrin
• @jgperrin
• Chapel Hill, NC
• I ! SW since 1983
• #Knowledge = 

𝑓 ( ∑ (#SmallData, #BigData), #DataScience)

& #Software 
• #IBMChampion x11 • #KeepLearning
• @ http://jgp.net
CONFIDENTIAL © 2019
CONFIDENTIAL © 2019
Analytics operating system
CONFIDENTIAL © 2019
An analytics operating system?
Hardware
OS
Apps
CONFIDENTIAL © 2019
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
CONFIDENTIAL © 2019
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
Apps
CONFIDENTIAL © 2019
Apps
Analytics
Distrib.
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
CONFIDENTIAL © 2019
Apps
Analytics
Distrib.
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
HardwareHardware
OS OS
CONFIDENTIAL © 2019
Apps
Analytics
Distrib.
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
Distributed OS
Analytics OS
HardwareHardware
OS OS
CONFIDENTIAL © 2019
Apps
Analytics
Distrib.
An analytics operating system?
Hardware
OS
Apps
HardwareHardware
OS OS
Distributed OS
Analytics OS
Apps
HardwareHardware
OS OS
CONFIDENTIAL © 2019
An analytics operating system?
HardwareHardware
OS OS
Distributed OS
Analytics OS
Apps
{
CONFIDENTIAL © 2019
An analytics operating system?
HardwareHardware
OS OS
Distributed OS
Analytics OS
Apps
{
CONFIDENTIAL © 2019
An analytics operating system?
HardwareHardware
OS OS
Distributed OS
Analytics OS
Apps
{
CONFIDENTIAL © 2019
There are two kinds of data
scientists:
1) Those who can extrapolate
from incomplete data.
-The Internet
CONFIDENTIAL © 2019
Unified API
Data Science Data Engineering
InfoSphere
Information AnalyzerDb2 Event Store
Watson Knowledge
CatalogWatson Data Studio
DataStage Flow
Designer…
Watson Knowledge
Catalog
Cloud Private for Data
…
SparkBench
What kind of applications?
CONFIDENTIAL © 2018
DATA
Engineer
DATA
Scientist
Adapted from: https://www.datacamp.com/community/blog/data-scientist-vs-data-engineer
Develop, build, test, and operationalize
datastores and large-scale processing
systems.
DataOps is the new DevOps.
Clean, massage, and organize data.
Perform statistics and analysis to develop
insights, build models, and search for
innovative correlations.
Match architecture
with business needs.
Develop processes
for data modeling,
mining, and
pipelines.
Improve data
reliability and quality.
Prepare data for
predictive models.
Explore data to find
hidden gems and
patterns.
Tells stories to key
stakeholders.
CONFIDENTIAL © 2018
Adapted from: https://www.datacamp.com/community/blog/data-scientist-vs-data-engineer
DATA
Engineer
DATA
Scientist
SQL
CONFIDENTIAL © 2019
Difference between machine learning and AI:
If it is written in Python, 

it’s probably machine learning
If it is written in PowerPoint, 

it’s probably AI
-Curt Simon Harlinghausen
CONFIDENTIAL © 2019
IBM’s communities and CODAIT
• IBM’s investment is not limited to products
• CODAIT (formerly Spark Technology
Center)
• IBM Communities
CONFIDENTIAL © 2019
Key takeaways
• IBM contributed to building a new kind of Operating System.
• IBM builds its new generation of data products on this
Operating System.
• Share the love.
• Use Java.
CONFIDENTIAL © 2019
Going even further
Spark in Action (MEAP)
by Jean Georges Perrin (@jgperrin)
published by Manning
http://jgp.net/sia
sprkact-8D74
sprkact-2C72 ctwthink19
One two free books
40% off
CONFIDENTIAL © 2019
Links
• Apache Spark
• http://spark.apache.org
• Spark in Action, 2e
• http://jgp.net/sia
• IBM Products
• https://dataplatform.cloud.ibm.com/docs/content/catalog/overview-wkc.html
• https://www.ibm.com/support/knowledgecenter/en/SSZJPZ_11.7.0/com.ibm.swg.im.iis.ds.fd.doc/topics/
t_config_spark.html
• https://www.ibm.com/support/knowledgecenter/en/SSZJPZ_11.7.0/com.ibm.swg.im.iis.ia.administer.doc/topics/
t_spark_job.html
• https://dataplatform.cloud.ibm.com/docs/content/catalog/overview-wkc.html
• https://www.ibm.com/products/db2-event-store
• https://www.ibm.com/analytics/cloud-private-for-data
• https://developer.ibm.com/open/projects/spark-bench/, https://research.spec.org/fileadmin/user_upload/
documents/wg_bd/BD-20150401-spark_benchmark-v1.3-spec.pdf
• IBM Center for Open-Source Data & AI Technologies (Spark Technology Center)
• https://developer.ibm.com/code/open/centers/codait/about/

More Related Content

Similar to Why i love Apache Spark?

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu
 

Similar to Why i love Apache Spark? (20)

Opportunities and Pitfalls of Prototyping with Artificial Intelligence berl...
Opportunities and Pitfalls of Prototyping with Artificial Intelligence   berl...Opportunities and Pitfalls of Prototyping with Artificial Intelligence   berl...
Opportunities and Pitfalls of Prototyping with Artificial Intelligence berl...
 
Your Data Nerd Friends Need You!
Your Data Nerd Friends Need You!Your Data Nerd Friends Need You!
Your Data Nerd Friends Need You!
 
Visualisation And Reporting: where the Analytics tyres hit the road
Visualisation And Reporting: where the Analytics tyres hit the roadVisualisation And Reporting: where the Analytics tyres hit the road
Visualisation And Reporting: where the Analytics tyres hit the road
 
Augmented OLAP for Big Data
Augmented OLAP for Big DataAugmented OLAP for Big Data
Augmented OLAP for Big Data
 
Augmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big DataAugmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big Data
 
Rage WITH the machine, not against it: Machine learning for Event Management
Rage WITH the machine, not against it: Machine learning for Event ManagementRage WITH the machine, not against it: Machine learning for Event Management
Rage WITH the machine, not against it: Machine learning for Event Management
 
Role of Data in Digital Transformation
Role of Data in Digital TransformationRole of Data in Digital Transformation
Role of Data in Digital Transformation
 
Data and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationData and its Role in Your Digital Transformation
Data and its Role in Your Digital Transformation
 
ODSC May 2019 - The DataOps Manifesto
ODSC May 2019 - The DataOps ManifestoODSC May 2019 - The DataOps Manifesto
ODSC May 2019 - The DataOps Manifesto
 
06 summary
06 summary06 summary
06 summary
 
Auto AI : AI used to create AI applications
Auto AI : AI used to create AI applicationsAuto AI : AI used to create AI applications
Auto AI : AI used to create AI applications
 
IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
 
Splunk Artificial Intelligence & Machine Learning Webinar
Splunk Artificial Intelligence & Machine Learning WebinarSplunk Artificial Intelligence & Machine Learning Webinar
Splunk Artificial Intelligence & Machine Learning Webinar
 
Washington DC DataOps Meetup -- Nov 2019
Washington DC DataOps Meetup   -- Nov 2019Washington DC DataOps Meetup   -- Nov 2019
Washington DC DataOps Meetup -- Nov 2019
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and Governance
 
seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019
 
The 10 Best Data Analytics And BI Platforms And Tools In 2020
The 10 Best Data Analytics And BI Platforms And Tools In 2020The 10 Best Data Analytics And BI Platforms And Tools In 2020
The 10 Best Data Analytics And BI Platforms And Tools In 2020
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
 
Data Analytics for Finance
Data Analytics for FinanceData Analytics for Finance
Data Analytics for Finance
 
Big Data Careers
Big Data CareersBig Data Careers
Big Data Careers
 

More from Jean-Georges Perrin

More from Jean-Georges Perrin (20)

It's painful how much data rules the world
It's painful how much data rules the worldIt's painful how much data rules the world
It's painful how much data rules the world
 
Apache Spark v3.0.0
Apache Spark v3.0.0Apache Spark v3.0.0
Apache Spark v3.0.0
 
Big data made easy with a Spark
Big data made easy with a SparkBig data made easy with a Spark
Big data made easy with a Spark
 
Big Data made easy with a Spark
Big Data made easy with a SparkBig Data made easy with a Spark
Big Data made easy with a Spark
 
The road to AI is paved with pragmatic intentions
The road to AI is paved with pragmatic intentionsThe road to AI is paved with pragmatic intentions
The road to AI is paved with pragmatic intentions
 
Spark Summit Europe Wrap Up and TASM State of the Community
Spark Summit Europe Wrap Up and TASM State of the CommunitySpark Summit Europe Wrap Up and TASM State of the Community
Spark Summit Europe Wrap Up and TASM State of the Community
 
Spark hands-on tutorial (rev. 002)
Spark hands-on tutorial (rev. 002)Spark hands-on tutorial (rev. 002)
Spark hands-on tutorial (rev. 002)
 
Spark Summit 2017 - A feedback for TASM
Spark Summit 2017 - A feedback for TASMSpark Summit 2017 - A feedback for TASM
Spark Summit 2017 - A feedback for TASM
 
HTML (or how the web got started)
HTML (or how the web got started)HTML (or how the web got started)
HTML (or how the web got started)
 
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...
 
Vision stratégique de l'utilisation de l'(Open)Data dans l'entreprise
Vision stratégique de l'utilisation de l'(Open)Data dans l'entrepriseVision stratégique de l'utilisation de l'(Open)Data dans l'entreprise
Vision stratégique de l'utilisation de l'(Open)Data dans l'entreprise
 
Informix is not for legacy applications
Informix is not for legacy applicationsInformix is not for legacy applications
Informix is not for legacy applications
 
Vendre des produits techniques
Vendre des produits techniquesVendre des produits techniques
Vendre des produits techniques
 
Vendre plus sur le web
Vendre plus sur le webVendre plus sur le web
Vendre plus sur le web
 
Vendre plus sur le Web
Vendre plus sur le WebVendre plus sur le Web
Vendre plus sur le Web
 
GreenIvory : products and services
GreenIvory : products and servicesGreenIvory : products and services
GreenIvory : products and services
 
GreenIvory : produits & services
GreenIvory : produits & servicesGreenIvory : produits & services
GreenIvory : produits & services
 
A la découverte des nouvelles tendances du web (Mulhouse Edition)
A la découverte des nouvelles tendances du web (Mulhouse Edition)A la découverte des nouvelles tendances du web (Mulhouse Edition)
A la découverte des nouvelles tendances du web (Mulhouse Edition)
 
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvoryMashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory
 
MashupXFeed et le référencement - Workshop Activis - Greenivory
MashupXFeed et le référencement - Workshop Activis - GreenivoryMashupXFeed et le référencement - Workshop Activis - Greenivory
MashupXFeed et le référencement - Workshop Activis - Greenivory
 

Recently uploaded

Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
masabamasaba
 

Recently uploaded (20)

Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 

Why i love Apache Spark?

  • 1. CONFIDENTIAL © 2019 Why I love Spark
 Jean Georges “JG" Perrin February 11th 2019 v100
  • 2. CONFIDENTIAL © 2019 Why I love Spark
 (and all it does for IBM products) Jean Georges “JG" Perrin February 11th 2019 v100
  • 3. CONFIDENTIAL © 2019 JGP • Jean Georges Perrin • @jgperrin • Chapel Hill, NC • I ! SW since 1983 • #Knowledge = 
 𝑓 ( ∑ (#SmallData, #BigData), #DataScience)
 & #Software  • #IBMChampion x11 • #KeepLearning • @ http://jgp.net
  • 6. CONFIDENTIAL © 2019 An analytics operating system? Hardware OS Apps
  • 7. CONFIDENTIAL © 2019 An analytics operating system? Hardware OS Apps HardwareHardware OS OS
  • 8. CONFIDENTIAL © 2019 An analytics operating system? Hardware OS Apps HardwareHardware OS OS Apps
  • 9. CONFIDENTIAL © 2019 Apps Analytics Distrib. An analytics operating system? Hardware OS Apps HardwareHardware OS OS
  • 10. CONFIDENTIAL © 2019 Apps Analytics Distrib. An analytics operating system? Hardware OS Apps HardwareHardware OS OS HardwareHardware OS OS
  • 11. CONFIDENTIAL © 2019 Apps Analytics Distrib. An analytics operating system? Hardware OS Apps HardwareHardware OS OS Distributed OS Analytics OS HardwareHardware OS OS
  • 12. CONFIDENTIAL © 2019 Apps Analytics Distrib. An analytics operating system? Hardware OS Apps HardwareHardware OS OS Distributed OS Analytics OS Apps HardwareHardware OS OS
  • 13. CONFIDENTIAL © 2019 An analytics operating system? HardwareHardware OS OS Distributed OS Analytics OS Apps {
  • 14. CONFIDENTIAL © 2019 An analytics operating system? HardwareHardware OS OS Distributed OS Analytics OS Apps {
  • 15. CONFIDENTIAL © 2019 An analytics operating system? HardwareHardware OS OS Distributed OS Analytics OS Apps {
  • 16. CONFIDENTIAL © 2019 There are two kinds of data scientists: 1) Those who can extrapolate from incomplete data. -The Internet
  • 17. CONFIDENTIAL © 2019 Unified API Data Science Data Engineering InfoSphere Information AnalyzerDb2 Event Store Watson Knowledge CatalogWatson Data Studio DataStage Flow Designer… Watson Knowledge Catalog Cloud Private for Data … SparkBench What kind of applications?
  • 18. CONFIDENTIAL © 2018 DATA Engineer DATA Scientist Adapted from: https://www.datacamp.com/community/blog/data-scientist-vs-data-engineer Develop, build, test, and operationalize datastores and large-scale processing systems. DataOps is the new DevOps. Clean, massage, and organize data. Perform statistics and analysis to develop insights, build models, and search for innovative correlations. Match architecture with business needs. Develop processes for data modeling, mining, and pipelines. Improve data reliability and quality. Prepare data for predictive models. Explore data to find hidden gems and patterns. Tells stories to key stakeholders.
  • 19. CONFIDENTIAL © 2018 Adapted from: https://www.datacamp.com/community/blog/data-scientist-vs-data-engineer DATA Engineer DATA Scientist SQL
  • 20. CONFIDENTIAL © 2019 Difference between machine learning and AI: If it is written in Python, 
 it’s probably machine learning If it is written in PowerPoint, 
 it’s probably AI -Curt Simon Harlinghausen
  • 21. CONFIDENTIAL © 2019 IBM’s communities and CODAIT • IBM’s investment is not limited to products • CODAIT (formerly Spark Technology Center) • IBM Communities
  • 22. CONFIDENTIAL © 2019 Key takeaways • IBM contributed to building a new kind of Operating System. • IBM builds its new generation of data products on this Operating System. • Share the love. • Use Java.
  • 23. CONFIDENTIAL © 2019 Going even further Spark in Action (MEAP) by Jean Georges Perrin (@jgperrin) published by Manning http://jgp.net/sia sprkact-8D74 sprkact-2C72 ctwthink19 One two free books 40% off
  • 24. CONFIDENTIAL © 2019 Links • Apache Spark • http://spark.apache.org • Spark in Action, 2e • http://jgp.net/sia • IBM Products • https://dataplatform.cloud.ibm.com/docs/content/catalog/overview-wkc.html • https://www.ibm.com/support/knowledgecenter/en/SSZJPZ_11.7.0/com.ibm.swg.im.iis.ds.fd.doc/topics/ t_config_spark.html • https://www.ibm.com/support/knowledgecenter/en/SSZJPZ_11.7.0/com.ibm.swg.im.iis.ia.administer.doc/topics/ t_spark_job.html • https://dataplatform.cloud.ibm.com/docs/content/catalog/overview-wkc.html • https://www.ibm.com/products/db2-event-store • https://www.ibm.com/analytics/cloud-private-for-data • https://developer.ibm.com/open/projects/spark-bench/, https://research.spec.org/fileadmin/user_upload/ documents/wg_bd/BD-20150401-spark_benchmark-v1.3-spec.pdf • IBM Center for Open-Source Data & AI Technologies (Spark Technology Center) • https://developer.ibm.com/code/open/centers/codait/about/