

The OPTIRAD Platform: Cloud-hosted IPython Notebooks for collaborative EO Data Analysis and Processing

  • 1. The OPTIRAD Platform: Cloud-hosted IPython Notebooks for collaborative EO Data Analysis and Processing
      ESA EO Open Science 2.0 Conference, 12-14 October 2015
      Philip Kershaw (CEDA), John Holt (Tessella plc.), José Gómez-Dans, Philip Lewis (UCL), Nicola Pounder, Jon Styles (Assimila Ltd.)
      JASMIN (STFC/Stephen Kill)
  • 2. Introduction
      •  OPTIRAD = OPTImisation environment for joint retrieval of multi-sensor RADiances
          –  Collaboration: CEDA, UCL, Assimila Ltd, FastOpt and VU Amsterdam
          –  Funded by ESA
      •  Overview of technical solution
          –  Introduction to IPython (Jupyter) Notebook
          –  Deployment on JASMIN-CEMS science cloud
      •  Make the case: IPython Notebook + Cloud = powerful combination for EO Open Science 2.0
  • 3. OPTIRAD Goals
      Address the challenge of producing consistent EO land surface information products from heterogeneous EO data input:
      •  Collaboration: provide a collaborative research environment as a means to engender closer working between algorithm specialists, modellers and end users.
      •  Computing resources: processing at high spatial and temporal resolutions with computationally expensive algorithms.
      •  Usability and access: easy execution and development of existing Python code and the provision of interactive tutorials for new users.
  • 4. IPython Notebook
      •  Provides Python kernels accessible via a web browser
      •  Sessions can be saved and shared
      •  Trivial access to parallel processing capabilities – IPython.parallel (ipyparallel)
      •  IPython (Jupyter) Notebook
      •  Support for other languages such as R
      •  New JupyterHub allows multi-user management of notebooks
      •  Gained traction as a teaching and collaborative tool
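The "trivial access to parallel processing" point can be sketched as follows. This is a minimal illustration, not the OPTIRAD code: it assumes the ipyparallel package is installed in the kernel and that a controller with engines is already running (for example started with `ipcluster start`). The names `per_pixel_cost` and `run_map` are hypothetical. If no cluster can be reached, the sketch falls back to a serial loop so that it still runs anywhere.

```python
def per_pixel_cost(reflectance):
    """Hypothetical stand-in for an expensive per-pixel computation."""
    return round((reflectance - 0.3) ** 2, 6)


def run_map(func, values):
    """Apply func to every value, on ipyparallel engines when available."""
    try:
        import ipyparallel as ipp  # assumption: installed in the kernel

        client = ipp.Client()      # connects to the running controller
        view = client[:]           # DirectView over all registered engines
        return list(view.map_sync(func, values))
    except Exception:
        # No ipyparallel or no reachable cluster: same result, computed serially.
        return [func(v) for v in values]


costs = run_map(per_pixel_cost, [0.1, 0.3, 0.5])
print(costs)
```

From the user's side the call looks the same whether the work runs on one kernel or on many engines, which is the property the slide is pointing at.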
  • 5. IPython Notebook + Cloud
      •  Cloud's characteristics:
          –  Broad network access, resource pooling, elasticity, scale – compute and storage
          –  Good fit for Big Data science applications
      •  Cloud-hosted Notebook: a model already demonstrated with public cloud services, e.g.
          –  Wakari, Azure, Rackspace
      •  Central hosting allows central management of software packages
          –  no installation steps needed for the user
      •  Algorithm prototyping environment next to Big Data
          –  Acts as a precursor to operational processing services
  • 6. Notebook: a user – application perspective
      [Diagram labels: Support a spectrum of usage models; Different classes of user; Long-tail of science users →]
  • 7. Design and development considerations
      •  Host on JASMIN-CEMS
          –  Data analysis facility and science cloud at Rutherford Appleton Lab, UK
          –  Advantage of proximity to locally hosted EO and climate science datasets
          –  Integration with environmental sciences community
      •  Lightweight development and deployment philosophy
          –  Build on Open Source and community efforts to use what's already available
      •  How to meet multi-user support requirement?
          –  Buy off-the-shelf: run Wakari on JASMIN-CEMS platform, or
          –  Try JupyterHub: multi-user IPython Notebook solution, or
          –  Roll our own solution
      •  How to integrate parallel processing?
          –  IPython.parallel (ipyparallel) Python API accessed via the Notebook
  • 8. Deployment Architecture
      [Diagram: OPTIRAD JASMIN Cloud Tenancy. Labels recovered from the figure:]
      •  Browser access; Firewall
      •  JupyterHub: manage users and provision of notebooks
      •  Swarm: manages allocation of containers for notebooks
      •  VM: Swarm pool: notebooks and kernels in Docker containers (IPython Notebook, Kernel, Parallel Controller)
      •  Nodes for parallel processing: VM: slave, running Parallel Engines
      •  VM: shared services (NFS, LDAP)
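As a rough illustration of what "Swarm manages allocation of containers for notebooks" means, the sketch below places one notebook container per user on the least-loaded VM in a pool. It is a toy model only: the spread-style rule, the `place_notebooks` function, and the VM and user names are assumptions made for this example. It is not Docker Swarm's scheduler or the OPTIRAD deployment code.

```python
from collections import defaultdict


def place_notebooks(users, vms):
    """Assign each user's notebook container to the VM with the fewest containers."""
    load = {vm: 0 for vm in vms}
    placement = defaultdict(list)
    for user in users:
        # Ties are broken by VM list order (min returns the first minimum).
        target = min(vms, key=lambda vm: load[vm])
        placement[target].append(f"notebook-{user}")
        load[target] += 1
    return dict(placement)


# Hypothetical pool and users, for illustration only.
pool = ["swarm-pool-0", "swarm-pool-1", "swarm-pool-2"]
placement = place_notebooks(["alice", "bob", "carol", "dave"], pool)
for vm in pool:
    print(vm, placement.get(vm, []))
```

The point of the design is that neither the user nor the hub needs to know which VM ends up hosting a given notebook; a scheduler decides, so VMs can be added to the pool without changing the user-facing service.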
  • 9. Conclusions + Next Steps
      •  Experiences from project delivery
          –  Off-the-shelf solution using JupyterHub paid off
          –  JupyterHub and Swarm were new, but installation was straightforward and the service operationally robust
      •  Challenges and future development
          –  Extend use of containers for parallel compute
          –  Challenge: managing cloud elasticity with both containers and host VMs
          –  Provide object storage – Ceph likely to be adopted
          –  Expand from OPTIRAD pilot to wider user community
          –  Deploy with toolboxes, e.g. Sentinels or CIS
  • 10. Demo . . .
      •  A tutorial on EO data assimilation
          –  Notebook blurs the traditional separation between tutorial documentation and using the target system
          –  The two are one self-contained interactive unit :)
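The idea of documentation and runnable code as one unit follows from the notebook file format: a notebook is a JSON document holding an ordered list of Markdown and code cells. The sketch below builds a minimal two-cell notebook by hand using only the standard library. The cell contents, the `make_tutorial_notebook` name and the file name in the comment are illustrative assumptions, and the layout is intended to follow the version 4 notebook format. In practice the nbformat package would normally be used to create and validate notebooks.

```python
import json


def make_tutorial_notebook():
    """Return a minimal notebook (nbformat 4 layout) mixing prose and code."""
    return {
        "nbformat": 4,
        "nbformat_minor": 0,
        "metadata": {},
        "cells": [
            {
                "cell_type": "markdown",
                "metadata": {},
                "source": "# Step 1\nCompute a mean reflectance value.",
            },
            {
                "cell_type": "code",
                "execution_count": None,
                "metadata": {},
                "outputs": [],
                "source": "values = [0.21, 0.25, 0.23]\nsum(values) / len(values)",
            },
        ],
    }


notebook = make_tutorial_notebook()
text = json.dumps(notebook, indent=1)  # the text that would be saved as tutorial.ipynb
cell_types = [cell["cell_type"] for cell in json.loads(text)["cells"]]
print(cell_types)
```

Because the explanation and the code live in the same file, sharing the tutorial means sharing a single document that a reader can both read and execute.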
  • 11. Further information
      •  OPTIRAD:
          –  Optimisation Environment For Joint Retrieval Of Multi-Sensor Radiances (OPTIRAD), Proceedings of the ESA 2014 Conference on Big Data from Space (BiDS'14), http://dx.doi.org/10.2788/1823
      •  JASMIN paper (Sept 2013)
          –  http://home.badc.rl.ac.uk/lawrence/static/2013/10/14/LawEA13_Jasmin.pdf
          –  Cloud paper to follow soon
      •  Cloud-hosted JupyterHub with Docker for teaching:
          –  https://developer.rackspace.com/blog/deploying-jupyterhub-for-education/
      •  JASMIN and CEDA:
          –  http://jasmin.ac.uk/
          –  http://www.ceda.ac.uk
      •  @PhilipJKershaw