SlideShare a Scribd company logo
1 of 31
Download to read offline
The DOME 64-bit µServer and its Ecosystem
Ronald P. Luijten – Data Motion Architect
lui@zurich.ibm.com
IBM Research - Zurich
3 July 2014
DISCLAIMER: This presentation is entirely Ronald’s view and not necessarily that of IBM.
Definition
µServer:
The integration of an entire server node motherboard* into a
single microchip except DRAM, Nor-boot flash and power
conversion logic.
•2
305mm
245mm
133mmx55mm
* no graphics
This does NOT imply low performance!
© 2012 IBM Corporation
•3
SKA (Square Kilometer Array) to measure Big Bang
Picture source: NZZ march 2014
Big
Bang
Inflation Protons
created
Start of
nucleosynthesis
through fusion
End of
nucleosynthesis Modern Universe
0 10-32s 10-6s 0.01s 3min 380’000 years 13.8 Billion years
© 2012 IBM Corporation
Up to 2 Million+ Antenna’s
What does this mean?
•4
SKA: Largest Radio-astronomy antenna
Big data on Steroids
~ 10 Pb/s
86’400 sec/day
10..14 ExaByte/day
??
~ 1 PB/Day.
Prelim. Spec. SKA, R.T. Schilizzi et al. 2007 / Chr. Broekema
??
Science Data ProcCentral Signal Proc
•5
© 2012 IBM Corporation© 2014 IBM Corporation
~ 10 Pb/s
86’400 sec/day
10..14 ExaByte/day
??
~ 1 PB/Day.
330 disks/day
120’000 disks/yr
??
Top-500 Supercomputing(11/2013)…. 0.3Watt/Gflop/s
Today’s industry focus is 1 Eflop @ 20MW. (2018)
( 0.02 Gflop/s)
Most recent data from SKA:
CSP….max. power 7.5MW
SDP….max. power 1 MW
Latest need for SKA – 4 Exaflop (SKA1 - Mid)
1.2GW…80MW
Too easy (for us)
Too hard
Moore’s law
Factor 80-1200
SDPCSP
•6
multiple breakthroughs needed
•7
•© 2012 IBM Corporation
DOME
DOME Project: 5
Years, 33M Euro
Ronald P. Luijten – HPC User Forum April 2014 •8
•© 2012 IBM Corporation•© 2013 IBM Corporation
•IBM at CeBIT 2013 – Rethink your business
•8
•System Analysis
•Data & Streaming
•Sustainable
(Green) Computing •Nanophotonics
•Computing •Transport •Storage
•Algorithms & Machines
-Nanophotonics
-Real Time
Communications
-Compressive
Sampling
-Microservers
-Accelerators
-Access Patterns
-Student
projects
-Events
-Research
Collaboration
•User
Platform
IBM / ASTRON DOME project
Technology roadmap development
IBM DOME µServer Motivation & Objectives
• Create the worlds highest density 64 bit µ-server drawer
• Useful for both SKA radio-astronomy and IBM future business
– Platform for Business Analytics appliance pre-product research
– “Datacenter in-a-box”
• Very high energy efficiency / very low cost (radioastronomers…)
• Use commodity components only, HW + SW standards
• Leverage ‘free computing’ paradigm
• Enhance with ‘Value Add’: packaging, system integration, …
• Density and speed of light
• Most efficient cooling using IBM technology (ref: SuperMUC TOP500 machine)
• Must be true 64 bit to enable business applications
• Must run server class OS (SLES11 or RHEL6, or equivalent)
– Precluded ARM (64-bit Silicon was not available)
– PPC64 is available in SoC from FSL since 2011
– (I am poor – no $$$ for my own SoC…)
• This is a research project – capability demonstrator only
9
•10
Compute node board form factor
133 mm
30 mmStandard FB-DIMM memory board
133 mm
55 mm
P5020
DRAM DRAM
PSoC
SPI
flash
Power
converter
Power
converter
USB
JTAG
Serial
I2C
Multiple
Ethernet
SDcard SATA
•IBM / ASTRON compute node board diagram
•11
P5020
DRAM DRAM
PSoC
SPI
flash
Power
converter
Power
converter
USB
JTAG
Serial
I2C
Multiple
Ethernet
SDcard SATA
•IBM / ASTRON compute node board diagram
•12
PSOC collapses 6 functions into a small chip
to save Area, Power and Cost
1. On/Off and Power up sequencing
2. Provide uServer boot configuration
3. JTAG debug access
4. Serial port access (Linux)
5. Temperature monitoring and protection
6. Management interface and control
Compute node processor options
•13
FSL SoC parts P5040 T4240
CPU GHz 2.2 1.8
CPUs 4 cores, 1 thread per core 12 cores, 2 threads per core
Primary cache 32 KB I + 32 KB D per core 32 KB I + 32 KB D per core
Secondary cache 512 KB I+D 2 MB per 4 CPUs
L3 cache 1 MB on chip 1.5 MB on chip
Memory 2 x 2 GB, DDR3/L3, ECC 3 x 2 GB, DDR3/L3, ECC
core e5500, ppc64 e6500, ppc64
1 DP FP unit per core 1 DP FP unit per core
128 bit SP altivec unit per core
node 45nm 28nm
TDP 55W 60W
T4240 DIMM connector:
•2 times SATA
•4 times 10 Gigabit ethernet
•SD card interface
•USB interface
•Some power supplies
T4240 SoC block diagram
•14
Hot Water Cooling
Most Energy Efficient solution:
– Low PUE possible (<=1.1) – Green IT
– 40% less energy consumption compared to air-cooled systems
– 90% of waste heat can be reused (CO2 neutral according Kyoto protocol)
– Allows very high density
– Less thermal cycling - improved reliability
– Lower Tj reduces leakage current – further saving energy
SuperMUC HPC machine at LRZ in Germany demonstrates ZRL hot water cooling
– No 4 on June 2012 TOP500 HPC list
SuperMuc
node board
15
•16
Compute node heat spreader
Functions:
• Electrically and thermally connects the compute node to
cooling-power delivery infrastructure
• allows heat removal laterally
• allows main power delivery to the board
•Schematics of board assembly
•Populated processor board
•Heat spreader
•Processor chip
•Power inductors
•Processor PCB
•Memory chips
•Heat spreader •Power delivery contacts
•(rivets)
•Gnd
•Power
•Shield
•capacitors
19” 2U Chassis with Combined
Cooling and Power
128 compute node boards
1536 cores / 3072 Threads
6 TB DRAM
Datacenter-in-a-box
•17
S O F T W A R E
Status (3 July 2014)
•19
• Rev 2 P5020/P5040 boards working
– Uboot is running, Sata works, booted Fedora 17, ppc64
• Rev 1 T4240 board received
– First power on next week
• Power module working (150 Amps)
• Multinode carrier board in bringup
• Water Cooling Thermal Test Vehicle in bringup
• T4240RDB installed (using Rev 2 Chip – Production version)
• P5040 available within DOME user platform (from today, 3 July 2014)
Multinode board
150Amp power board
Thermal Test
Vehicle board
Status (2 april 2014)
•20
Live DB2 demo
Simulates airline reservation transactions running a real data base server
T4240 uServer runs DB2 Server:– remote at swissdutch.ch
Local laptop: exercises the ‘basket of transactions’
(playing the role of customers buying tickets)
21Laptop Internet uServer
Acknowledgements
This work is the results of many people
• Peter v. Ackeren, FSL
• Ed Swarthout, FSL Austin
• Dac Pham, FSL Austin
• Yvonne Chan, IBM Toronto
• Andreas Doering, IBM ZRL
• Tom Wilson, IBM Armonk
• Alessandro Curioni, IBM ZRL
• Stephan Paredes, IBM ZRL
• James Nigel, FSL
• Gary Streber, FSL
• Patricia Sagmeister, IBM ZRL
• Boris Bialek, IBM Toronto
• Marco de Vos, Astron NL
• Hillery Hunter, IBM WRL
• Vipin Patel, IBM Fishkill
• And many more remain unnamed….
Companies: FSL Austin, Belgium & Germany; IBM worldwide; Transfer - NL
22
Potential uServer application areas
Ronald P. Luijten – Data Motion Architect
lui@zurich.ibm.com
IBM Research - Zurich
3 July 2014
Dome µServer properties
:
- Very low cost 64 bit computing and networking
- Fully standards based – no proprietary interfaces
- Ultradense: Watercooled and Aircooled versions
- Very high performance
- Standard Linux
•24
Example 1: Smart Fridge
•25
Example 2: Smart TV
•26
Example 3: Self driving car
•27
Example 4: Avionics
•28
Example X:
Use your own imagination!
•29
•30
Questions???
µServer website: www.swissdutch.ch
•31
Published Conference Papers
• “Parallelism and Data Movement Characterization of contemporary Application
Classes ”, Victoria Caparros Cabezas, Phillip Stanley-Marbell, ACM SPAA 2011, June
2011
• “Quantitative Analysis of the Berkeley Dwarfs' Parallelism and Data Movement
Properties”, Victoria Caparros Cabezas, Phillip Stanley-marbell, ACM CF 2011, May
2011
• “Performance, Power, and Thermal Analysis of Low-Power Processors for Scale-
Out Systems”, Phillip Stanley-Marbell, Victoria Caparros Cabezas, IEEE HPPAC 2011,
May 2011
• “Pinned to the Walls—Impact of Packaging and Application Properties on the
Memory and Power Walls”, Phillip Stanley-Marbell, Victoria Caparros Cabezas,
Ronald P. Luijten, IEEE ISLPED 2011, Aug 2011.
• “The DOME embedded 64 bit microserver demonstrator”, R. Luijten and A.
Doering, ICICDT 2013, Pavia, Italy, May 2013
• “Dual function heat-spreading and performance of the IBM / Astron DOME 64-bit
μServer demonstrator”, R. Luijten , A. Doering and S. Paredes, ICICDT 2014, Austin
Tx, May 2014

More Related Content

What's hot

A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networksinside-BigData.com
 
Deep Learning on the SaturnV Cluster
Deep Learning on the SaturnV ClusterDeep Learning on the SaturnV Cluster
Deep Learning on the SaturnV Clusterinside-BigData.com
 
Expectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchExpectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchRyousei Takano
 
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudRyousei Takano
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERinside-BigData.com
 
Stig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputerStig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputerDanny Abukalam
 
Programmable Exascale Supercomputer
Programmable Exascale SupercomputerProgrammable Exascale Supercomputer
Programmable Exascale SupercomputerSagar Dolas
 
HPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCHPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCRyousei Takano
 
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraFlow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraRyousei Takano
 
NNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for SupercomputingNNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for Supercomputinginside-BigData.com
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stackinside-BigData.com
 
IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告Ryousei Takano
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)inside-BigData.com
 
A Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei EnterpriseA Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei Enterpriseinside-BigData.com
 
High performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspectiveHigh performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspectiveJason Shih
 

What's hot (20)

A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
 
SGI HPC DAY 2011 Kiev
SGI HPC DAY 2011 KievSGI HPC DAY 2011 Kiev
SGI HPC DAY 2011 Kiev
 
Exascale Capabl
Exascale CapablExascale Capabl
Exascale Capabl
 
POWER10 innovations for HPC
POWER10 innovations for HPCPOWER10 innovations for HPC
POWER10 innovations for HPC
 
Deep Learning on the SaturnV Cluster
Deep Learning on the SaturnV ClusterDeep Learning on the SaturnV Cluster
Deep Learning on the SaturnV Cluster
 
Expectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchExpectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software research
 
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC Cloud
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWER
 
Stig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputerStig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputer
 
Programmable Exascale Supercomputer
Programmable Exascale SupercomputerProgrammable Exascale Supercomputer
Programmable Exascale Supercomputer
 
HPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCHPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPC
 
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraFlow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
 
Sierra overview
Sierra overviewSierra overview
Sierra overview
 
NNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for SupercomputingNNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for Supercomputing
 
OpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software StackOpenHPC: A Comprehensive System Software Stack
OpenHPC: A Comprehensive System Software Stack
 
IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告
 
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
Microsoft Project Olympus AI Accelerator Chassis (HGX-1)
 
A Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei EnterpriseA Fresh Look at HPC from Huawei Enterprise
A Fresh Look at HPC from Huawei Enterprise
 
High performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspectiveHigh performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspective
 

Similar to IBM and ASTRON 64bit μServer for DOME

02 computer evolution and performance.ppt [compatibility mode]
02 computer evolution and performance.ppt [compatibility mode]02 computer evolution and performance.ppt [compatibility mode]
02 computer evolution and performance.ppt [compatibility mode]bogi007
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performanceSher Shah Merkhel
 
02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).pptShaistaRiaz4
 
02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).pptShaistaRiaz4
 
Computer Evolution.ppt
Computer Evolution.pptComputer Evolution.ppt
Computer Evolution.pptVivekTrial
 
AI Accelerators for Cloud Datacenters
AI Accelerators for Cloud DatacentersAI Accelerators for Cloud Datacenters
AI Accelerators for Cloud DatacentersCastLabKAIST
 
Advanced Computer Architecture
Advanced Computer ArchitectureAdvanced Computer Architecture
Advanced Computer Architecturenibiganesh
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaFacultad de Informática UCM
 
Design installation-commissioning-red raider-cluster-ttu
Design installation-commissioning-red raider-cluster-ttuDesign installation-commissioning-red raider-cluster-ttu
Design installation-commissioning-red raider-cluster-ttuAlan Sill
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...BigDataEverywhere
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
System On Chip (SOC)
System On Chip (SOC)System On Chip (SOC)
System On Chip (SOC)Shivam Gupta
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsHPCC Systems
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIRyousei Takano
 

Similar to IBM and ASTRON 64bit μServer for DOME (20)

Micro controller & Micro processor
Micro controller & Micro processorMicro controller & Micro processor
Micro controller & Micro processor
 
02 computer evolution and performance.ppt [compatibility mode]
02 computer evolution and performance.ppt [compatibility mode]02 computer evolution and performance.ppt [compatibility mode]
02 computer evolution and performance.ppt [compatibility mode]
 
Current Trends in HPC
Current Trends in HPCCurrent Trends in HPC
Current Trends in HPC
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performance
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt
 
02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt02_Computer-Evolution(1).ppt
02_Computer-Evolution(1).ppt
 
NSCC Training Introductory Class
NSCC Training Introductory Class NSCC Training Introductory Class
NSCC Training Introductory Class
 
Computer Evolution.ppt
Computer Evolution.pptComputer Evolution.ppt
Computer Evolution.ppt
 
AI Accelerators for Cloud Datacenters
AI Accelerators for Cloud DatacentersAI Accelerators for Cloud Datacenters
AI Accelerators for Cloud Datacenters
 
DOME 64-bit μDataCenter
DOME 64-bit μDataCenterDOME 64-bit μDataCenter
DOME 64-bit μDataCenter
 
Advanced Computer Architecture
Advanced Computer ArchitectureAdvanced Computer Architecture
Advanced Computer Architecture
 
Barcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de RiquezaBarcelona Supercomputing Center, Generador de Riqueza
Barcelona Supercomputing Center, Generador de Riqueza
 
Design installation-commissioning-red raider-cluster-ttu
Design installation-commissioning-red raider-cluster-ttuDesign installation-commissioning-red raider-cluster-ttu
Design installation-commissioning-red raider-cluster-ttu
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
System On Chip (SOC)
System On Chip (SOC)System On Chip (SOC)
System On Chip (SOC)
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCI
 

More from IBM Research

IBM Research - Zurich Celebrates 60 Years of Science and Innovation
IBM Research - Zurich Celebrates 60 Years of Science and InnovationIBM Research - Zurich Celebrates 60 Years of Science and Innovation
IBM Research - Zurich Celebrates 60 Years of Science and InnovationIBM Research
 
The Dilemmas of Innovation Management
The Dilemmas of Innovation ManagementThe Dilemmas of Innovation Management
The Dilemmas of Innovation ManagementIBM Research
 
A Prototype Storage Subsystem based on Phase Change Memory
A Prototype Storage Subsystem based on Phase Change MemoryA Prototype Storage Subsystem based on Phase Change Memory
A Prototype Storage Subsystem based on Phase Change MemoryIBM Research
 
Big Data and the Future of Storage
Big Data and the Future of StorageBig Data and the Future of Storage
Big Data and the Future of StorageIBM Research
 
The New Era of Cognitive Computing
The New Era of Cognitive ComputingThe New Era of Cognitive Computing
The New Era of Cognitive ComputingIBM Research
 
Das IBM Forschungslabor als Arbeitgeber
Das IBM Forschungslabor als ArbeitgeberDas IBM Forschungslabor als Arbeitgeber
Das IBM Forschungslabor als ArbeitgeberIBM Research
 
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - Zurich
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - ZurichNano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - Zurich
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - ZurichIBM Research
 
Dechema Conference: Istanbul
Dechema Conference: IstanbulDechema Conference: Istanbul
Dechema Conference: IstanbulIBM Research
 
IBM Research: IBM 2010 Investor Briefing
IBM Research: IBM 2010 Investor BriefingIBM Research: IBM 2010 Investor Briefing
IBM Research: IBM 2010 Investor BriefingIBM Research
 

More from IBM Research (10)

IBM Research - Zurich Celebrates 60 Years of Science and Innovation
IBM Research - Zurich Celebrates 60 Years of Science and InnovationIBM Research - Zurich Celebrates 60 Years of Science and Innovation
IBM Research - Zurich Celebrates 60 Years of Science and Innovation
 
The Dilemmas of Innovation Management
The Dilemmas of Innovation ManagementThe Dilemmas of Innovation Management
The Dilemmas of Innovation Management
 
A Prototype Storage Subsystem based on Phase Change Memory
A Prototype Storage Subsystem based on Phase Change MemoryA Prototype Storage Subsystem based on Phase Change Memory
A Prototype Storage Subsystem based on Phase Change Memory
 
Big Data and the Future of Storage
Big Data and the Future of StorageBig Data and the Future of Storage
Big Data and the Future of Storage
 
The New Era of Cognitive Computing
The New Era of Cognitive ComputingThe New Era of Cognitive Computing
The New Era of Cognitive Computing
 
Das IBM Forschungslabor als Arbeitgeber
Das IBM Forschungslabor als ArbeitgeberDas IBM Forschungslabor als Arbeitgeber
Das IBM Forschungslabor als Arbeitgeber
 
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - Zurich
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - ZurichNano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - Zurich
Nano, SuperMUC and Photovoltaics:A Day in the Life of IBM Research - Zurich
 
Meet IBM Research
Meet IBM ResearchMeet IBM Research
Meet IBM Research
 
Dechema Conference: Istanbul
Dechema Conference: IstanbulDechema Conference: Istanbul
Dechema Conference: Istanbul
 
IBM Research: IBM 2010 Investor Briefing
IBM Research: IBM 2010 Investor BriefingIBM Research: IBM 2010 Investor Briefing
IBM Research: IBM 2010 Investor Briefing
 

Recently uploaded

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 

Recently uploaded (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 

IBM and ASTRON 64bit μServer for DOME

  • 1. The DOME 64-bit µServer and its Ecosystem Ronald P. Luijten – Data Motion Architect lui@zurich.ibm.com IBM Research - Zurich 3 July 2014 DISCLAIMER: This presentation is entirely Ronald’s view and not necessarily that of IBM.
  • 2. Definition µServer: The integration of an entire server node motherboard* into a single microchip except DRAM, Nor-boot flash and power conversion logic. •2 305mm 245mm 133mmx55mm * no graphics This does NOT imply low performance!
  • 3. © 2012 IBM Corporation •3 SKA (Square Kilometer Array) to measure Big Bang Picture source: NZZ march 2014 Big Bang Inflation Protons created Start of nucleosynthesis through fusion End of nucleosynthesis Modern Universe 0 10-32s 10-6s 0.01s 3min 380’000 years 13.8 Billion years
  • 4. © 2012 IBM Corporation Up to 2 Million+ Antenna’s What does this mean? •4 SKA: Largest Radio-astronomy antenna Big data on Steroids
  • 5. ~ 10 Pb/s 86’400 sec/day 10..14 ExaByte/day ?? ~ 1 PB/Day. Prelim. Spec. SKA, R.T. Schilizzi et al. 2007 / Chr. Broekema ?? Science Data ProcCentral Signal Proc •5
  • 6. © 2012 IBM Corporation© 2014 IBM Corporation ~ 10 Pb/s 86’400 sec/day 10..14 ExaByte/day ?? ~ 1 PB/Day. 330 disks/day 120’000 disks/yr ?? Top-500 Supercomputing(11/2013)…. 0.3Watt/Gflop/s Today’s industry focus is 1 Eflop @ 20MW. (2018) ( 0.02 Gflop/s) Most recent data from SKA: CSP….max. power 7.5MW SDP….max. power 1 MW Latest need for SKA – 4 Exaflop (SKA1 - Mid) 1.2GW…80MW Too easy (for us) Too hard Moore’s law Factor 80-1200 SDPCSP •6 multiple breakthroughs needed
  • 7. •7 •© 2012 IBM Corporation DOME
  • 8. DOME Project: 5 Years, 33M Euro Ronald P. Luijten – HPC User Forum April 2014 •8 •© 2012 IBM Corporation•© 2013 IBM Corporation •IBM at CeBIT 2013 – Rethink your business •8 •System Analysis •Data & Streaming •Sustainable (Green) Computing •Nanophotonics •Computing •Transport •Storage •Algorithms & Machines -Nanophotonics -Real Time Communications -Compressive Sampling -Microservers -Accelerators -Access Patterns -Student projects -Events -Research Collaboration •User Platform IBM / ASTRON DOME project Technology roadmap development
  • 9. IBM DOME µServer Motivation & Objectives • Create the worlds highest density 64 bit µ-server drawer • Useful for both SKA radio-astronomy and IBM future business – Platform for Business Analytics appliance pre-product research – “Datacenter in-a-box” • Very high energy efficiency / very low cost (radioastronomers…) • Use commodity components only, HW + SW standards • Leverage ‘free computing’ paradigm • Enhance with ‘Value Add’: packaging, system integration, … • Density and speed of light • Most efficient cooling using IBM technology (ref: SuperMUC TOP500 machine) • Must be true 64 bit to enable business applications • Must run server class OS (SLES11 or RHEL6, or equivalent) – Precluded ARM (64-bit Silicon was not available) – PPC64 is available in SoC from FSL since 2011 – (I am poor – no $$$ for my own SoC…) • This is a research project – capability demonstrator only 9
  • 10. •10 Compute node board form factor 133 mm 30 mmStandard FB-DIMM memory board 133 mm 55 mm
  • 12. P5020 DRAM DRAM PSoC SPI flash Power converter Power converter USB JTAG Serial I2C Multiple Ethernet SDcard SATA •IBM / ASTRON compute node board diagram •12 PSOC collapses 6 functions into a small chip to save Area, Power and Cost 1. On/Off and Power up sequencing 2. Provide uServer boot configuration 3. JTAG debug access 4. Serial port access (Linux) 5. Temperature monitoring and protection 6. Management interface and control
  • 13. Compute node processor options •13 FSL SoC parts P5040 T4240 CPU GHz 2.2 1.8 CPUs 4 cores, 1 thread per core 12 cores, 2 threads per core Primary cache 32 KB I + 32 KB D per core 32 KB I + 32 KB D per core Secondary cache 512 KB I+D 2 MB per 4 CPUs L3 cache 1 MB on chip 1.5 MB on chip Memory 2 x 2 GB, DDR3/L3, ECC 3 x 2 GB, DDR3/L3, ECC core e5500, ppc64 e6500, ppc64 1 DP FP unit per core 1 DP FP unit per core 128 bit SP altivec unit per core node 45nm 28nm TDP 55W 60W T4240 DIMM connector: •2 times SATA •4 times 10 Gigabit ethernet •SD card interface •USB interface •Some power supplies
  • 14. T4240 SoC block diagram •14
  • 15. Hot Water Cooling Most Energy Efficient solution: – Low PUE possible (<=1.1) – Green IT – 40% less energy consumption compared to air-cooled systems – 90% of waste heat can be reused (CO2 neutral according Kyoto protocol) – Allows very high density – Less thermal cycling - improved reliability – Lower Tj reduces leakage current – further saving energy SuperMUC HPC machine at LRZ in Germany demonstrates ZRL hot water cooling – No 4 on June 2012 TOP500 HPC list SuperMuc node board 15
  • 16. •16 Compute node heat spreader Functions: • Electrically and thermally connects the compute node to cooling-power delivery infrastructure • allows heat removal laterally • allows main power delivery to the board •Schematics of board assembly •Populated processor board •Heat spreader •Processor chip •Power inductors •Processor PCB •Memory chips •Heat spreader •Power delivery contacts •(rivets) •Gnd •Power •Shield •capacitors
  • 17. 19” 2U Chassis with Combined Cooling and Power 128 compute node boards 1536 cores / 3072 Threads 6 TB DRAM Datacenter-in-a-box •17
  • 18. S O F T W A R E
  • 19. Status (3 July 2014) •19 • Rev 2 P5020/P5040 boards working – Uboot is running, Sata works, booted Fedora 17, ppc64 • Rev 1 T4240 board received – First power on next week • Power module working (150 Amps) • Multinode carrier board in bringup • Water Cooling Thermal Test Vehicle in bringup • T4240RDB installed (using Rev 2 Chip – Production version) • P5040 available within DOME user platform (from today, 3 July 2014) Multinode board 150Amp power board Thermal Test Vehicle board
  • 20. Status (2 april 2014) •20
  • 21. Live DB2 demo Simulates airline reservation transactions running a real data base server T4240 uServer runs DB2 Server:– remote at swissdutch.ch Local laptop: exercises the ‘basket of transactions’ (playing the role of customers buying tickets) 21Laptop Internet uServer
  • 22. Acknowledgements This work is the results of many people • Peter v. Ackeren, FSL • Ed Swarthout, FSL Austin • Dac Pham, FSL Austin • Yvonne Chan, IBM Toronto • Andreas Doering, IBM ZRL • Tom Wilson, IBM Armonk • Alessandro Curioni, IBM ZRL • Stephan Paredes, IBM ZRL • James Nigel, FSL • Gary Streber, FSL • Patricia Sagmeister, IBM ZRL • Boris Bialek, IBM Toronto • Marco de Vos, Astron NL • Hillery Hunter, IBM WRL • Vipin Patel, IBM Fishkill • And many more remain unnamed…. Companies: FSL Austin, Belgium & Germany; IBM worldwide; Transfer - NL 22
  • 23. Potential uServer application areas Ronald P. Luijten – Data Motion Architect lui@zurich.ibm.com IBM Research - Zurich 3 July 2014
  • 24. Dome µServer properties : - Very low cost 64 bit computing and networking - Fully standards based – no proprietary interfaces - Ultradense: Watercooled and Aircooled versions - Very high performance - Standard Linux •24
  • 25. Example 1: Smart Fridge •25
  • 26. Example 2: Smart TV •26
  • 27. Example 3: Self driving car •27
  • 29. Example X: Use your own imagination! •29
  • 31. •31 Published Conference Papers • “Parallelism and Data Movement Characterization of contemporary Application Classes ”, Victoria Caparros Cabezas, Phillip Stanley-Marbell, ACM SPAA 2011, June 2011 • “Quantitative Analysis of the Berkeley Dwarfs' Parallelism and Data Movement Properties”, Victoria Caparros Cabezas, Phillip Stanley-marbell, ACM CF 2011, May 2011 • “Performance, Power, and Thermal Analysis of Low-Power Processors for Scale- Out Systems”, Phillip Stanley-Marbell, Victoria Caparros Cabezas, IEEE HPPAC 2011, May 2011 • “Pinned to the Walls—Impact of Packaging and Application Properties on the Memory and Power Walls”, Phillip Stanley-Marbell, Victoria Caparros Cabezas, Ronald P. Luijten, IEEE ISLPED 2011, Aug 2011. • “The DOME embedded 64 bit microserver demonstrator”, R. Luijten and A. Doering, ICICDT 2013, Pavia, Italy, May 2013 • “Dual function heat-spreading and performance of the IBM / Astron DOME 64-bit μServer demonstrator”, R. Luijten , A. Doering and S. Paredes, ICICDT 2014, Austin Tx, May 2014