SlideShare a Scribd company logo
1 of 25
IBM PureData System
for Analytics
(Formerly known as, IBM Netezza)
- Ravi
www.etraining.guru
info@etraining.guru
IBM Netezza 100
(Skimmer)
Single chassis, single host system
Appliance for data warehouse test and development environments
It’s a non High-Availability (HA) system
Shares the same architecture and software as IBM Netezza 1000
(Twinfin) model
IBM Netezza 1000
(Twinfin)
Purpose built, standards based data warehouse appliance that
architecturally integrates database, server, storage and advanced
analytics capabilities into a single, easy-to-manage system
It supports High Availability (HA)
Fast load speeds: > 4TB/Hour
Fast backup rates: > 4TB/Hour
1:1:1 ratio between disks, CPU’s, FPGA’s
IBM PureData System for Analytics N1001
(Twinfin)
Update to the IBM Netezza 1000 model, with same architecture
On 09/Oct/2012, IBM announced the name changes from TwinFin
(1000) to PureData System for Analytics, as below:
1:1:1 ratio between Disks, CPU’s, FPGA’s
IBM Netezza High Capacity Appliances
(C1000 model)
C1000 models are similar to N1001 systems, but with more storage/rack
Scales to more than 10 petabytes of user data capacity
Each Netezza C1000 rack has one S-Blade chassis that contains 4 S-blades. Each S-blade
has 8 CPU/FPGA processors. Same as N1001
1 Rack = 4 Storage Arrays (or Storage Groups)
1 Storage Array = 1 disk raid controller + 2 disk enclosures
1 disk raid controller = 12 disks; 1 disk enclosure = 12 disks
So, 1 Storage array/group = 12 + (2*12) = 36 disks
So, 1 Rack = 4 Storage array/groups = 4 * 36 disks = 144 TB
Note: 2 spare disks per each storage array
C1000-4:
1 Rack,
1 S-Blade Chassis  8 CPU, 8 FPGA
4 Storage groups  144 TB (8 Spares)
C1000-8:
2 Racks
2 S-Blade Chassis  16 CPU, 16 FPGA
8 Storage groups  288 TB (16 Spares)
Note: There is no 1:1:1 ratio between Disks, CPU’s, FPGA’s
IBM PureData System for Analytics N2001
(Striper)
3x faster analytics performance & 50% more usable capacity per rack
128 GB/Sec scan rate
1:1:1 ration between disks, FPGA engines, and CPU cores doesn’t apply
1 Rack = 7 S-blades + 288 disks
1 S-blade = 16 CPU cores + 16 FPGA engines
288 disks = 240 active disks + 34 spare + 14 used for swap/log space
Note: Each disk size in striper: 600 GB (user space:200GB, Mirror:200GB, Temp: 200GB)
Why no 1:1:1 ratio?
There is a 1:1 ratio between FPGA:CPU.
There is no 1:1 ratio between FPGA and disks because FPGA can
process
the data much faster than the disk can produce.
2 S-blades have 40 disks; 5 S-blades have 32 disks
Striper configurations: ½ rack, 1 rack, 2 rack, or 4 rack.
We no longer have a ¼ rack configuration as we did with the Twinfin
IBM PureData System for Analytics N2002
(Striper)
N2002 is the first hardware lifecycle refresh of the IBM PureData
System for Analytics N200x family of appliances.
What happens when you submit a query?
When the system starts up, 32 or 40 dataslices are assigned to each s-blade
At no point during operation can one s-blade access the data on a dataslice which has been assigned to another s-
blade
If an s-blade fails, the dataslices which were assigned to the failed s-blade will be rebalanced and assigned to the
remaining still operational s-blades. That is exactly the same as the system worked in later releases of NPS on the
TwinFin architecture as well.
There in no attachment of CPUs to disks. The only attachment is that dataslice is assigned to 1 S-blade when the
system starts. CPU and FPGA resources are assigned as they are available.
When a query starts, NPS will start 240 processes, one for each dataslice.
The processes start reading data off disk; As each page of data (128KB) comes off disk, that page gets assigned to
the first available FPGA on that s-blade.
FPGA decompresses and filters data and passes the result back to the process.
The Linux CPU scheduler assigns the process to one of the CPUs which processes the remaining data that came out
of the FPGA.
Once complete, the next 128KB is read off disk and that continues until all of the data has been processed for the
table being scanned.
Find Netezza Model
option1: nz_get_model
[/export/home/nz]$nz_get_model
IBM PureData System for Analytics N2001-010
option2: select * from _t_environ where name like ‘%NPS%’
/export/home/nz->nzsql -c “select * from _t_environ where name like ‘NPS%’;”
NAME | VAL
————–+————
NPS_PLATFORM | xs
NPS_MODEL | P1000X_A_E
NPS_FAMILY | Pseries
Note: Pseries is for Twinfin; Qseries is for Striper
IBM Netezza Appliance Models (By, www.etraining.guru)

More Related Content

Viewers also liked

NENUG Apr14 Talk - data modeling for netezza
NENUG Apr14 Talk - data modeling for netezzaNENUG Apr14 Talk - data modeling for netezza
NENUG Apr14 Talk - data modeling for netezza
Biju Nair
 
Approved IBM Sterling Integrator Trainer Certificate
Approved IBM Sterling Integrator Trainer CertificateApproved IBM Sterling Integrator Trainer Certificate
Approved IBM Sterling Integrator Trainer Certificate
DASHARATHA BUCHHI REDDY
 
High performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspectiveHigh performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspective
Jason Shih
 

Viewers also liked (12)

Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001Ibm pure data system for analytics n3001
Ibm pure data system for analytics n3001
 
Using Netezza Query Plan to Improve Performace
Using Netezza Query Plan to Improve PerformaceUsing Netezza Query Plan to Improve Performace
Using Netezza Query Plan to Improve Performace
 
A Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational AnalyticsA Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational Analytics
 
IBM Netezza - The data warehouse in a big data strategy
IBM Netezza - The data warehouse in a big data strategyIBM Netezza - The data warehouse in a big data strategy
IBM Netezza - The data warehouse in a big data strategy
 
NENUG Apr14 Talk - data modeling for netezza
NENUG Apr14 Talk - data modeling for netezzaNENUG Apr14 Talk - data modeling for netezza
NENUG Apr14 Talk - data modeling for netezza
 
Approved IBM Sterling Integrator Trainer Certificate
Approved IBM Sterling Integrator Trainer CertificateApproved IBM Sterling Integrator Trainer Certificate
Approved IBM Sterling Integrator Trainer Certificate
 
Netezza workload management
Netezza workload managementNetezza workload management
Netezza workload management
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar Database
 
Data power training
Data power trainingData power training
Data power training
 
Column-Stores vs. Row-Stores: How Different are they Really?
Column-Stores vs. Row-Stores: How Different are they Really?Column-Stores vs. Row-Stores: How Different are they Really?
Column-Stores vs. Row-Stores: How Different are they Really?
 
High performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspectiveHigh performance computing - building blocks, production & perspective
High performance computing - building blocks, production & perspective
 
スタートアップ組織づくりの具体策を学ぶ 先生:金子 陽三
スタートアップ組織づくりの具体策を学ぶ 先生:金子 陽三スタートアップ組織づくりの具体策を学ぶ 先生:金子 陽三
スタートアップ組織づくりの具体策を学ぶ 先生:金子 陽三
 

More from Ravikumar Nandigam

Online Training in Unix Linux Shell Scripting in Hyderabad
Online Training in Unix Linux Shell Scripting in HyderabadOnline Training in Unix Linux Shell Scripting in Hyderabad
Online Training in Unix Linux Shell Scripting in Hyderabad
Ravikumar Nandigam
 

More from Ravikumar Nandigam (15)

Online Training in IBM DB2 LUW/UDB DBA in Hyderabad
Online Training in IBM DB2 LUW/UDB DBA in HyderabadOnline Training in IBM DB2 LUW/UDB DBA in Hyderabad
Online Training in IBM DB2 LUW/UDB DBA in Hyderabad
 
Online Training in IBM DB2 LUW/UDB DBA in Hyderabad
Online Training in IBM DB2 LUW/UDB DBA in HyderabadOnline Training in IBM DB2 LUW/UDB DBA in Hyderabad
Online Training in IBM DB2 LUW/UDB DBA in Hyderabad
 
Online Training in Unix Linux Shell Scripting in Hyderabad
Online Training in Unix Linux Shell Scripting in HyderabadOnline Training in Unix Linux Shell Scripting in Hyderabad
Online Training in Unix Linux Shell Scripting in Hyderabad
 
2 db2 instance creation
2 db2 instance creation2 db2 instance creation
2 db2 instance creation
 
2 db2 instance creation
2 db2 instance creation2 db2 instance creation
2 db2 instance creation
 
Managing user Online Training in IBM Netezza DBA Development by www.etraining...
Managing user Online Training in IBM Netezza DBA Development by www.etraining...Managing user Online Training in IBM Netezza DBA Development by www.etraining...
Managing user Online Training in IBM Netezza DBA Development by www.etraining...
 
IBM DB2 LUW UDB DBA Training by www.etraining.guru
IBM DB2 LUW UDB DBA Training by www.etraining.guruIBM DB2 LUW UDB DBA Training by www.etraining.guru
IBM DB2 LUW UDB DBA Training by www.etraining.guru
 
online training for IBM DB2 LUW UDB DBA
online training for IBM DB2 LUW UDB DBAonline training for IBM DB2 LUW UDB DBA
online training for IBM DB2 LUW UDB DBA
 
Course content (netezza dba)
Course content (netezza dba)Course content (netezza dba)
Course content (netezza dba)
 
IBM DB2 LUW UDB DBA Online Training by Etraining Guru In Hyderabad
IBM DB2 LUW UDB DBA Online Training by Etraining Guru In HyderabadIBM DB2 LUW UDB DBA Online Training by Etraining Guru In Hyderabad
IBM DB2 LUW UDB DBA Online Training by Etraining Guru In Hyderabad
 
IBM DB2 LUW/UDB DBA Training by www.etraining.guru
IBM DB2 LUW/UDB DBA Training by www.etraining.guruIBM DB2 LUW/UDB DBA Training by www.etraining.guru
IBM DB2 LUW/UDB DBA Training by www.etraining.guru
 
IBM DB2 LUW UDB DBA Training by www.etraining.guru
IBM DB2 LUW UDB DBA Training by www.etraining.guruIBM DB2 LUW UDB DBA Training by www.etraining.guru
IBM DB2 LUW UDB DBA Training by www.etraining.guru
 
data loading and unloading in IBM Netezza by www.etraining.guru
data loading and unloading in IBM Netezza by www.etraining.gurudata loading and unloading in IBM Netezza by www.etraining.guru
data loading and unloading in IBM Netezza by www.etraining.guru
 
IBM DB2 LUW UDB DBA Online Training by Etraining.guru
IBM DB2 LUW UDB DBA Online Training by Etraining.guruIBM DB2 LUW UDB DBA Online Training by Etraining.guru
IBM DB2 LUW UDB DBA Online Training by Etraining.guru
 
Netezza Online Training by www.etraining.guru in India
Netezza Online Training by www.etraining.guru in IndiaNetezza Online Training by www.etraining.guru in India
Netezza Online Training by www.etraining.guru in India
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 

IBM Netezza Appliance Models (By, www.etraining.guru)

  • 1. IBM PureData System for Analytics (Formerly known as, IBM Netezza) - Ravi www.etraining.guru info@etraining.guru
  • 3. Single chassis, single host system Appliance for data warehouse test and development environments It’s a non High-Availability (HA) system Shares the same architecture and software as IBM Netezza 1000 (Twinfin) model
  • 5. Purpose built, standards based data warehouse appliance that architecturally integrates database, server, storage and advanced analytics capabilities into a single, easy-to-manage system It supports High Availability (HA) Fast load speeds: > 4TB/Hour Fast backup rates: > 4TB/Hour 1:1:1 ratio between disks, CPU’s, FPGA’s
  • 6.
  • 7.
  • 8.
  • 9. IBM PureData System for Analytics N1001 (Twinfin)
  • 10. Update to the IBM Netezza 1000 model, with same architecture On 09/Oct/2012, IBM announced the name changes from TwinFin (1000) to PureData System for Analytics, as below: 1:1:1 ratio between Disks, CPU’s, FPGA’s
  • 11.
  • 12. IBM Netezza High Capacity Appliances (C1000 model)
  • 13. C1000 models are similar to N1001 systems, but with more storage/rack Scales to more than 10 petabytes of user data capacity Each Netezza C1000 rack has one S-Blade chassis that contains 4 S-blades. Each S-blade has 8 CPU/FPGA processors. Same as N1001 1 Rack = 4 Storage Arrays (or Storage Groups) 1 Storage Array = 1 disk raid controller + 2 disk enclosures 1 disk raid controller = 12 disks; 1 disk enclosure = 12 disks So, 1 Storage array/group = 12 + (2*12) = 36 disks So, 1 Rack = 4 Storage array/groups = 4 * 36 disks = 144 TB Note: 2 spare disks per each storage array C1000-4: 1 Rack, 1 S-Blade Chassis  8 CPU, 8 FPGA 4 Storage groups  144 TB (8 Spares) C1000-8: 2 Racks 2 S-Blade Chassis  16 CPU, 16 FPGA 8 Storage groups  288 TB (16 Spares) Note: There is no 1:1:1 ratio between Disks, CPU’s, FPGA’s
  • 14.
  • 15. IBM PureData System for Analytics N2001 (Striper)
  • 16. 3x faster analytics performance & 50% more usable capacity per rack 128 GB/Sec scan rate 1:1:1 ration between disks, FPGA engines, and CPU cores doesn’t apply 1 Rack = 7 S-blades + 288 disks 1 S-blade = 16 CPU cores + 16 FPGA engines 288 disks = 240 active disks + 34 spare + 14 used for swap/log space Note: Each disk size in striper: 600 GB (user space:200GB, Mirror:200GB, Temp: 200GB) Why no 1:1:1 ratio? There is a 1:1 ratio between FPGA:CPU. There is no 1:1 ratio between FPGA and disks because FPGA can process the data much faster than the disk can produce. 2 S-blades have 40 disks; 5 S-blades have 32 disks Striper configurations: ½ rack, 1 rack, 2 rack, or 4 rack. We no longer have a ¼ rack configuration as we did with the Twinfin
  • 17.
  • 18.
  • 19. IBM PureData System for Analytics N2002 (Striper)
  • 20. N2002 is the first hardware lifecycle refresh of the IBM PureData System for Analytics N200x family of appliances.
  • 21. What happens when you submit a query?
  • 22. When the system starts up, 32 or 40 dataslices are assigned to each s-blade At no point during operation can one s-blade access the data on a dataslice which has been assigned to another s- blade If an s-blade fails, the dataslices which were assigned to the failed s-blade will be rebalanced and assigned to the remaining still operational s-blades. That is exactly the same as the system worked in later releases of NPS on the TwinFin architecture as well. There in no attachment of CPUs to disks. The only attachment is that dataslice is assigned to 1 S-blade when the system starts. CPU and FPGA resources are assigned as they are available. When a query starts, NPS will start 240 processes, one for each dataslice. The processes start reading data off disk; As each page of data (128KB) comes off disk, that page gets assigned to the first available FPGA on that s-blade. FPGA decompresses and filters data and passes the result back to the process. The Linux CPU scheduler assigns the process to one of the CPUs which processes the remaining data that came out of the FPGA. Once complete, the next 128KB is read off disk and that continues until all of the data has been processed for the table being scanned.
  • 24. option1: nz_get_model [/export/home/nz]$nz_get_model IBM PureData System for Analytics N2001-010 option2: select * from _t_environ where name like ‘%NPS%’ /export/home/nz->nzsql -c “select * from _t_environ where name like ‘NPS%’;” NAME | VAL ————–+———— NPS_PLATFORM | xs NPS_MODEL | P1000X_A_E NPS_FAMILY | Pseries Note: Pseries is for Twinfin; Qseries is for Striper

Editor's Notes

  1. Twinfin 3 (1000-3) Twinfin 6 (1000-6) Twinfin 12 (1000-12)
  2. Ref: http://www-01.ibm.com/support/docview.wss?uid=swg21614926
  3. The 14 disks are dedicated for partitions to store swap/log space required by the linux operating system that runs on the S-Blades, this has been separated out from the disks containing the partitions with user data primary/mirror and the database temp space, in the former TwinFin architecture all of these partitions were spread across all the disks, i.e. they weren't separated out.