SlideShare a Scribd company logo
1 of 32
ROCKET ENGINE
FOR CONTEXT DRIVEN APPS THAT

PERSONALIZE THE INTERNET
BY YOUNG PAIK
DIRECTOR SALES ENGINEERING,
AEROSPIKE
Aerospike aer . o . spike [air-oh- spahyk]
noun, 1. tip of a rocket that enhances speed and stability
© 2014 Aerospike. All rights reserved. Confidential

1
Aerospike NoSQL Database

© 2014 Aerospike. All rights reserved. Confidential

2
AGENDA
1. How the game has changed,
driving need for next-gen NoSQL
2. Who uses Aerospike and why
3. Architecture Overview

© 2014 Aerospike. All rights reserved. Confidential

3
Internet Enterprises have changed the game…
Simple, Personalized, Instant

Complex, Standardized, Silo-ed

© 2014 Aerospike. All rights reserved. Confidential

4
Consumers Expect and “Want it All”
1. Instant Response
■“Every 100ms latency costs
Amazon 1% in sales”
– Greg Linden, Amazon
■“An extra ½ sec in search page
generation dropped traffic 20%”
– Google (average 1.5 sec)
■“A 1 sec delay can cause 7%
decline in conversion”
– Walmart

© 2014 Aerospike. All rights reserved. Confidential

5
Consumers Expect and “Want it All”
1. Instant Response
2. Intuitive Service
■

Personalized & Consistent
across channels
■
■

Mobile devices, tablet, car…
Web, mobile, social media…

© 2014 Aerospike. All rights reserved. Confidential

6
Consumers Expect and “Want it All”
1. Instant Response
2. Intuitive Service
■

Personalized & Consistent
across channels
■
■

■

Mobile devices, tablet, car…
Web, mobile, social media…

Seamless across the business
■

Marketing, sales, support…

© 2014 Aerospike. All rights reserved. Confidential

7
Consumers Expect and “Want it All”
1. Instant Response

2. Intuitive Service

1. Always-On
■

How much does down-time cost?

© 2014 Aerospike. All rights reserved. Confidential

8
Enterprises must “Deliver it All”
■ Use every swipe, search, share to delight
- Instantly, Intuitively, Always-On
■ (DIY or SaaS)

CONTEXT
■ IDENTITY
■ SessionIDs, Cookies, DeviceIDs, ip-Addr

■ ATTRIBUTES
■ Demographic, geographic

■ BEHAVIOR (REAL-TIME)
■ Presence, swipe, search, share..
■ Channels – web, phone, in-store..
■ Services – frequency, sophistication

■ SEGMENTS (PRE-CALCULATED)
■ Attitudes, values, lifestyle, history..

■ TRANSACTIONS
■ Payments, campaigns

© 2014 Aerospike. All rights reserved. Confidential

9
How Big is Real-time Big Context?
# People

Per Profile

10 M Customers *

25 kb

250 GB

500 M Prospects *

1 kb

+ 500 GB

Real-time
Context

= 750

CONTEXT

GB

■ IDENTITY
■ SessionIDs, Cookies, DeviceIDs, ip-Addr

■ ATTRIBUTES

■ How many Objects?

■ Demographic, geographic

■# People * # Devices * # Browsers
■People move around, cookies get
cleared..
■* 2x Replication

■ “100M people ≈ 2 Billion cookies”
- eBay

■ BEHAVIOR (REAL-TIME)
■ Presence, swipe, search, share..
■ Channels – web, phone, in-store..
■ Services – frequency, sophistication

■ SEGMENTS (PRE-CALCULATED)
■ Attitudes, values, lifestyle, history..

■ TRANSACTIONS
■ Payments, campaigns

© 2014 Aerospike. All rights reserved. Confidential

10
Aerospike Database – Powering Context-Driven
Apps
■ Apps that Personalize the Internet
Instantly, Intuitively & Always-On

1. ROCKET ENGINE
- In-Memory,
Flash-optimized
2. WEB SCALE
– Distributed,
Shared nothing
3. ACID RELIABILITY
– Immediate Consistency,
High Availability
4. NoSQL FLEXIBILITY
– Distributed Queries,
Real-time Analytics

© 2014 Aerospike. All rights reserved. Confidential

11
New Architecture for Context Computing
APP SERVERS

RDBMS

DATA
WAREHOUSE
AEROSPIKE
CLUSTER
MILLIONS OF CONSUMERS
BILLIONS OF DEVICES

SEGMENTS
R/W
REAL-TIME
CONTEXT

CONTEXT
IDENTITY
Cookies, device
ID..
ATTRIBUTES
Age, Gender..

+

REAL-TIME
ANALYTICS

Calculate models,
Discover
SEGMENTS eg
early adopter,
bargain hunter,
mass affluent…

BATCH
ANALYTICS

QUERIES &
AGGREGATIONS
Risk scores,
best sellers,
trending now…

BEHAVIOR
Click, search…
SEGMENTS
© 2014 Aerospike. All rights reserved. Confidential

12
Pioneered by Ad-Tech
AppNexus

[x+1]

Federated
Media

© 2014 Aerospike. All rights reserved. Confidential

eXelate

13
Powering the profile store for AppNexus (RTB)
■ “For the last three years, Aerospike’s
database has been managing our vast
volumes of user data. With Aerospike, we
process many terabytes of data daily
across our global data centers at a rate in
excess of a million requests per second.
– Mike Nolet, CTO
■ “AppNexus* operates at massive scale
while paying close attention to the
economics of the platform. Aerospike’s
flash optimizations running on top of
Intel® SSDs have given us the price,
performance, reliability, and serviceability
we need to grow our business.”
– Timothy G Smith,
SVP Technical Operations
© 2014 Aerospike. All rights reserved. Confidential

•

50 Billion Ad +
300 Billion Bid Requests/day
for Microsoft Ad Exchange,
Interactive Media
(Deutsche Telekom),
Collective…

•

6 Billion Mobile Ads/day
for Millennial Media Exchange

•

100ms SLA from click to view

14
Powering the eXelate Data Exchange (DMP)
■ 200 publishers/ marketers access real-time context on
700Million Consumers
■ Demographics, purchase intent, behavioral propensities
from online /offline sources eg Nielsen, MasterCard Advisors,
Bizo

■ Found SQL DBs an order of magnitude too expensive,
considered several NoSQL DBs
■ 200 servers ingest 2 TB clickstream data per day
to Aerospike and an analytics DWH
■ Models calculated in DWH, loaded into Aerospike:
■ 20 TB data, 60 Billion transactions per month
■ 50/50 balanced reads/writes
■ 12 node clusters, 7 SSDs/128GB DRAM per node
■ Data synchronized across clusters in 4 data centers

■ Aerospike delivered on all of these requirements.”
– Elad Efraim, CTO of eXelate
© 2014 Aerospike. All rights reserved. Confidential

15
Powering [x+1] Origin Digital Marketing Hub
(DMP + DSP)
■ Marketing Hub

■ Multi-channel analytics and personalization of messages
across touchpoints
■ Integrated with leading CRM platforms
■ Deployed at many Fortune 500 companies

■ “It was a challenge to find an extremely highperformance, high availability database…
■ 4 TB of data, 2 Billion profiles
■ 5,000-10,000 attributes per profile analyzed in 4ms
■ 10 relevant recommendations suggested in 50ms
- each time a visitor clicks on a website

…providing fast reliable access to data in real-time
is simple to say, but it’s not easy to do….
Aerospike has proven that our choice to buy, not build was
the right decision.” – Patrick DeAngelis, CTO, [x+1]
© 2014 Aerospike. All rights reserved. Confidential

16
Internet-scale Context Computing Platforms
EMAIL
RETAIL
E-COMMERCE

SEARCH

WEB

OMNI
CHANNEL

GAMING

VIDEO
MOBILE

SOCIAL

© 2014 Aerospike. All rights reserved. Confidential

17
Context driven Apps - Use Cases
ADVERTISING
• REAL-TIME BIDDING
• DEMAND SIDE PLATFORM
(DSP)
• DATA MGMT PLATFORM
(DMP)
• SUPPLY SIDE PLATFORM
(SSP)
MARKETING
• MULTI-SCREEN OFFERS
• MULTI-CHANNEL
PERSONALIZATION
• REAL-TIME
RECOMMENDATIONS
• ONE TIME COUPONS
• LOYALTY REWARDS
• DEALS NEAR YOU
• RELATED ITEMS
SALES
• PRODUCT AVAILABILITY
• DYNAMIC PRICING
• RISK SCORES
• FRAUD PREVENTION
• STREAM ANALYSIS

© 2014 Aerospike. All rights reserved. Confidential

SUPPORT
• REAL-TIME DASHBOARDS
• PERSONAL FINANCE
PORTFOLIOS

18
Next Gen NoSQL

© 2014 Aerospike. All rights reserved. Confidential

19
Aerospike Database / Context Computing
Platform
■ Powering Context driven Apps that
Personalize the Internet
Instantly, Intuitively & Always-On!

1. ROCKET ENGINE
- In-Memory,
Flash-optimized
2. WEB SCALE
– Distributed,
Shared nothing
3. ACID RELIABILITY
– Immediate
Consistency, High Availability
4. NoSQL EXTENSIBILITY
– Distributed Queries,
Real-time Analytics

© 2014 Aerospike. All rights reserved. Confidential

20
ROCKET ENGINE

Ask me. I’ll look up the answer and then tell it
to you.

• Indexes in DRAM
• Data in DRAM / SSD
• Balanced Reads &
Writes
• Highly Parallelized
Ask and I’ll tell you now.
• Lock-free + ACID

OTHER DATABASE

AEROSPIKE

OS FILE SYSTEM

HYBRID MEMORY SYSTEM™

PAGE CACHE

BLOCK INTERFACE OPEN NVM

BLOCK INTERFACE
HDD

SSD

SSD

OTHER
DATABASE

SSD

SSD

FLASH OPTIMIZED
IN-MEMORY DATABASE

© 2014 Aerospike. All rights reserved. Confidential

21
Aerospike Certification Tool (ACT) for SSDs
■ Industry Standard Flash (SSD / PCI-E) Benchmark
■ Open Source Tool used by Flash Vendors to certify drives

© 2014 Aerospike. All rights reserved. Confidential

22
10X Faster for Balanced Read/Write loads
HIGH THROUGHPUT

LOW LATENCY

350,000

Balanced Workload Read Latency
Average Latency, ms

300,000

250,000
200,000
150,000

10
Aerospike

7.5

Cassandra

5

MongoDB

2.5
0
0

100,000

Throughput, ops/sec

50,000

Balanced
Cassandra

Read-Heavy
MongoDB

Couchbase 2.0*

*We were forced to exclude Couchbase...since when run with either disk
or replica durability on it was unable to complete the test.”
– Thumbtack Technology

Average Latency, ms

Balanced Workload Update Latency

0

Aerospike

50,000 100,000 150,000 200,000

16
Aerospike

12

Cassandra

8

MongoDB

4
0
0

4 Node Cluster, each with:

50,000 100,000 150,000 200,000
Throughput, ops/sec

CPU: 8 x Intel(R) Xeon(R) CPU E5-2665 0 @
2.40GHz, RAM: 31 GB
SSD: 4 x INTEL SSDSA2CW120G3, 120 GB (94 GB overprovisioned)
HDD: ST500NM0011, 500 GB, SATA III, 7200 RPM
OS: Ubuntu Server 12.04.1 64-bit (Linux kernel v.3.2.0)
© 2014 Aerospike. All rights reserved. Confidential

23
ONLY
186 SERVERS REQUIRED

14 SERVERS

Actual customer analysis
99% < 1ms
500K TPS
10 TB Storage
2x Replication
OTHER DATABASES
DRAM & HDD

SSD & DRAM

Storage /server
TPS /server
Cost /server
Server costs
Power /server
Power (2 years) $0.12 per kWh ave.
US
Maintenance (2 years) $3,600
/server

180 GB (196 GB)
500,000
$8,000
$1,488,000
0.9 kW

2.4 TB (4 x 700 GB)
500,000
$11,000
$154,000
1.1 kW

$352,000

$32,400

$670,000

$50,400

Total

$2,510,000

$236,800

© 2014 Aerospike. All rights reserved. Confidential

24
WEB SCALE
■ Distributed Hash Table with No Hotspots
■ Every key hashed with RIPEMD160
into a 20 byte (fixed length) string

cookie-abcdefg-12345678

■ Hash + additional (fixed 64 bytes) data
stored in DRAM in the index
■ Some bits from hash value are used to
calculate the Partition ID (4096 partitions)
■ Partition ID maps to Node ID in the cluster

182023kh15hh3kahdjsh

■ Shared Nothing architecture

…

1

4

2

3

3

2

4096

■ No Load Balancers required

Replica
Node ID

1821

■ Client just calculates Partition ID, determines Node ID

Master
Node ID

1820

■ 1 Hop to data

Partition ID

4

1

■ Every node is identical

© 2014 Aerospike. All rights reserved. Confidential

25
WEB SCALE with ACID RELIABILITY

OHI
O

1)

No Hotspots
– DHT with RIPEMD160
simplifies data
partitioning

2)

Shared Nothing
Architecture,
every node identical

Single row ACID
– synch replication in
cluster

5)

Smart Cluster, Zero
Touch
– auto-failover,
rebalancing, rolling
upgrades..

Smart Client – 1 hop to
data, no load balancers

3)

4)

6)

Transactions and long
running tasks prioritized
© 2014 Aerospike.
real-time All rights reserved. Confidential

7) XDR – asynch replication
across data centers ensures
Zero Downtime

26
XDR ensures Zero Downtime
■ Cross Data Center Replication (XDR) enables geographic
redundancy and location proximity
■ Maximum flexibility
■ Replication set at namespace level
■ Active-Passive /Active-Active modes
■ Changes in one data center can be
■ replicated to multiple data centers
■ forwarded to another data center

■ Clusters can have different number of nodes
■ Automatic failure handling ensures continuity
in spite of node failures

■ Super Storm Sandy 2012
■ Power outage, NYC Cluster down for 17 hours
■ Once power returned, XDR synched in 1 hour

“Aerospike allows us to handle business
continuity and reliability across 4 data centers
seamlessly.”
- Elad Efraim, CTO
© 2014 Aerospike. All rights reserved. Confidential

27
APP SERVER

Architected for Context Computing

APPLICATION

APP/WEB
SERVER

AEROSPIKE SMART CLIENT™
• APIs (C, C#, Java, PHP, Python, Ruby, Erlang…)
• Transactions, Cluster awareness

EXTENSIBLE DATA MODEL

AEROSPIKE SERVER

Patents pendin
Written in „C‟

• Str, Int, Lists, Maps
• Lookups, Queries, Scans

• User Defined Functions
• Distributed Aggregations

4) NOSQL FLEXIBILITY

MONITORING &
MANAGEMENT
• Aerospike Monitoring
Console™
• Command Line Tools

AEROSPIKE
SMART CLUSTER™

2) WEB SCALE
AEROSPIKE
HYBRID
MEMORY SYSTEM™

AEROSPIKE (XDR)
CROSS DATA CENTER
REPLICATION™

3) ACID RELIABILITY

AEROSPIKE
CLUSTER

• PluginsNaglos, Graphite, Zabbi
x

AEROSPIKE
REAL-TIME
ENGINE™

1) ROCKET FAST
© 2014 Aerospike. All rights reserved. Confidential

28
NOSQL EXTENSIBILITY
■ Namespaces (policy containers)
■ Determine storage - DRAM or Flash
■ Determine replication factor
■ Contain records and sets

■ Sets (tables) of records
■ Arbitrary grouping

■ Records (rows)
■ Max 128k, contain key and bins
■ Bin with same name can contain values of
different types
■ String, integer, bytes (raw, blob, etc)
■ list ( an ordered collection of values )
■ map ( a collection of keys and values )

■ Bins can be added anytime

© 2014 Aerospike. All rights reserved. Confidential

29
Real-time Analytics on Operational Data
DISTRIBUTED QUERIES
1. “Scatter” requests to all nodes
2. Indexes in DRAM for fast map of secondary  primary keys
3. Indexes co-located with data to guarantee ACID,
manage migrations
4. Records read in parallel from all SSDs
using lock free concurrency control
5. Aggregate results on each node

6. “Gather” results from all nodes on client
STREAM AGGREGATIONS
1. Push Code/ Security Policies/ Rules to Data with UDFs
2. Pipe Query results through UDFs to
Filter, Transform, Aggregate.. Map, Reduce
REAL-TIME ANALYTICS on OPERATIONAL DATA (No ETL)
■ In Database, within the same Cluster
■ On the same Data, on XDR Replicated Clusters

© 2014 Aerospike. All rights reserved. Confidential

30
Aerospike Database / Context Computing
Platform
■ Powering Context driven Apps
that Personalize the Internet
Instantly, Intuitively & Always-On!

1. ROCKET ENGINE
- In-Memory,
Flash-optimized
2. WEB SCALE
– Distributed,
Shared nothing
3. ACID RELIABILITY
– Immediate
Consistency, High Availability
4. NoSQL EXTENSIBILITY
– Distributed Queries,
Real-time Analytics

© 2014 Aerospike. All rights reserved. Confidential

31
Recognized as the only Visionary in Gartner's Magic
Quadrant for Operational Database Management
Systems

Gartner, Magic Quadrant for Operational Database
Management Systems Donald Fienberg et al. October 23, 2013
This graphic was published by Gartner, Inc. as part of a larger
research document and should be evaluated in the context of the
entire document. The Gartner document is available at
www.aerospike.com .
Gartner does not endorse any vendor, product or service depicted in
its research publications, and does not advise technology users to
select only those vendors with the highest ratings. Gartner research
publications consist of the opinions of Gartner's research organization
and should not be construed as statements of fact. Gartner disclaims
all warranties, expressed or implied, with respect to this
research, including any warranties of merchantability or fitness for a
particular purpose.

© 2014 Aerospike. All rights reserved. Confidential

32

More Related Content

What's hot

Spark (Structured) Streaming vs. Kafka Streams
Spark (Structured) Streaming vs. Kafka StreamsSpark (Structured) Streaming vs. Kafka Streams
Spark (Structured) Streaming vs. Kafka StreamsGuido Schmutz
 
Upgrade from MySQL 5.7 to MySQL 8.0
Upgrade from MySQL 5.7 to MySQL 8.0Upgrade from MySQL 5.7 to MySQL 8.0
Upgrade from MySQL 5.7 to MySQL 8.0Olivier DASINI
 
Elastic Stack Introduction
Elastic Stack IntroductionElastic Stack Introduction
Elastic Stack IntroductionVikram Shinde
 
Aerospike Architecture
Aerospike ArchitectureAerospike Architecture
Aerospike ArchitecturePeter Milne
 
MySQL Performance Tuning: Top 10 Tips
MySQL Performance Tuning: Top 10 TipsMySQL Performance Tuning: Top 10 Tips
MySQL Performance Tuning: Top 10 TipsOSSCube
 
Getting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsGetting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsAerospike, Inc.
 
Open Source 101 2022 - MySQL Indexes and Histograms
Open Source 101 2022 - MySQL Indexes and HistogramsOpen Source 101 2022 - MySQL Indexes and Histograms
Open Source 101 2022 - MySQL Indexes and HistogramsFrederic Descamps
 
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Flink Forward
 
MySQL Performance Best Practices
MySQL Performance Best PracticesMySQL Performance Best Practices
MySQL Performance Best PracticesOlivier DASINI
 
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
Materialized Views and Secondary Indexes in Scylla: They Are finally here!Materialized Views and Secondary Indexes in Scylla: They Are finally here!
Materialized Views and Secondary Indexes in Scylla: They Are finally here!ScyllaDB
 
Ray: Enterprise-Grade, Distributed Python
Ray: Enterprise-Grade, Distributed PythonRay: Enterprise-Grade, Distributed Python
Ray: Enterprise-Grade, Distributed PythonDatabricks
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - DatalakeLam Le
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentialsqureshihamid
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkFlink Forward
 
Hybrid MongoDB and RDBMS Applications
Hybrid MongoDB and RDBMS ApplicationsHybrid MongoDB and RDBMS Applications
Hybrid MongoDB and RDBMS ApplicationsSteven Francia
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
 
ElasticSearch Basic Introduction
ElasticSearch Basic IntroductionElasticSearch Basic Introduction
ElasticSearch Basic IntroductionMayur Rathod
 

What's hot (20)

Spark (Structured) Streaming vs. Kafka Streams
Spark (Structured) Streaming vs. Kafka StreamsSpark (Structured) Streaming vs. Kafka Streams
Spark (Structured) Streaming vs. Kafka Streams
 
Upgrade from MySQL 5.7 to MySQL 8.0
Upgrade from MySQL 5.7 to MySQL 8.0Upgrade from MySQL 5.7 to MySQL 8.0
Upgrade from MySQL 5.7 to MySQL 8.0
 
Elastic Stack Introduction
Elastic Stack IntroductionElastic Stack Introduction
Elastic Stack Introduction
 
Aerospike Architecture
Aerospike ArchitectureAerospike Architecture
Aerospike Architecture
 
ELK Stack
ELK StackELK Stack
ELK Stack
 
MySQL Performance Tuning: Top 10 Tips
MySQL Performance Tuning: Top 10 TipsMySQL Performance Tuning: Top 10 Tips
MySQL Performance Tuning: Top 10 Tips
 
Getting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDsGetting The Most Out Of Your Flash/SSDs
Getting The Most Out Of Your Flash/SSDs
 
Open Source 101 2022 - MySQL Indexes and Histograms
Open Source 101 2022 - MySQL Indexes and HistogramsOpen Source 101 2022 - MySQL Indexes and Histograms
Open Source 101 2022 - MySQL Indexes and Histograms
 
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
 
MySQL Performance Best Practices
MySQL Performance Best PracticesMySQL Performance Best Practices
MySQL Performance Best Practices
 
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
Materialized Views and Secondary Indexes in Scylla: They Are finally here!Materialized Views and Secondary Indexes in Scylla: They Are finally here!
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
 
Ray: Enterprise-Grade, Distributed Python
Ray: Enterprise-Grade, Distributed PythonRay: Enterprise-Grade, Distributed Python
Ray: Enterprise-Grade, Distributed Python
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - Datalake
 
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & ParquetFile Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
 
Hybrid MongoDB and RDBMS Applications
Hybrid MongoDB and RDBMS ApplicationsHybrid MongoDB and RDBMS Applications
Hybrid MongoDB and RDBMS Applications
 
Apache flink
Apache flinkApache flink
Apache flink
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
 
ElasticSearch Basic Introduction
ElasticSearch Basic IntroductionElasticSearch Basic Introduction
ElasticSearch Basic Introduction
 

Viewers also liked

2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of Engagement2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of EngagementAerospike, Inc.
 
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...Aerospike, Inc.
 
Aerospike: Maximizing Performance
Aerospike: Maximizing PerformanceAerospike: Maximizing Performance
Aerospike: Maximizing PerformanceAerospike, Inc.
 
What the Spark!? Intro and Use Cases
What the Spark!? Intro and Use CasesWhat the Spark!? Intro and Use Cases
What the Spark!? Intro and Use CasesAerospike, Inc.
 
Aerospike AdTech Gets Hacked in Lower Manhattan
Aerospike AdTech Gets Hacked in Lower ManhattanAerospike AdTech Gets Hacked in Lower Manhattan
Aerospike AdTech Gets Hacked in Lower ManhattanAerospike
 
La leyenda del huemul precentación
La leyenda del huemul precentaciónLa leyenda del huemul precentación
La leyenda del huemul precentaciónnatibariloche
 
Driving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive AnalyticsDriving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive AnalyticsSingleStore
 
Introduction to NoSQL with MongoDB
Introduction to NoSQL with MongoDBIntroduction to NoSQL with MongoDB
Introduction to NoSQL with MongoDBHector Correa
 
Faster persistent data structures through hashing
Faster persistent data structures through hashingFaster persistent data structures through hashing
Faster persistent data structures through hashingJohan Tibell
 
Creating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataCreating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataMongoDB
 
Oracle Database Mobile Server Performance Tuning
Oracle Database Mobile Server Performance TuningOracle Database Mobile Server Performance Tuning
Oracle Database Mobile Server Performance Tuningphilipploer
 
Big Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveBig Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveAerospike, Inc.
 
Ansn ind 14_ir_suyamto
Ansn ind 14_ir_suyamtoAnsn ind 14_ir_suyamto
Ansn ind 14_ir_suyamtolukmanft21
 
Integrative Nutrition Pictures
Integrative Nutrition PicturesIntegrative Nutrition Pictures
Integrative Nutrition PicturesSara S
 
Slide Golle Ira
Slide Golle IraSlide Golle Ira
Slide Golle Iras5irgoll
 

Viewers also liked (16)

2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of Engagement2017 DB Trends for Powering Real-Time Systems of Engagement
2017 DB Trends for Powering Real-Time Systems of Engagement
 
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...
WEBINAR: Architectures for Digital Transformation and Next-Generation Systems...
 
Aerospike: Maximizing Performance
Aerospike: Maximizing PerformanceAerospike: Maximizing Performance
Aerospike: Maximizing Performance
 
What the Spark!? Intro and Use Cases
What the Spark!? Intro and Use CasesWhat the Spark!? Intro and Use Cases
What the Spark!? Intro and Use Cases
 
Aerospike AdTech Gets Hacked in Lower Manhattan
Aerospike AdTech Gets Hacked in Lower ManhattanAerospike AdTech Gets Hacked in Lower Manhattan
Aerospike AdTech Gets Hacked in Lower Manhattan
 
La leyenda del huemul precentación
La leyenda del huemul precentaciónLa leyenda del huemul precentación
La leyenda del huemul precentación
 
Driving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive AnalyticsDriving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive Analytics
 
Introduction to NoSQL with MongoDB
Introduction to NoSQL with MongoDBIntroduction to NoSQL with MongoDB
Introduction to NoSQL with MongoDB
 
Faster persistent data structures through hashing
Faster persistent data structures through hashingFaster persistent data structures through hashing
Faster persistent data structures through hashing
 
Creating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big DataCreating Real-time Systems of Engagement with Analytics and Big Data
Creating Real-time Systems of Engagement with Analytics and Big Data
 
Oracle Database Mobile Server Performance Tuning
Oracle Database Mobile Server Performance TuningOracle Database Mobile Server Performance Tuning
Oracle Database Mobile Server Performance Tuning
 
Big Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveBig Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's Perspective
 
Ansn ind 14_ir_suyamto
Ansn ind 14_ir_suyamtoAnsn ind 14_ir_suyamto
Ansn ind 14_ir_suyamto
 
Integrative Nutrition Pictures
Integrative Nutrition PicturesIntegrative Nutrition Pictures
Integrative Nutrition Pictures
 
Slide Golle Ira
Slide Golle IraSlide Golle Ira
Slide Golle Ira
 
Integración de Portafolio
Integración de PortafolioIntegración de Portafolio
Integración de Portafolio
 

Similar to Introduction to Aerospike

Brian Bulkowski : what startups can learn from real-time bidding
Brian Bulkowski : what startups can learn from real-time biddingBrian Bulkowski : what startups can learn from real-time bidding
Brian Bulkowski : what startups can learn from real-time biddingAerospike
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...In-Memory Computing Summit
 
What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)bigdatagurus_meetup
 
What enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingWhat enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingAerospike
 
The Future of Distributed Databases
The Future of Distributed DatabasesThe Future of Distributed Databases
The Future of Distributed DatabasesNuoDB
 
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...The Hive
 
Rapid Application Design in Financial Services
Rapid Application Design in Financial ServicesRapid Application Design in Financial Services
Rapid Application Design in Financial ServicesAerospike
 
Brian Bulkowski. Aerospike
Brian Bulkowski. AerospikeBrian Bulkowski. Aerospike
Brian Bulkowski. AerospikeVolha Banadyseva
 
Real-Time Analytics in Transactional Applications by Brian Bulkowski
Real-Time Analytics in Transactional Applications by Brian BulkowskiReal-Time Analytics in Transactional Applications by Brian Bulkowski
Real-Time Analytics in Transactional Applications by Brian BulkowskiData Con LA
 
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...Teradata Aster
 
Aerospike: Enabling Your Digital Transformation
Aerospike: Enabling Your Digital TransformationAerospike: Enabling Your Digital Transformation
Aerospike: Enabling Your Digital TransformationBrillix
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDBDenny Lee
 
Presentazione SimpliVity @ VMUGIT UserCon 2015
Presentazione SimpliVity @ VMUGIT UserCon 2015Presentazione SimpliVity @ VMUGIT UserCon 2015
Presentazione SimpliVity @ VMUGIT UserCon 2015VMUG IT
 
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...DataStax
 
Converged Everything, Converged Infrastructure Delivering Business Value and ...
Converged Everything, Converged Infrastructure Delivering Business Value and ...Converged Everything, Converged Infrastructure Delivering Business Value and ...
Converged Everything, Converged Infrastructure Delivering Business Value and ...NetApp
 
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...Aerospike
 
Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Aerospike, Inc.
 
Aerospike Meetup - Introduction - Ami - 04 March 2020
Aerospike Meetup - Introduction - Ami - 04 March 2020Aerospike Meetup - Introduction - Ami - 04 March 2020
Aerospike Meetup - Introduction - Ami - 04 March 2020Aerospike
 

Similar to Introduction to Aerospike (20)

Brian Bulkowski : what startups can learn from real-time bidding
Brian Bulkowski : what startups can learn from real-time biddingBrian Bulkowski : what startups can learn from real-time bidding
Brian Bulkowski : what startups can learn from real-time bidding
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
 
What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)
 
What enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingWhat enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time Bidding
 
The Future of Distributed Databases
The Future of Distributed DatabasesThe Future of Distributed Databases
The Future of Distributed Databases
 
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
 
Rapid Application Design in Financial Services
Rapid Application Design in Financial ServicesRapid Application Design in Financial Services
Rapid Application Design in Financial Services
 
Brian Bulkowski. Aerospike
Brian Bulkowski. AerospikeBrian Bulkowski. Aerospike
Brian Bulkowski. Aerospike
 
141106 actifio overview
141106 actifio overview 141106 actifio overview
141106 actifio overview
 
Real-Time Analytics in Transactional Applications by Brian Bulkowski
Real-Time Analytics in Transactional Applications by Brian BulkowskiReal-Time Analytics in Transactional Applications by Brian Bulkowski
Real-Time Analytics in Transactional Applications by Brian Bulkowski
 
comScore
comScorecomScore
comScore
 
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...
Utilizing Aster nCluster to support processing in excess of 100 Billion rows ...
 
Aerospike: Enabling Your Digital Transformation
Aerospike: Enabling Your Digital TransformationAerospike: Enabling Your Digital Transformation
Aerospike: Enabling Your Digital Transformation
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDB
 
Presentazione SimpliVity @ VMUGIT UserCon 2015
Presentazione SimpliVity @ VMUGIT UserCon 2015Presentazione SimpliVity @ VMUGIT UserCon 2015
Presentazione SimpliVity @ VMUGIT UserCon 2015
 
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
 
Converged Everything, Converged Infrastructure Delivering Business Value and ...
Converged Everything, Converged Infrastructure Delivering Business Value and ...Converged Everything, Converged Infrastructure Delivering Business Value and ...
Converged Everything, Converged Infrastructure Delivering Business Value and ...
 
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...
Handling Increasing Load and Reducing Costs During COVID-19 Crisis - Oshrat &...
 
Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...Flash Economics and Lessons learned from operating low latency platforms at h...
Flash Economics and Lessons learned from operating low latency platforms at h...
 
Aerospike Meetup - Introduction - Ami - 04 March 2020
Aerospike Meetup - Introduction - Ami - 04 March 2020Aerospike Meetup - Introduction - Ami - 04 March 2020
Aerospike Meetup - Introduction - Ami - 04 March 2020
 

More from Aerospike, Inc.

Leveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSLeveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSAerospike, Inc.
 
Using Databases and Containers From Development to Deployment
Using Databases and Containers  From Development to DeploymentUsing Databases and Containers  From Development to Deployment
Using Databases and Containers From Development to DeploymentAerospike, Inc.
 
01282016 Aerospike-Docker webinar
01282016 Aerospike-Docker webinar01282016 Aerospike-Docker webinar
01282016 Aerospike-Docker webinarAerospike, Inc.
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?Aerospike, Inc.
 
The role of NoSQL in the Next Generation of Financial Informatics
The role of NoSQL in the Next Generation of Financial InformaticsThe role of NoSQL in the Next Generation of Financial Informatics
The role of NoSQL in the Next Generation of Financial InformaticsAerospike, Inc.
 
Tectonic Shift: A New Foundation for Data Driven Business
Tectonic Shift: A New Foundation for Data Driven BusinessTectonic Shift: A New Foundation for Data Driven Business
Tectonic Shift: A New Foundation for Data Driven BusinessAerospike, Inc.
 
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and AerospikeHow to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and AerospikeAerospike, Inc.
 
Get Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California HighwaysGet Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California HighwaysAerospike, Inc.
 
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/Hour
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/HourRunning a High Performance NoSQL Database on Amazon EC2 for Just $1.68/Hour
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/HourAerospike, Inc.
 
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACIDACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACIDAerospike, Inc.
 
Storm Persistence and Real-Time Analytics
Storm Persistence and Real-Time AnalyticsStorm Persistence and Real-Time Analytics
Storm Persistence and Real-Time AnalyticsAerospike, Inc.
 
You Snooze You Lose or How to Win in Ad Tech?
You Snooze You Lose or How to Win in Ad Tech?You Snooze You Lose or How to Win in Ad Tech?
You Snooze You Lose or How to Win in Ad Tech?Aerospike, Inc.
 
Distributing Data The Aerospike Way
Distributing Data The Aerospike WayDistributing Data The Aerospike Way
Distributing Data The Aerospike WayAerospike, Inc.
 
Predictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timePredictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timeAerospike, Inc.
 

More from Aerospike, Inc. (14)

Leveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMSLeveraging Big Data with Hadoop, NoSQL and RDBMS
Leveraging Big Data with Hadoop, NoSQL and RDBMS
 
Using Databases and Containers From Development to Deployment
Using Databases and Containers  From Development to DeploymentUsing Databases and Containers  From Development to Deployment
Using Databases and Containers From Development to Deployment
 
01282016 Aerospike-Docker webinar
01282016 Aerospike-Docker webinar01282016 Aerospike-Docker webinar
01282016 Aerospike-Docker webinar
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?
 
The role of NoSQL in the Next Generation of Financial Informatics
The role of NoSQL in the Next Generation of Financial InformaticsThe role of NoSQL in the Next Generation of Financial Informatics
The role of NoSQL in the Next Generation of Financial Informatics
 
Tectonic Shift: A New Foundation for Data Driven Business
Tectonic Shift: A New Foundation for Data Driven BusinessTectonic Shift: A New Foundation for Data Driven Business
Tectonic Shift: A New Foundation for Data Driven Business
 
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and AerospikeHow to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
How to Get a Game Changing Performance Advantage with Intel SSDs and Aerospike
 
Get Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California HighwaysGet Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California Highways
 
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/Hour
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/HourRunning a High Performance NoSQL Database on Amazon EC2 for Just $1.68/Hour
Running a High Performance NoSQL Database on Amazon EC2 for Just $1.68/Hour
 
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACIDACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
 
Storm Persistence and Real-Time Analytics
Storm Persistence and Real-Time AnalyticsStorm Persistence and Real-Time Analytics
Storm Persistence and Real-Time Analytics
 
You Snooze You Lose or How to Win in Ad Tech?
You Snooze You Lose or How to Win in Ad Tech?You Snooze You Lose or How to Win in Ad Tech?
You Snooze You Lose or How to Win in Ad Tech?
 
Distributing Data The Aerospike Way
Distributing Data The Aerospike WayDistributing Data The Aerospike Way
Distributing Data The Aerospike Way
 
Predictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timePredictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-time
 

Recently uploaded

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Recently uploaded (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Introduction to Aerospike

  • 1. ROCKET ENGINE FOR CONTEXT DRIVEN APPS THAT PERSONALIZE THE INTERNET BY YOUNG PAIK DIRECTOR SALES ENGINEERING, AEROSPIKE Aerospike aer . o . spike [air-oh- spahyk] noun, 1. tip of a rocket that enhances speed and stability © 2014 Aerospike. All rights reserved. Confidential 1
  • 2. Aerospike NoSQL Database © 2014 Aerospike. All rights reserved. Confidential 2
  • 3. AGENDA 1. How the game has changed, driving need for next-gen NoSQL 2. Who uses Aerospike and why 3. Architecture Overview © 2014 Aerospike. All rights reserved. Confidential 3
  • 4. Internet Enterprises have changed the game… Simple, Personalized, Instant Complex, Standardized, Silo-ed © 2014 Aerospike. All rights reserved. Confidential 4
  • 5. Consumers Expect and “Want it All” 1. Instant Response ■“Every 100ms latency costs Amazon 1% in sales” – Greg Linden, Amazon ■“An extra ½ sec in search page generation dropped traffic 20%” – Google (average 1.5 sec) ■“A 1 sec delay can cause 7% decline in conversion” – Walmart © 2014 Aerospike. All rights reserved. Confidential 5
  • 6. Consumers Expect and “Want it All” 1. Instant Response 2. Intuitive Service ■ Personalized & Consistent across channels ■ ■ Mobile devices, tablet, car… Web, mobile, social media… © 2014 Aerospike. All rights reserved. Confidential 6
  • 7. Consumers Expect and “Want it All” 1. Instant Response 2. Intuitive Service ■ Personalized & Consistent across channels ■ ■ ■ Mobile devices, tablet, car… Web, mobile, social media… Seamless across the business ■ Marketing, sales, support… © 2014 Aerospike. All rights reserved. Confidential 7
  • 8. Consumers Expect and “Want it All” 1. Instant Response 2. Intuitive Service 1. Always-On ■ How much does down-time cost? © 2014 Aerospike. All rights reserved. Confidential 8
  • 9. Enterprises must “Deliver it All” ■ Use every swipe, search, share to delight - Instantly, Intuitively, Always-On ■ (DIY or SaaS) CONTEXT ■ IDENTITY ■ SessionIDs, Cookies, DeviceIDs, ip-Addr ■ ATTRIBUTES ■ Demographic, geographic ■ BEHAVIOR (REAL-TIME) ■ Presence, swipe, search, share.. ■ Channels – web, phone, in-store.. ■ Services – frequency, sophistication ■ SEGMENTS (PRE-CALCULATED) ■ Attitudes, values, lifestyle, history.. ■ TRANSACTIONS ■ Payments, campaigns © 2014 Aerospike. All rights reserved. Confidential 9
  • 10. How Big is Real-time Big Context? # People Per Profile 10 M Customers * 25 kb 250 GB 500 M Prospects * 1 kb + 500 GB Real-time Context = 750 CONTEXT GB ■ IDENTITY ■ SessionIDs, Cookies, DeviceIDs, ip-Addr ■ ATTRIBUTES ■ How many Objects? ■ Demographic, geographic ■# People * # Devices * # Browsers ■People move around, cookies get cleared.. ■* 2x Replication ■ “100M people ≈ 2 Billion cookies” - eBay ■ BEHAVIOR (REAL-TIME) ■ Presence, swipe, search, share.. ■ Channels – web, phone, in-store.. ■ Services – frequency, sophistication ■ SEGMENTS (PRE-CALCULATED) ■ Attitudes, values, lifestyle, history.. ■ TRANSACTIONS ■ Payments, campaigns © 2014 Aerospike. All rights reserved. Confidential 10
  • 11. Aerospike Database – Powering Context-Driven Apps ■ Apps that Personalize the Internet Instantly, Intuitively & Always-On 1. ROCKET ENGINE - In-Memory, Flash-optimized 2. WEB SCALE – Distributed, Shared nothing 3. ACID RELIABILITY – Immediate Consistency, High Availability 4. NoSQL FLEXIBILITY – Distributed Queries, Real-time Analytics © 2014 Aerospike. All rights reserved. Confidential 11
  • 12. New Architecture for Context Computing APP SERVERS RDBMS DATA WAREHOUSE AEROSPIKE CLUSTER MILLIONS OF CONSUMERS BILLIONS OF DEVICES SEGMENTS R/W REAL-TIME CONTEXT CONTEXT IDENTITY Cookies, device ID.. ATTRIBUTES Age, Gender.. + REAL-TIME ANALYTICS Calculate models, Discover SEGMENTS eg early adopter, bargain hunter, mass affluent… BATCH ANALYTICS QUERIES & AGGREGATIONS Risk scores, best sellers, trending now… BEHAVIOR Click, search… SEGMENTS © 2014 Aerospike. All rights reserved. Confidential 12
  • 13. Pioneered by Ad-Tech AppNexus [x+1] Federated Media © 2014 Aerospike. All rights reserved. Confidential eXelate 13
  • 14. Powering the profile store for AppNexus (RTB) ■ “For the last three years, Aerospike’s database has been managing our vast volumes of user data. With Aerospike, we process many terabytes of data daily across our global data centers at a rate in excess of a million requests per second. – Mike Nolet, CTO ■ “AppNexus* operates at massive scale while paying close attention to the economics of the platform. Aerospike’s flash optimizations running on top of Intel® SSDs have given us the price, performance, reliability, and serviceability we need to grow our business.” – Timothy G Smith, SVP Technical Operations © 2014 Aerospike. All rights reserved. Confidential • 50 Billion Ad + 300 Billion Bid Requests/day for Microsoft Ad Exchange, Interactive Media (Deutsche Telekom), Collective… • 6 Billion Mobile Ads/day for Millennial Media Exchange • 100ms SLA from click to view 14
  • 15. Powering the eXelate Data Exchange (DMP) ■ 200 publishers/ marketers access real-time context on 700Million Consumers ■ Demographics, purchase intent, behavioral propensities from online /offline sources eg Nielsen, MasterCard Advisors, Bizo ■ Found SQL DBs an order of magnitude too expensive, considered several NoSQL DBs ■ 200 servers ingest 2 TB clickstream data per day to Aerospike and an analytics DWH ■ Models calculated in DWH, loaded into Aerospike: ■ 20 TB data, 60 Billion transactions per month ■ 50/50 balanced reads/writes ■ 12 node clusters, 7 SSDs/128GB DRAM per node ■ Data synchronized across clusters in 4 data centers ■ Aerospike delivered on all of these requirements.” – Elad Efraim, CTO of eXelate © 2014 Aerospike. All rights reserved. Confidential 15
  • 16. Powering [x+1] Origin Digital Marketing Hub (DMP + DSP) ■ Marketing Hub ■ Multi-channel analytics and personalization of messages across touchpoints ■ Integrated with leading CRM platforms ■ Deployed at many Fortune 500 companies ■ “It was a challenge to find an extremely highperformance, high availability database… ■ 4 TB of data, 2 Billion profiles ■ 5,000-10,000 attributes per profile analyzed in 4ms ■ 10 relevant recommendations suggested in 50ms - each time a visitor clicks on a website …providing fast reliable access to data in real-time is simple to say, but it’s not easy to do…. Aerospike has proven that our choice to buy, not build was the right decision.” – Patrick DeAngelis, CTO, [x+1] © 2014 Aerospike. All rights reserved. Confidential 16
  • 17. Internet-scale Context Computing Platforms EMAIL RETAIL E-COMMERCE SEARCH WEB OMNI CHANNEL GAMING VIDEO MOBILE SOCIAL © 2014 Aerospike. All rights reserved. Confidential 17
  • 18. Context driven Apps - Use Cases ADVERTISING • REAL-TIME BIDDING • DEMAND SIDE PLATFORM (DSP) • DATA MGMT PLATFORM (DMP) • SUPPLY SIDE PLATFORM (SSP) MARKETING • MULTI-SCREEN OFFERS • MULTI-CHANNEL PERSONALIZATION • REAL-TIME RECOMMENDATIONS • ONE TIME COUPONS • LOYALTY REWARDS • DEALS NEAR YOU • RELATED ITEMS SALES • PRODUCT AVAILABILITY • DYNAMIC PRICING • RISK SCORES • FRAUD PREVENTION • STREAM ANALYSIS © 2014 Aerospike. All rights reserved. Confidential SUPPORT • REAL-TIME DASHBOARDS • PERSONAL FINANCE PORTFOLIOS 18
  • 19. Next Gen NoSQL © 2014 Aerospike. All rights reserved. Confidential 19
  • 20. Aerospike Database / Context Computing Platform ■ Powering Context driven Apps that Personalize the Internet Instantly, Intuitively & Always-On! 1. ROCKET ENGINE - In-Memory, Flash-optimized 2. WEB SCALE – Distributed, Shared nothing 3. ACID RELIABILITY – Immediate Consistency, High Availability 4. NoSQL EXTENSIBILITY – Distributed Queries, Real-time Analytics © 2014 Aerospike. All rights reserved. Confidential 20
  • 21. ROCKET ENGINE Ask me. I’ll look up the answer and then tell it to you. • Indexes in DRAM • Data in DRAM / SSD • Balanced Reads & Writes • Highly Parallelized Ask and I’ll tell you now. • Lock-free + ACID OTHER DATABASE AEROSPIKE OS FILE SYSTEM HYBRID MEMORY SYSTEM™ PAGE CACHE BLOCK INTERFACE OPEN NVM BLOCK INTERFACE HDD SSD SSD OTHER DATABASE SSD SSD FLASH OPTIMIZED IN-MEMORY DATABASE © 2014 Aerospike. All rights reserved. Confidential 21
  • 22. Aerospike Certification Tool (ACT) for SSDs ■ Industry Standard Flash (SSD / PCI-E) Benchmark ■ Open Source Tool used by Flash Vendors to certify drives © 2014 Aerospike. All rights reserved. Confidential 22
  • 23. 10X Faster for Balanced Read/Write loads HIGH THROUGHPUT LOW LATENCY 350,000 Balanced Workload Read Latency Average Latency, ms 300,000 250,000 200,000 150,000 10 Aerospike 7.5 Cassandra 5 MongoDB 2.5 0 0 100,000 Throughput, ops/sec 50,000 Balanced Cassandra Read-Heavy MongoDB Couchbase 2.0* *We were forced to exclude Couchbase...since when run with either disk or replica durability on it was unable to complete the test.” – Thumbtack Technology Average Latency, ms Balanced Workload Update Latency 0 Aerospike 50,000 100,000 150,000 200,000 16 Aerospike 12 Cassandra 8 MongoDB 4 0 0 4 Node Cluster, each with: 50,000 100,000 150,000 200,000 Throughput, ops/sec CPU: 8 x Intel(R) Xeon(R) CPU E5-2665 0 @ 2.40GHz, RAM: 31 GB SSD: 4 x INTEL SSDSA2CW120G3, 120 GB (94 GB overprovisioned) HDD: ST500NM0011, 500 GB, SATA III, 7200 RPM OS: Ubuntu Server 12.04.1 64-bit (Linux kernel v.3.2.0) © 2014 Aerospike. All rights reserved. Confidential 23
  • 24. ONLY 186 SERVERS REQUIRED 14 SERVERS Actual customer analysis 99% < 1ms 500K TPS 10 TB Storage 2x Replication OTHER DATABASES DRAM & HDD SSD & DRAM Storage /server TPS /server Cost /server Server costs Power /server Power (2 years) $0.12 per kWh ave. US Maintenance (2 years) $3,600 /server 180 GB (196 GB) 500,000 $8,000 $1,488,000 0.9 kW 2.4 TB (4 x 700 GB) 500,000 $11,000 $154,000 1.1 kW $352,000 $32,400 $670,000 $50,400 Total $2,510,000 $236,800 © 2014 Aerospike. All rights reserved. Confidential 24
  • 25. WEB SCALE ■ Distributed Hash Table with No Hotspots ■ Every key hashed with RIPEMD160 into a 20 byte (fixed length) string cookie-abcdefg-12345678 ■ Hash + additional (fixed 64 bytes) data stored in DRAM in the index ■ Some bits from hash value are used to calculate the Partition ID (4096 partitions) ■ Partition ID maps to Node ID in the cluster 182023kh15hh3kahdjsh ■ Shared Nothing architecture … 1 4 2 3 3 2 4096 ■ No Load Balancers required Replica Node ID 1821 ■ Client just calculates Partition ID, determines Node ID Master Node ID 1820 ■ 1 Hop to data Partition ID 4 1 ■ Every node is identical © 2014 Aerospike. All rights reserved. Confidential 25
  • 26. WEB SCALE with ACID RELIABILITY OHI O 1) No Hotspots – DHT with RIPEMD160 simplifies data partitioning 2) Shared Nothing Architecture, every node identical Single row ACID – synch replication in cluster 5) Smart Cluster, Zero Touch – auto-failover, rebalancing, rolling upgrades.. Smart Client – 1 hop to data, no load balancers 3) 4) 6) Transactions and long running tasks prioritized © 2014 Aerospike. real-time All rights reserved. Confidential 7) XDR – asynch replication across data centers ensures Zero Downtime 26
  • 27. XDR ensures Zero Downtime ■ Cross Data Center Replication (XDR) enables geographic redundancy and location proximity ■ Maximum flexibility ■ Replication set at namespace level ■ Active-Passive /Active-Active modes ■ Changes in one data center can be ■ replicated to multiple data centers ■ forwarded to another data center ■ Clusters can have different number of nodes ■ Automatic failure handling ensures continuity in spite of node failures ■ Super Storm Sandy 2012 ■ Power outage, NYC Cluster down for 17 hours ■ Once power returned, XDR synched in 1 hour “Aerospike allows us to handle business continuity and reliability across 4 data centers seamlessly.” - Elad Efraim, CTO © 2014 Aerospike. All rights reserved. Confidential 27
  • 28. APP SERVER Architected for Context Computing APPLICATION APP/WEB SERVER AEROSPIKE SMART CLIENT™ • APIs (C, C#, Java, PHP, Python, Ruby, Erlang…) • Transactions, Cluster awareness EXTENSIBLE DATA MODEL AEROSPIKE SERVER Patents pendin Written in „C‟ • Str, Int, Lists, Maps • Lookups, Queries, Scans • User Defined Functions • Distributed Aggregations 4) NOSQL FLEXIBILITY MONITORING & MANAGEMENT • Aerospike Monitoring Console™ • Command Line Tools AEROSPIKE SMART CLUSTER™ 2) WEB SCALE AEROSPIKE HYBRID MEMORY SYSTEM™ AEROSPIKE (XDR) CROSS DATA CENTER REPLICATION™ 3) ACID RELIABILITY AEROSPIKE CLUSTER • PluginsNaglos, Graphite, Zabbi x AEROSPIKE REAL-TIME ENGINE™ 1) ROCKET FAST © 2014 Aerospike. All rights reserved. Confidential 28
  • 29. NOSQL EXTENSIBILITY ■ Namespaces (policy containers) ■ Determine storage - DRAM or Flash ■ Determine replication factor ■ Contain records and sets ■ Sets (tables) of records ■ Arbitrary grouping ■ Records (rows) ■ Max 128k, contain key and bins ■ Bin with same name can contain values of different types ■ String, integer, bytes (raw, blob, etc) ■ list ( an ordered collection of values ) ■ map ( a collection of keys and values ) ■ Bins can be added anytime © 2014 Aerospike. All rights reserved. Confidential 29
  • 30. Real-time Analytics on Operational Data DISTRIBUTED QUERIES 1. “Scatter” requests to all nodes 2. Indexes in DRAM for fast map of secondary  primary keys 3. Indexes co-located with data to guarantee ACID, manage migrations 4. Records read in parallel from all SSDs using lock free concurrency control 5. Aggregate results on each node 6. “Gather” results from all nodes on client STREAM AGGREGATIONS 1. Push Code/ Security Policies/ Rules to Data with UDFs 2. Pipe Query results through UDFs to Filter, Transform, Aggregate.. Map, Reduce REAL-TIME ANALYTICS on OPERATIONAL DATA (No ETL) ■ In Database, within the same Cluster ■ On the same Data, on XDR Replicated Clusters © 2014 Aerospike. All rights reserved. Confidential 30
  • 31. Aerospike Database / Context Computing Platform ■ Powering Context driven Apps that Personalize the Internet Instantly, Intuitively & Always-On! 1. ROCKET ENGINE - In-Memory, Flash-optimized 2. WEB SCALE – Distributed, Shared nothing 3. ACID RELIABILITY – Immediate Consistency, High Availability 4. NoSQL EXTENSIBILITY – Distributed Queries, Real-time Analytics © 2014 Aerospike. All rights reserved. Confidential 31
  • 32. Recognized as the only Visionary in Gartner's Magic Quadrant for Operational Database Management Systems Gartner, Magic Quadrant for Operational Database Management Systems Donald Fienberg et al. October 23, 2013 This graphic was published by Gartner, Inc. as part of a larger research document and should be evaluated in the context of the entire document. The Gartner document is available at www.aerospike.com . Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings. Gartner research publications consist of the opinions of Gartner's research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose. © 2014 Aerospike. All rights reserved. Confidential 32

Editor's Notes

  1. The first round of NoSQL databases were created for point solutions to specific problems. As the space has matured, there are new, more complex use cases that is driving the evolution of NoSQL.We will go through what some of these changes are, who is using AS and why, and and architectural overview of the database.
  2. One of the big changes in recent years has been the development of social media and multi-channel marketing. Companies that focused on a single channel have given way to companies that give their customers multiple ways to find and access content or products. These companies have focused on making it easy to share content, not only with multiple devices, but to family and friends.
  3. Some say “Speed kills,” but in today’s world speed sells. Consumers do not have the patience to wait for a page to come up, if you can’t get them in much less than a second, you will lose them.Amazon stats - Greg Linden, Amazon https://dl.google.com/io/2009/pres/DesigningOpenSocialAppsForSpeedandScale.pdfhttp://glinden.blogspot.com/2006/11/marissa-mayer-at-web-20.htmlhttp://home.blarg.net/~glinden/StanfordDataMining.2006-11-29.pptWalmart stats - Page Performance &amp; Site Conversion, Feb 2012“People will visit a Website less often if it is slower than a close competitor by &gt; 250 ms” – Microsoft
  4. Consumers now have multiple ways to access services. They do not want to have to remember on which device they did something. It should work across different devices.
  5. Companies must be able to support customers holistically. Marketing and sales should be aware of whether an email goes out is to an existing customer, an ex-customer, or one who has been having trouble.
  6. Customers have new expectations of when they can access systems. How often have you tried to transfer funds, only to find an issue due to “routine maintenance?”
  7. There are many new types of offers/deals that companies can offer to customers.REAL-TIME OFFERS ONE TIME COUPONS LOYALTY REWARDSDEALS NEAR YOURELATED ITEMSRECOMMENDATIONSPRODUCT AVAILABILITYDYNAMIC PRICINGRISK SCORESWhat these all rely on is the ability to track the customer across different channels, track their behavior in real-time, track segments over a period of time (often months) and also keep track of any transactions these customers may have made. All of this requires a new infrastructure that is at once much larger and much more responsive than ever before.
  8. To get a sense of the scale, suppose you are a medium sized business today. You might have 10 M customers you wish to track.
  9. SaaS platform vendors (B2B2C) and consumer facing enterprises must learn from pioneers who have already built internet scale interactions platforms that deliver the right offer at the right price right now.These pioneers have all implemented the same reference architecture:An application farm backed by an Aerospike cluster, an Interaction Store that contains user information that maps identities across different browsers, devices, websites or channels, context on what the user is doing right now – clicks, swipes, searches, tweets etc and segment information from big data analytics out of a hadoop cluster or data warehouse.The hadoop clusters and data warehouses store petabytes of data while only the most recent, most valuable, 1-100TB of hot data is stored in Aerospike.Applications use identity and segment information combined with real-time context and real-time analytics to determine the right offer right now.
  10. Companies in the ad:tech eco-system – Demand Side Platforms of DSPs, Supply Side Platforms or SSPs, Data exchanges or DMPs and Ad Exchanges – participate in Real-Time Bidding.RTB is only second to high frequency trading when it comes to low latency.These companies have just 100ms from the time someone clicks or swipes to when an offer is served.Many don’t make any money unless someone clicks on that offer – which means it has to be the right offer at the right time.They must examine the user’s cookie, check terabytes of data in Aerospike to know who the user is, what the user is doing, what the user may want, where the user is – website, mobile app, video etc – and then bid for the right to serve the offer.And only if they win the bid, can they actually serve the offer.If they cannot bid, win and serve within 100ms, they lose the opportunity and they lose revenue.
  11. BuiltWith.com tracks which websites use various technologies.Of the top Million sites, every time hundreds of millions of users click on over 40,000 websites, a call is made to AppNexus and in turn to Aerospike.AppNexus is second only to Google and BlueKai is not far behind.This fall, AppNexus announced that Millenial Media, the largest Mobile Ad Network would be sending it’s inventory to AppNexus for real-time bidding.Microsoft just announced that it would start making ads available on outlook clients – doubling inventory and sending to AppNexus for rtb.
  12. When it comes to speed, databases like SAP Hana go in-memory.Many databases cache data in memory and are “accelerated by flash” – but they just use SSDs instead of rotational drives.They still use the Linux file system that was built for spinning disks and get maybe a 2x performance boost.Aerospike uses a hybrid approach with indexes in DRAM and data that can be in DRAM of Flash.Aerospike can run just in DRAM alone, but most customers take advantage of the price/performance benefits of Flash.We access SSDs or PCI-E cards using a proprietary log structured file system that is optimized for flash.We use small block reads and large block writes to reduce wear and access is highly parallelized for maximum efficiency.
  13. This results in 10x higher throughput and sub millisecond response times for these tests of Balanced reads and writesFor 2 node clusters using SSDswith 2x replicationAnd immediate consistency for Aerospike (vs eventual consistency for Cassandra)YCSB is the standard benchmark for NoSQL. Code is open sourced on github and specs are published at http://www.aerospike.com/benchmarks/ so anyone can reproduce tests and results.
  14. High performance with Aerospike’s hybrid approach to memory also means that Aerospike can scale up to store more data per server and process the same load with 10x fewer servers than a DRAM only system.Here a customer priced a 10TB system (2x replication) that had to handle 500k TPS.A 14 server cluster with Aerospike cost only $236k compared to the 186 server cluster they would have had to purchase, install, operate and maintain with the competition.
  15. Gartner just published it’s Magic Quadrant on Operational Databases.Established companies like Oracle, Microsoft, IBM and SAP are in the leader quadrant.Of the emerging NoSQL players,Aerospike was the ONLY Visionary. The next generation of NoSQL and the next generation of In-Memory computing.