SlideShare a Scribd company logo
1 of 47
Download to read offline
Cedrick Lunven
Director of developer Advocacy
Top 10 present and future
innovations in the NoSQL
Cassandra ecosystem
Cédrick
Lunven
Developer Advocates, Events, Community, Partners
2
@clunven
Directeur Developer Relations
➢ Speaker
➢ Streamer
➢ Author
➢ Training
➢ Community Manager
➢ Application developer
➢ Technical Architect
➢ Presales
➢ Support
Agenda
1. Why Cassandra
2. Top 10 innovations in the Cassandra Ecosystem
2.1. Cassandra 4.1 - CEP-03 = System-wide Guardrails
2.2. Cassandra 4.1 - CEP-13 = Denylisting Partition Keys
2.3. Cassandra 4.1 - CEP-18 = Pluggable Extension Points
2.4. Cassandra 4.2 - CEP-07 = Storage Indexed Storage
2.5. Cassandra 4.2 - CEP-19 = Tries Memtables
2.6. Cassandra 4.2 - CEP-21 = Transactional Cluster MetaData
2.7. Cassandra 4.2 - CEP-20 = Dynamic Data Masking
2.8. Cassandra 4.2 - CEP-15 = Acid Transaction
2.9. Communicate with any application using Stargate
2.10. Deploy Cassandra in Kubernetes
3. Resources
3
Why Cassandra ?
4
Relational Databases are Versatile
5
Fast Queries
“Customer-facing”
High number of transactions
Usually Hot / Live Data
High SLA Requirements
(Response Time / Availability)
OLTP: Online Transaction Processing OLAP: Online Analytical Processing
Complex Queries
Historical
High volume of data
Often “Cold Data”
Used by / for Analytics
Volume
of Data
Response
Time
Relational Databases have limited scalability
6
They were designed to run on a single machine
●
●
●
Apache Cassandra = NoSQL Distributed Decentralised DBMS
7
NODE
NODE
NODE
NODE
NODE NODE
NODE
1 Installation = 1 NODE
✔ Capacity = ~ 2-4TB
✔ Throughput = LOTS Tx/sec/core
Communication:
✔ Gossiping
DataCenter | Ring
Distributed ?
8
• Geographic Distribution • Hybrid-Cloud and Multi-Cloud
On-premise
Always On
Every second of downtime
translates into lost revenue
High Availability
Apache
Cassandra
Built as a
Technical
Necessity
Linear Scalability
of Fortune 100
companies
have adopted
90%
Hyper-Scalability
Millions of operations per day,
hour, or second
Faster Pace
Every millisecond of latency
has consequence
Low Latency
Data Everywhere
On-premises, hybrid, multi-cloud,
centralized, or edge
Global Distribution
Use Cases
10
High Throughput
High Volume
Heavy Writes
Heavy Reads
Event Streaming Log Analytics
Internet of Things Other Time Series
Mission-Critical
No Data Loss
Always-on
Scalability
Availability
Distributed
Cloud-native
Caching Pricing
Market Data Inventory
Banking Retail
Tracking /
Logistics
Customer
Experience
API Layer Hybrid-cloud
Enterprise Data
Layer
Multi-cloud
Modern Cloud
Applications
Global Presence
Workload Mobility
Compliance /
GDPR
Cassandra 4.1 (2022)
● CEP-03 = System-wide Guardrails
● CEP-13 = Denylisting Partition Keys
● CEP-18 = Pluggable Extension Points
11
Cassandra 4.0 (2021)
Is a big Deal !
12
● Wait for 6 years
● Production Ready day 1
● Features
○ 25% faster
○ Faster Big Clusters
○ Incremental Repairs
○ Virtual Tables
○ Transient Replicas
Cassandra 4.1 - What’s in it?
● Themes
● Major yearly release
● Usability
● Security
● Pluggability
● Features
● Configuration Improvements
● System-wide Guardrails
● Denylisting Partition Keys
● Pluggable Extension Points
● QA building blocks
13
Improved Configuration
14
● Standard and Intuitive names
○ noun_verb
● Units in values
System-wide Guardrails (CEP-3)
● Hard and soft limits on magnitudes
● Disabling features entirely
● Disallowing specific configuration values
● Administrative users exempted
● Typically checked at the CQL layer
● Based on work in Astra
15
https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Features-Guardrails-Framework.html
Cassandra 4.1 - Disabling Features
● Example: ALLOW FILTERING
● Configure in cassandra.yaml :
● Verify setting in cqlsh:
● Developer experience:
16
Cassandra 4.1 - Hard and Soft Limits
● Example: ALLOW FILTERING
● Configure in cassandra.yaml :
● Developer experience (soft limit aka warning):
17
Denylisting Partition Keys (CEP-13)
18
https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Denylisting-Partitions.html
● Overloaded partitions break clusters
○ compactions, streaming, GC, etc
○ Bad data modeling choices
○ Unintended usage or attacks
● The solution: operators can identify
partition keys for which queries will be
disallowed overloaded keys
https://unsplash.com/@nevenkrcmarek
Pluggable extension points
● Valuable mechanism to add new features
without disrupting the core
○ Internal APIs and code restructuring
● Pluggable storage (memtables)
○ Next: SSTables
● Pluggable network encryption
○ External key providers
● Pluggable authentication
○ Extend cqlsh via Python modules
● External schema storage
○ etcd for Kubernetes environments
19
Updating to Cassandra 4.1
20
Upgrading to 4.1
● In place upgrade from 4.0
● No application changes
● Simple steps, same as last time
● 1. Verify your software versions
● 2. Perform a cluster-wide snapshot
● 3. Upgrade the first node and verify
● 4. Continue to each node in your
cluster
21
https://foojay.io/today/have-you-upgraded-to-cassandra-4-0/
If you need help preparing for an upgrade to 4.1
● For a limited time — until July 31, 2022 — our award-winning team of Apache Cassandra™
contributors are available to help evaluate the health of your clusters, including conducting a
comprehensive health check, at a reduced price of $200 USD a month.*
● This special offer includes:
● Best practices, performance and security improvements, plus SLA achievement
● Expert-led health check of your Apache Cassandra environment
● On-going support for 1 year via DataStax Luna
● Expertise and peace of mind for your team
● How can you learn more and request this offer?
https://www.datastax.com/lp/luna-apache-cassandra
22
Cassandra 4.2 (2023)
● CEP-07 = Storage Attached Index
● CEP-19 = Tries Memtables
● CEP-21 = Transactional Cluster MetaData
● CEP-20 = Dynamic Data Masking
● CEP-15 = Acid Transaction
23
Storage Attached Index
24
Intended to replace both 2i and SASI
Trie Memtables (CEP-19)
25
Transactional Cluster MetaData
26
ACID Transactions (CEP-15)
27
Cassandra Relationship With Developers
28
Easy Hard
● Scale
● Resilience
● Distribution
● Atomicity
● Serialized changes
● Complicated State
Built-in Data Modeling
& Code
Accord
29
Node 1
Node 4 Node 2
Node 3
● Every node has a Reorder Buffer
● Clock skew is cool
● Leaderless timestamp protocol
● Fast Path Electorates: Fault tolerance
● TL;DR One Round Trip - ish
Reorder
Buffer
Reorder
Buffer
Reorder
Buffer
Reorder
Buffer
Good for Cassandra
● Leaderless
● Scales like Cassandra
● Failure modes match
More Tools
● Communicate with any application using Stargate
● Deploy Cassandra in Kubernetes
30
Stargate Overview
An open source API framework for data
Stargate makes it easy to use a
database for any application workload
by adding plugin support for new APIs,
data types, and access methods
MICROSERVICES
DEVELOPERS
31
Astra DB
gRPC
CQL
GraphQL Document
API
Data
API
Cassandra
3.x
Cassandra
4.x DSE 6.x
API Extensions and Persistence Extensions
32
Stargate
Cassandra Query
Language
GraphQL REST Document
SQL like Table Model
Structured Data
Key-Value Data
Strong Types
Minimal query overhead
Hierarchy of
types and fields
Structured Data
Key-Value Data
Low query overhead
Row based
Structured Data
Key-Value Data
Weaker Types
High query overhead
JSON Documents
Semi-Structured Data
Weaker Types
High query overhead
Drivers Open API
More Performant More Flexible
gRPC
Structured Data (CQL)
Lighter weight
Native driver alternative
Low query overhead
33
Connecting to your cluster (Before)
NODE
NODE
NODE
NODE
NODE NODE
NODE
developers
Apps
CQL
CQL
CQL
CQL
CQL
Drivers
CQL
CQL
34
Connecting to your cluster (with Stargates)
NODE
NODE
NODE
NODE
NODE NODE
NODE
developers
Apps
SDK
CQL
CQL
CQL
CQL
CQL
Stargate
Stargate
Stargate
CQL
CQL
CQL
CQL
CQL
Load
Balancer
(HTTP)
35
+ =
36
Apache
Cassandra®
Cass-Operator
37
Apache
Cassandra®
Cass-Operator
Metrics
Collector
38
Apache
Cassandra®
Cass-Operator
Cassandra
Medusa
(backup/restore)
Metrics
Collector
📁S3, GCP,...
Repear
(repair)
39
Apache
Cassandra®
Cass-Operator
Cassandra
Medusa
(backup/restore)
Metrics
Collector
📁S3, GCP,...
Repear
(repair)
Stargate
Resources
41
Cassandra Community Events
42
● 10/11 - London
● 10/13 - Amsterdam
● 11/08 - Hanoi
● 11/10 - Jakarta
● 11/10 - Santa Clara
● 11/10 - Houston
● 11/10 - Seattle
● 11/15 - Singapore
Every
Wednesday
12pm ET
Cassandra Community Resources
Planet Cassandra is back!
https://www.youtube.com/channel/UCvP-AXuCr
-naAeEccCfKwUA
43
Developer Community
https://www.datastax.com/dev/community
Ecosystem Updates
● Try out 4.1 features
including guardrails
44
Coming soon:
● Cassandra 4.1 support
● Stargate 2.0 release
● New Docs API
● More APIs
Coming soon:
● Cassandra 4.1 support
● K8ssandra Operator v1.2
Cassandra: The Definitive Guide, Revised Third Edition
45
https://www.datastax.com/resources/ebook/oreilly-cassandra-the-definitive-guide
Questions
46
Thank You!
47
@clunven

More Related Content

Similar to Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022)

Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsLeveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsJulien Anguenot
 
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...Databricks
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and KubernetesAltoros
 
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...NETWAYS
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...confluent
 
CN Asturias - Stateful application for kubernetes
CN Asturias -  Stateful application for kubernetes CN Asturias -  Stateful application for kubernetes
CN Asturias - Stateful application for kubernetes Cédrick Lunven
 
Using Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersUsing Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersMirantis
 
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Olga Zinkevych
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1Ruslan Meshenberg
 
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...DataStax
 
Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Principled Technologies
 
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”GlobalLogic Ukraine
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesJosef Adersberger
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesQAware GmbH
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open NetworkingCumulus Networks
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Dataconomy Media
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersenconfluent
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScyllaDB
 
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTAOrchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTARaghavendra Prabhu
 

Similar to Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022) (20)

Leveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analyticsLeveraging Cassandra for real-time multi-datacenter public cloud analytics
Leveraging Cassandra for real-time multi-datacenter public cloud analytics
 
Multi-cluster k8ssandra
Multi-cluster k8ssandraMulti-cluster k8ssandra
Multi-cluster k8ssandra
 
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
OSMC 2023 | What’s new with Grafana Labs’s Open Source Observability stack by...
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
 
CN Asturias - Stateful application for kubernetes
CN Asturias -  Stateful application for kubernetes CN Asturias -  Stateful application for kubernetes
CN Asturias - Stateful application for kubernetes
 
Using Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M usersUsing Kubernetes to make cellular data plans cheaper for 50M users
Using Kubernetes to make cellular data plans cheaper for 50M users
 
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1
 
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
Cassandra Tools and Distributed Administration (Jeffrey Berger, Knewton) | C*...
 
Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...Achieve high throughput: A case study using a Pensando Distributed Services C...
Achieve high throughput: A case study using a Pensando Distributed Services C...
 
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
Сloud Webinar #1 “Architecture of Highly Loaded Geo-Distributed Applications”
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to Kubernetes
 
Patterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to KubernetesPatterns and Pains of Migrating Legacy Applications to Kubernetes
Patterns and Pains of Migrating Legacy Applications to Kubernetes
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open Networking
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi Kivity
 
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTAOrchestrating Cassandra with Kubernetes Operator and PaaSTA
Orchestrating Cassandra with Kubernetes Operator and PaaSTA
 

More from Cédrick Lunven

BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...Cédrick Lunven
 
Avoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfAvoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfCédrick Lunven
 
Unlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLUnlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLCédrick Lunven
 
An oss api layer for your cassandra
An oss api layer for your cassandraAn oss api layer for your cassandra
An oss api layer for your cassandraCédrick Lunven
 
Xebicon2019 m icroservices
Xebicon2019   m icroservicesXebicon2019   m icroservices
Xebicon2019 m icroservicesCédrick Lunven
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with CassandraCédrick Lunven
 
VoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JVoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JCédrick Lunven
 
VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019Cédrick Lunven
 
Create API for your Databases
Create API for your DatabasesCreate API for your Databases
Create API for your DatabasesCédrick Lunven
 
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Cédrick Lunven
 
Streaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraStreaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraCédrick Lunven
 
Riviera jug apicassandra
Riviera jug apicassandraRiviera jug apicassandra
Riviera jug apicassandraCédrick Lunven
 
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterParis Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterCédrick Lunven
 
Introduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JIntroduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JCédrick Lunven
 

More from Cédrick Lunven (18)

BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
BigData Paris 2022 - Innovations récentes et futures autour du NoSQL Apache ...
 
Avoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdfAvoiding Pitfalls for Cassandra.pdf
Avoiding Pitfalls for Cassandra.pdf
 
Unlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQLUnlock cassandra data for application developers using graphQL
Unlock cassandra data for application developers using graphQL
 
An oss api layer for your cassandra
An oss api layer for your cassandraAn oss api layer for your cassandra
An oss api layer for your cassandra
 
Xebicon2019 m icroservices
Xebicon2019   m icroservicesXebicon2019   m icroservices
Xebicon2019 m icroservices
 
DevFestBdm2019
DevFestBdm2019DevFestBdm2019
DevFestBdm2019
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with Cassandra
 
Shift Dev Conf API
Shift Dev Conf APIShift Dev Conf API
Shift Dev Conf API
 
VoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4JVoxxedDays Luxembourg FF4J
VoxxedDays Luxembourg FF4J
 
VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019VoxxedDays Luxembourg 2019
VoxxedDays Luxembourg 2019
 
Design API - SnowCampIO
Design API - SnowCampIODesign API - SnowCampIO
Design API - SnowCampIO
 
Create API for your Databases
Create API for your DatabasesCreate API for your Databases
Create API for your Databases
 
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
Leveraging Feature Toggles for your Microservices (VoxxeddaysMicroservices Pa...
 
Streaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache CassandraStreaming, Analytics and Reactive Applications with Apache Cassandra
Streaming, Analytics and Reactive Applications with Apache Cassandra
 
Riviera jug apicassandra
Riviera jug apicassandraRiviera jug apicassandra
Riviera jug apicassandra
 
Riviera JUG ff4j
Riviera JUG ff4jRiviera JUG ff4j
Riviera JUG ff4j
 
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for JhipsterParis Meetup Jhispter #9 - Generator FF4j for Jhipster
Paris Meetup Jhispter #9 - Generator FF4j for Jhipster
 
Introduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4JIntroduction to Feature Toggle and FF4J
Introduction to Feature Toggle and FF4J
 

Recently uploaded

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Top 10 present and future innovations in the NoSQL Cassandra ecosystem (2022)

  • 1. Cedrick Lunven Director of developer Advocacy Top 10 present and future innovations in the NoSQL Cassandra ecosystem
  • 2. Cédrick Lunven Developer Advocates, Events, Community, Partners 2 @clunven Directeur Developer Relations ➢ Speaker ➢ Streamer ➢ Author ➢ Training ➢ Community Manager ➢ Application developer ➢ Technical Architect ➢ Presales ➢ Support
  • 3. Agenda 1. Why Cassandra 2. Top 10 innovations in the Cassandra Ecosystem 2.1. Cassandra 4.1 - CEP-03 = System-wide Guardrails 2.2. Cassandra 4.1 - CEP-13 = Denylisting Partition Keys 2.3. Cassandra 4.1 - CEP-18 = Pluggable Extension Points 2.4. Cassandra 4.2 - CEP-07 = Storage Indexed Storage 2.5. Cassandra 4.2 - CEP-19 = Tries Memtables 2.6. Cassandra 4.2 - CEP-21 = Transactional Cluster MetaData 2.7. Cassandra 4.2 - CEP-20 = Dynamic Data Masking 2.8. Cassandra 4.2 - CEP-15 = Acid Transaction 2.9. Communicate with any application using Stargate 2.10. Deploy Cassandra in Kubernetes 3. Resources 3
  • 5. Relational Databases are Versatile 5 Fast Queries “Customer-facing” High number of transactions Usually Hot / Live Data High SLA Requirements (Response Time / Availability) OLTP: Online Transaction Processing OLAP: Online Analytical Processing Complex Queries Historical High volume of data Often “Cold Data” Used by / for Analytics Volume of Data Response Time
  • 6. Relational Databases have limited scalability 6 They were designed to run on a single machine ● ● ●
  • 7. Apache Cassandra = NoSQL Distributed Decentralised DBMS 7 NODE NODE NODE NODE NODE NODE NODE 1 Installation = 1 NODE ✔ Capacity = ~ 2-4TB ✔ Throughput = LOTS Tx/sec/core Communication: ✔ Gossiping DataCenter | Ring
  • 8. Distributed ? 8 • Geographic Distribution • Hybrid-Cloud and Multi-Cloud On-premise
  • 9. Always On Every second of downtime translates into lost revenue High Availability Apache Cassandra Built as a Technical Necessity Linear Scalability of Fortune 100 companies have adopted 90% Hyper-Scalability Millions of operations per day, hour, or second Faster Pace Every millisecond of latency has consequence Low Latency Data Everywhere On-premises, hybrid, multi-cloud, centralized, or edge Global Distribution
  • 10. Use Cases 10 High Throughput High Volume Heavy Writes Heavy Reads Event Streaming Log Analytics Internet of Things Other Time Series Mission-Critical No Data Loss Always-on Scalability Availability Distributed Cloud-native Caching Pricing Market Data Inventory Banking Retail Tracking / Logistics Customer Experience API Layer Hybrid-cloud Enterprise Data Layer Multi-cloud Modern Cloud Applications Global Presence Workload Mobility Compliance / GDPR
  • 11. Cassandra 4.1 (2022) ● CEP-03 = System-wide Guardrails ● CEP-13 = Denylisting Partition Keys ● CEP-18 = Pluggable Extension Points 11
  • 12. Cassandra 4.0 (2021) Is a big Deal ! 12 ● Wait for 6 years ● Production Ready day 1 ● Features ○ 25% faster ○ Faster Big Clusters ○ Incremental Repairs ○ Virtual Tables ○ Transient Replicas
  • 13. Cassandra 4.1 - What’s in it? ● Themes ● Major yearly release ● Usability ● Security ● Pluggability ● Features ● Configuration Improvements ● System-wide Guardrails ● Denylisting Partition Keys ● Pluggable Extension Points ● QA building blocks 13
  • 14. Improved Configuration 14 ● Standard and Intuitive names ○ noun_verb ● Units in values
  • 15. System-wide Guardrails (CEP-3) ● Hard and soft limits on magnitudes ● Disabling features entirely ● Disallowing specific configuration values ● Administrative users exempted ● Typically checked at the CQL layer ● Based on work in Astra 15 https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Features-Guardrails-Framework.html
  • 16. Cassandra 4.1 - Disabling Features ● Example: ALLOW FILTERING ● Configure in cassandra.yaml : ● Verify setting in cqlsh: ● Developer experience: 16
  • 17. Cassandra 4.1 - Hard and Soft Limits ● Example: ALLOW FILTERING ● Configure in cassandra.yaml : ● Developer experience (soft limit aka warning): 17
  • 18. Denylisting Partition Keys (CEP-13) 18 https://cassandra.apache.org/_/blog/Apache-Cassandra-4.1-Denylisting-Partitions.html ● Overloaded partitions break clusters ○ compactions, streaming, GC, etc ○ Bad data modeling choices ○ Unintended usage or attacks ● The solution: operators can identify partition keys for which queries will be disallowed overloaded keys
  • 19. https://unsplash.com/@nevenkrcmarek Pluggable extension points ● Valuable mechanism to add new features without disrupting the core ○ Internal APIs and code restructuring ● Pluggable storage (memtables) ○ Next: SSTables ● Pluggable network encryption ○ External key providers ● Pluggable authentication ○ Extend cqlsh via Python modules ● External schema storage ○ etcd for Kubernetes environments 19
  • 21. Upgrading to 4.1 ● In place upgrade from 4.0 ● No application changes ● Simple steps, same as last time ● 1. Verify your software versions ● 2. Perform a cluster-wide snapshot ● 3. Upgrade the first node and verify ● 4. Continue to each node in your cluster 21 https://foojay.io/today/have-you-upgraded-to-cassandra-4-0/
  • 22. If you need help preparing for an upgrade to 4.1 ● For a limited time — until July 31, 2022 — our award-winning team of Apache Cassandra™ contributors are available to help evaluate the health of your clusters, including conducting a comprehensive health check, at a reduced price of $200 USD a month.* ● This special offer includes: ● Best practices, performance and security improvements, plus SLA achievement ● Expert-led health check of your Apache Cassandra environment ● On-going support for 1 year via DataStax Luna ● Expertise and peace of mind for your team ● How can you learn more and request this offer? https://www.datastax.com/lp/luna-apache-cassandra 22
  • 23. Cassandra 4.2 (2023) ● CEP-07 = Storage Attached Index ● CEP-19 = Tries Memtables ● CEP-21 = Transactional Cluster MetaData ● CEP-20 = Dynamic Data Masking ● CEP-15 = Acid Transaction 23
  • 24. Storage Attached Index 24 Intended to replace both 2i and SASI
  • 28. Cassandra Relationship With Developers 28 Easy Hard ● Scale ● Resilience ● Distribution ● Atomicity ● Serialized changes ● Complicated State Built-in Data Modeling & Code
  • 29. Accord 29 Node 1 Node 4 Node 2 Node 3 ● Every node has a Reorder Buffer ● Clock skew is cool ● Leaderless timestamp protocol ● Fast Path Electorates: Fault tolerance ● TL;DR One Round Trip - ish Reorder Buffer Reorder Buffer Reorder Buffer Reorder Buffer Good for Cassandra ● Leaderless ● Scales like Cassandra ● Failure modes match
  • 30. More Tools ● Communicate with any application using Stargate ● Deploy Cassandra in Kubernetes 30
  • 31. Stargate Overview An open source API framework for data Stargate makes it easy to use a database for any application workload by adding plugin support for new APIs, data types, and access methods MICROSERVICES DEVELOPERS 31
  • 32. Astra DB gRPC CQL GraphQL Document API Data API Cassandra 3.x Cassandra 4.x DSE 6.x API Extensions and Persistence Extensions 32
  • 33. Stargate Cassandra Query Language GraphQL REST Document SQL like Table Model Structured Data Key-Value Data Strong Types Minimal query overhead Hierarchy of types and fields Structured Data Key-Value Data Low query overhead Row based Structured Data Key-Value Data Weaker Types High query overhead JSON Documents Semi-Structured Data Weaker Types High query overhead Drivers Open API More Performant More Flexible gRPC Structured Data (CQL) Lighter weight Native driver alternative Low query overhead 33
  • 34. Connecting to your cluster (Before) NODE NODE NODE NODE NODE NODE NODE developers Apps CQL CQL CQL CQL CQL Drivers CQL CQL 34
  • 35. Connecting to your cluster (with Stargates) NODE NODE NODE NODE NODE NODE NODE developers Apps SDK CQL CQL CQL CQL CQL Stargate Stargate Stargate CQL CQL CQL CQL CQL Load Balancer (HTTP) 35
  • 42. Cassandra Community Events 42 ● 10/11 - London ● 10/13 - Amsterdam ● 11/08 - Hanoi ● 11/10 - Jakarta ● 11/10 - Santa Clara ● 11/10 - Houston ● 11/10 - Seattle ● 11/15 - Singapore Every Wednesday 12pm ET
  • 43. Cassandra Community Resources Planet Cassandra is back! https://www.youtube.com/channel/UCvP-AXuCr -naAeEccCfKwUA 43 Developer Community https://www.datastax.com/dev/community
  • 44. Ecosystem Updates ● Try out 4.1 features including guardrails 44 Coming soon: ● Cassandra 4.1 support ● Stargate 2.0 release ● New Docs API ● More APIs Coming soon: ● Cassandra 4.1 support ● K8ssandra Operator v1.2
  • 45. Cassandra: The Definitive Guide, Revised Third Edition 45 https://www.datastax.com/resources/ebook/oreilly-cassandra-the-definitive-guide