SlideShare a Scribd company logo
1 of 34
Performance Tuning of MySQL Cluster


  Santa Clara, April 2012




  Johan Andersson

  Severalnines AB

  johan@severalnines.com

  Cell +46 73 073 60 99
Copyright Severalnines 2012
2




      Agenda

       Scaling and Partitioning

       Designing a Scalable System

       Insert Performance Tuning

       Query Tuning

       Random tricks

       Disk Data Tuning




    Copyright Severalnines 2012
3




           Here is ...
    Access Layer

          App        App
         Server     Server



        MYSQL      MYSQL




    STORAGE LAYER


         DATA          DATA
         NODE          NODE



         P0             P1
      Node group 0

        Copyright Severalnines 2012
4




        It can scale linearly ...
    Access Layer

          App        App        App         App      App        App        App      App
         Server     Server     Server      Server   Server     Server     Server   Server



        MYSQL      MYSQL       MYSQL      MYSQL     MYSQL      NDBAPI    NDBAPI    NDBAPI




    STORAGE LAYER                       STORAGE LAYER                   STORAGE LAYER


         DATA          DATA                DATA         DATA                DATA            DATA
         NODE          NODE                NODE         NODE                NODE            NODE



         P0             P1                  P2           P3                  P4             P5
      Node group 0                       Node group 1                    Node group 2

      Copyright Severalnines 2012
if you find the bottlenecks

   A lot of CPU is used on the data nodes
       Probably a lot of large index scans and full table scans are used.

   A lot of CPU is used on the mysql servers
       Probably a lot of GROUP BY/DISTINCT or aggregate functions.

   Hardly no CPU is used on either mysql or data nodes
       Probably low load
       Time is spent on network (a lot of “ping pong” to satisfy a request).

   System is running slow in general
       Disks (io util), queries, swap (should never happen)



Copyright Severalnines 2012
and if you know how
   Adding mysqlds – trivial – if the mysqld is the bottleneck

   BUT! Adding data nodes
        More data nodes does not automatically give better performance
                ●
                    Latency may increase for a single query
                ●
                    Total throughout will be improved
        How to get both?




Copyright Severalnines 2012
7




      Designing a
      Scalable System

       Define the most typical Use Cases
           List all my friends, session management etc etc.
           Optimize everything for the typical use case

       Keep it simple
           Complex access patterns does not scale
           Simple access patterns do ( Primay key and Partitioned Index Scans )

       Note! There is no parameter in config.ini that affects performance – only availability.
           Everything is about the Schema and the Queries.
           Tune the mysql servers (sort buffers etc) as you would for innodb.




    Copyright Severalnines 2012
8




      Simple Access

       PRIMARY KEY lookups are HASH lookup O(1)

       INDEX searches a T-tree and takes O(log n) time.

       In 7.2 JOINs are ok, but in 7.1 you should try to avoid
        them.




    Copyright Severalnines 2012
9




       Data Access
    Access Layer

          App        App        App       App      App         App      App      App
         Server     Server     Server    Server   Server      Server   Server   Server



        MYSQL      MYSQL      MYSQL      MYSQL    MYSQL       NDBAPI   NDBAPI   NDBAPI




    STORAGE LAYER


         DATA          DATA               DATA         DATA              DATA            DATA
         NODE          NODE               NODE         NODE              NODE            NODE



         P0             P1                P2           P3                 P4             P5
      Node group 0                      Node group 1                   Node group 2

     Copyright Severalnines 2012
10




       Data Access

        One Request can hit all Partitions
             Sub-optimal and won't scale




     Copyright Severalnines 2012
11




       Data Access
     Access Layer

            App         App         App        App      App        App       App      App
           Server      Server      Server     Server   Server     Server    Server   Server



          MYSQL       MYSQL        MYSQL     MYSQL     MYSQL      NDBAPI   NDBAPI    NDBAPI




     STORAGE LAYER


           DATA           DATA                 DATA        DATA               DATA            DATA
           NODE           NODE                 NODE        NODE               NODE            NODE



           P0              P1                  P2           P3                 P4             P5
       Node group 0                         Node group 1                   Node group 2

     Copyright Severalnines 2012
12




       Data Access

        One Request hits one partition
             Scales!
             The number of Partitions (data nodes) does not matter!
        Partitioning is important!




     Copyright Severalnines 2012
13




       Partitioning

        MySQL Cluster auto-partitions based on the Primary Key
             Data is spread randomly
        If possible better to Partition on a part of the Primary Key
             CREATE TABLE user_friends(
                userid,
                friendid ,
                somedata,
                PRIMARY KEY (userid, friendid)) PARTITION BY KEY(userid)
             All records with userid=X will be stored in the same partition!
        Ultra important for MySQL Cluster 7.2 and Fast JOINs.



     Copyright Severalnines 2012
14




       Partitioning

        EXPLAIN PARTITIONS <query>
             Tells you what partitions you touch.
        Also verify with:

       mysql> show global status like 'ndb_pruned_scan_count’;
          +-----------------------+-------+
          | Variable_name         | Value |
          +-----------------------+-------+
          | Ndb_pruned_scan_count | 1     |
          +-----------------------+-------+

             Increases when Partition Pruning could be used.




     Copyright Severalnines 2012
15




       Insert Performance

        Scaling Inserts
             Option 1) Batch INSERTS if you can
                    ●   An insert batch of 10 records will perform 10x faster than 10 single
                        inserts!
                    ●   INSERT INTO t1 VALUES (<record1>), (<record2>), …,(<recordN>)
             Option 2) Many threads (parallelism)
             Or a combo of both
        Dumpfiles or LOAD DATA INFILEs
             Chunk them up and load in parallel on several mysqlds




     Copyright Severalnines 2012
16




       Insert Performance

        INSERTs in a table with AUTO_INCREMENT
        MySQL Server query Data nodes for an auto_increment
              –   The mysqld can hold a range of autoincs (cache)
              –   Before an INSERT, and autoinc must be fetched from either Data node
                  (slow) or on the cache (fast)
        ndb_autoincremet_prefetch_sz sets the cache size and it affects insert perf:
              –   ndb_autoincrement_prefetch_sz=1:            1211.91TPS
              –   ndb_autoincrement_prefetch_sz=256:          3471.71TPS
              –   ndb_autoincrement_prefetch_sz=1024:         3659.52TPS



     Copyright Severalnines 2012
17




       Insert Performance

        ndb_batch_size can also be important with LOAD DATA INFILE or
         dumps
             SET GLOBAL|SESSION NDB_BATCH_SIZE = 16M
             You may get LongMessageBuffer Overload
                     ●   Increase it in config.ini to 32M or 48M
        Also REDO logs may get overloaded if your disks are too slow and/or
         the REDO is too small.




     Copyright Severalnines 2012
18




       Query Performance

        Queries needs to be tuned as “usual”:
             Slow query / general log
             From a monitoring system (like ClusterControl)
             + a methodology




     Copyright Severalnines 2012
Query Performance
 Slow query log
    set global slow_query_log=1;
    set global long_query_time=0.01;
    set global log_queries_not_using_indexes=1;

 General log (if you don’t get enough info in the Slow Query Log)
    Activate for a very short period of time (30-60seconds) – intrusive
    Can fill up disk very fast – make sure you turn it off.
    set global general_log=1;

 Use Severalnines ClusterControl
    Includes a Cluster-wide Query Monitor.
    Query frequency, EXPLAINs, lock time etc.
    Performance Monitor and Manager.

  Copyright Severalnines 2012
Data Types
 BLOB/TEXT columns are stored in an external hidden table.
    First 255B are stored inline in main table
    Reading a BLOB/TEXT requires two reads
    One for reading the Main table + reading from hidden table

 Change to VARBINARY/VARCHAR if:
    Your BLOB/TEXTs can fit within an 8052B record
    (record size is currently 8052 Bytes)
    Reading/writing VARCHAR/VARBINARY is less expensive

Note 1: BLOB/TEXT are also more expensive in Innodb as BLOB/TEXT data is not
   inlined with the table. Thus, two disk seeks are needed to read a BLOB.

Note 2: Store images, movies etc outside the database on the filesystem.
  Copyright Severalnines 2012
Data Types
 Example
CREATE TABLE `t1_blob` (

`id` int(11) NOT NULL AUTO_INCREMENT,

`data1` blob,

`data2` blob,

PRIMARY KEY (`id`)

)ENGINE=ndbcluster

 Performance (8 threads, one mysqld, two data nodes):
   data1 and data2 as BLOBs: 5844TPS
   data1 and data2 as VARBINARY: 19206TPS
 ~3x

   Copyright Severalnines 2012
Denormalize

   Tables sharing the same PRIMARY KEY can be denormalized.
        Table T1: <UID, SOME_DATA>
        Table T2: <UID, SOME_OTHER_DATA
        SELECT * from T1,T2 WHERE T1.UID=T2.UID and T2.UID=1 requires
         two roundtrips.
        Starting with MySQL Cluster 7.2 only one roundtrip is needed,.

   Denormalize
        Table T12: <UID,SOME_DATA, SOME_OTHER_DATA>

   Improvement: 2X in throughput


Copyright Severalnines 2012
Query Tuning < 7.2
 Don't trust the OPTIMIZER in MySQL Cluster 7.1 and earlier
    Statistics gathering is non-existing
    Optimizer thinks there are only 10 rows to examine in each table!

 You have to do a lot of
    FORCE INDEX / STRAIGH_JOIN to get queries run the way you
     want.




 Copyright Severalnines 2012
Query Tuning < 7.2
 Classic example: if you have two similar indexes:
    index(a)
    index(a,ts)

on the following table
        CREATE TABLE `t1` (
     `id` int(11) NOT NULL AUTO_INCREMENT,
     `a` bigint(20) DEFAULT NULL,
     `ts` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
     PRIMARY KEY (`id`),
     KEY `idx_t1_a` (`a`),
     KEY `idx_t1_a_ts` (`a`,`ts`)) ENGINE=ndbcluster DEFAULT CHARSET=latin1




  Copyright Severalnines 2012
Query Tuning < 7.2
mysql> explain select * from t1 where a=2 and ts='2011-10-05 15:32:11';

     +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+
    | id | select_type | table | type | possible_keys             | key      | key_len | ref | rows | Extra       |
    +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+
    | 1 | SIMPLE         | t1 | ref | idx_t1_a,idx_t1_a_ts | idx_t1_a | 9                | const | 10 | Using where |
    +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+

 Use FORCE INDEX(..) ...
mysql> explain select * from t1 FORCE INDEX (idx_t1_a_ts) where a=2 and ts='2011-10-05
   15:32:11;

+| 1 | SIMPLE       | t1   | ref | idx_t1_a_ts | idx_t1_a_ts | 13           | const,const | 10 | Using where |

1 row in set (0.00 sec)

     ..to ensure the correct index is picked!

 The difference can be 1 record read instead of any number of
  records!


   Copyright Severalnines 2012
26




       Query Tuning in 7.2

        ANALYZE TABLE
             Must be performed periodically to rebuild index stats
        EXPLAIN EXTENDED/PARTITIONS
             Make sure the explain show “Child of JOIN pushed down”
             This means that the Fast JOIN of NDB could be used
             SHOW WARNINGS;
                    ●   Shows why a Query was not pushed down.




     Copyright Severalnines 2012
Ndb_cluster_connection_pool
 Problem:
   A Sendbuffer on the connection between mysqld and the data nodes is protected
    by a Mutex.
   Connection threads in MySQL must acquire Mutex and the put data in SendBuffer.
   Many threads gives more contention on the mutex
   Must scale out with many MySQL Servers.

 Workaround:
   Ndb_cluster_connection_pool (in my.cnf) creates more connections from one
    mysqld to the data nodes
   Threads load balance on the connections gives less contention on mutex which in
    turn gives increased scalabilty
   Less MySQL Servers needed to drive load!
   www.severalnines.com/cluster-configurator allows you to specify the connection
    pool.
   >70 % improvement.

 Copyright Severalnines 2012
Ndb_cluster_connection_pool
 Gives atleast 70% better performance and a MySQL Server that
  can scale beyond four database connections.

 Set Ndb_cluster_connection_pool=2x<CPU cores>
    It is a good starting point
    One free [mysqld] slot is required in config.ini for each
     Ndb_cluster_connection.
 4 mysql servers,each with Ndb_cluster_connection_pool=8 requires 32
  [mysqld] in config.ini




  Copyright Severalnines 2012
Disk Data Tuning
 Disk Data Tables
    Un-indexed columns → tablespace on disk
    Indexed columns → DataMemory
 DiskPageBufferMemory (DPBM) – LRU page cache
    Like innodb_buffer_pool
    Should be big as possible
 If data not in DPBM                                      DiskPage
    Go to TS and fetch (Slow) IndexMemory DataMemory       Buffer
                                                           Memory
 If data is DPBM
                                               REDO LOG
    Return page (faster)                                 UNDO LOG
                                                      Tablespace

  Copyright Severalnines 2012
Disk Data Tuning
 DiskPageBufferMemory
     –   Hit ratio (derived from ndbinfo.diskpagebuffer):
     –   1000*page_requests_direct_return/
         (page_requests_direct_return +
          page_requests_wait_io+
          page_requests_wait_queue)
     –   998 is good (like in innodb).
     –   DiskPageBufferMemory=2048M is a good start
                                                                 DiskPage
                                   IndexMemory DataMemory         Buffer
                                                                 Memory

                                                   REDO LOG
                                                                UNDO LOG
                                                            Tablespace

 Copyright Severalnines 2012
Disk Data Tuning
 UNDO LOG
     –   Always overseen but can be extended overtime
     –   Set it to 50% of the REDO log size :
           ●   0.5 x NoOfFragmentLogFiles x FragmentLogFileSize
 Undo buffer (specd in CREATE LOGFILE GROUP)
     –   32M to 64M (like the RedoBuffer)
     –   SharedGlobalMemory=512M
                                                                  DiskPage
                                  IndexMemory DataMemory           Buffer
                                                                  Memory

                                                REDO LOG
                                                                  UNDO LOG
                                                          Tablespace

 Copyright Severalnines 2012
More on Cluster
 Severalnines Forum
      –   http://support.severalnines.com/forums/20323398-mysql-cluster

 Johan Andersson @ blogspot
      –   http://johanandersson.blogspot.com
 Configuration and Deployment
      –   http://www.severalnines.com/cluster-configurator
      –   ~20 min to deploy a 4 node cluster (288 seconds is the
          World Record)
 Self-training
      –   http://severalnines.com/mysql-cluster-training



  Copyright Severalnines 2012
33




       Q&A


     Copyright Severalnines 2012
34




       Thank you for your time!

            johan@severalnines.com
            Twitter: @severalnines


     Copyright Severalnines 2012

More Related Content

What's hot

Introduction to Redis
Introduction to RedisIntroduction to Redis
Introduction to Redis
Dvir Volk
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
Dilip Reddy
 
Hadoop ecosystem
Hadoop ecosystemHadoop ecosystem
Hadoop ecosystem
Mohamed Ali Mahmoud khouder
 
Ceph Intro and Architectural Overview by Ross Turk
Ceph Intro and Architectural Overview by Ross TurkCeph Intro and Architectural Overview by Ross Turk
Ceph Intro and Architectural Overview by Ross Turk
buildacloud
 

What's hot (20)

Ozone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objectsOzone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objects
 
PostgreSQL worst practices, version FOSDEM PGDay 2017 by Ilya Kosmodemiansky
PostgreSQL worst practices, version FOSDEM PGDay 2017 by Ilya KosmodemianskyPostgreSQL worst practices, version FOSDEM PGDay 2017 by Ilya Kosmodemiansky
PostgreSQL worst practices, version FOSDEM PGDay 2017 by Ilya Kosmodemiansky
 
Hadoop And Their Ecosystem
 Hadoop And Their Ecosystem Hadoop And Their Ecosystem
Hadoop And Their Ecosystem
 
Introduction to Big Data & Hadoop Architecture - Module 1
Introduction to Big Data & Hadoop Architecture - Module 1Introduction to Big Data & Hadoop Architecture - Module 1
Introduction to Big Data & Hadoop Architecture - Module 1
 
Strongly Consistent Global Indexes for Apache Phoenix
Strongly Consistent Global Indexes for Apache PhoenixStrongly Consistent Global Indexes for Apache Phoenix
Strongly Consistent Global Indexes for Apache Phoenix
 
HBase.pptx
HBase.pptxHBase.pptx
HBase.pptx
 
MySQL Database Architectures - High Availability and Disaster Recovery Solution
MySQL Database Architectures - High Availability and Disaster Recovery SolutionMySQL Database Architectures - High Availability and Disaster Recovery Solution
MySQL Database Architectures - High Availability and Disaster Recovery Solution
 
Introduction to Redis
Introduction to RedisIntroduction to Redis
Introduction to Redis
 
Ceph - A distributed storage system
Ceph - A distributed storage systemCeph - A distributed storage system
Ceph - A distributed storage system
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
 
Hadoop ecosystem
Hadoop ecosystemHadoop ecosystem
Hadoop ecosystem
 
Introduction to apache zoo keeper
Introduction to apache zoo keeper Introduction to apache zoo keeper
Introduction to apache zoo keeper
 
Hadoop
HadoopHadoop
Hadoop
 
Understanding oracle rac internals part 1 - slides
Understanding oracle rac internals   part 1 - slidesUnderstanding oracle rac internals   part 1 - slides
Understanding oracle rac internals part 1 - slides
 
HBase for Architects
HBase for ArchitectsHBase for Architects
HBase for Architects
 
Large Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphLarge Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraph
 
Ceph Introduction 2017
Ceph Introduction 2017  Ceph Introduction 2017
Ceph Introduction 2017
 
Ceph Intro and Architectural Overview by Ross Turk
Ceph Intro and Architectural Overview by Ross TurkCeph Intro and Architectural Overview by Ross Turk
Ceph Intro and Architectural Overview by Ross Turk
 
Raid
RaidRaid
Raid
 
Block Storage For VMs With Ceph
Block Storage For VMs With CephBlock Storage For VMs With Ceph
Block Storage For VMs With Ceph
 

Viewers also liked

Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous AvailabilityRamp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Pythian
 
Structural Design pattern - Adapter
Structural Design pattern - AdapterStructural Design pattern - Adapter
Structural Design pattern - Adapter
Manoj Kumar
 
MySQL Query And Index Tuning
MySQL Query And Index TuningMySQL Query And Index Tuning
MySQL Query And Index Tuning
Manikanda kumar
 

Viewers also liked (20)

MySQL Cluster performance best practices
MySQL Cluster performance best practicesMySQL Cluster performance best practices
MySQL Cluster performance best practices
 
What's new in MySQL Cluster 7.4 webinar charts
What's new in MySQL Cluster 7.4 webinar chartsWhat's new in MySQL Cluster 7.4 webinar charts
What's new in MySQL Cluster 7.4 webinar charts
 
MySQL Cluster Basics
MySQL Cluster BasicsMySQL Cluster Basics
MySQL Cluster Basics
 
MySQL High Availability Solutions - Feb 2015 webinar
MySQL High Availability Solutions - Feb 2015 webinarMySQL High Availability Solutions - Feb 2015 webinar
MySQL High Availability Solutions - Feb 2015 webinar
 
Severalnines Self-Training: MySQL® Cluster - Part VII
Severalnines Self-Training: MySQL® Cluster - Part VIISeveralnines Self-Training: MySQL® Cluster - Part VII
Severalnines Self-Training: MySQL® Cluster - Part VII
 
MySQL Tech Tour 2015 - Progettare, installare e configurare MySQL Cluster
MySQL Tech Tour 2015 - Progettare, installare e configurare MySQL ClusterMySQL Tech Tour 2015 - Progettare, installare e configurare MySQL Cluster
MySQL Tech Tour 2015 - Progettare, installare e configurare MySQL Cluster
 
MySQL user camp march 11th 2016
MySQL user camp march 11th 2016MySQL user camp march 11th 2016
MySQL user camp march 11th 2016
 
MySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and ScalabilityMySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and Scalability
 
FOSSASIA 2015: MySQL Group Replication
FOSSASIA 2015: MySQL Group ReplicationFOSSASIA 2015: MySQL Group Replication
FOSSASIA 2015: MySQL Group Replication
 
MySQL InnoDB Cluster and NDB Cluster
MySQL InnoDB Cluster and NDB ClusterMySQL InnoDB Cluster and NDB Cluster
MySQL InnoDB Cluster and NDB Cluster
 
FOSDEM 2015 - NoSQL and SQL the best of both worlds
FOSDEM 2015 - NoSQL and SQL the best of both worldsFOSDEM 2015 - NoSQL and SQL the best of both worlds
FOSDEM 2015 - NoSQL and SQL the best of both worlds
 
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous AvailabilityRamp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
Ramp-Tutorial for MYSQL Cluster - Scaling with Continuous Availability
 
Galera Cluster for MySQL vs MySQL (NDB) Cluster: A High Level Comparison
Galera Cluster for MySQL vs MySQL (NDB) Cluster: A High Level Comparison Galera Cluster for MySQL vs MySQL (NDB) Cluster: A High Level Comparison
Galera Cluster for MySQL vs MySQL (NDB) Cluster: A High Level Comparison
 
MySQL User Camp: GTIDs
MySQL User Camp: GTIDsMySQL User Camp: GTIDs
MySQL User Camp: GTIDs
 
MySQL User Camp: MySQL Cluster
MySQL User Camp: MySQL ClusterMySQL User Camp: MySQL Cluster
MySQL User Camp: MySQL Cluster
 
Open source India - MySQL Labs: Multi-Source Replication
Open source India - MySQL Labs: Multi-Source ReplicationOpen source India - MySQL Labs: Multi-Source Replication
Open source India - MySQL Labs: Multi-Source Replication
 
Structural Design pattern - Adapter
Structural Design pattern - AdapterStructural Design pattern - Adapter
Structural Design pattern - Adapter
 
MySQL High Availability with Group Replication
MySQL High Availability with Group ReplicationMySQL High Availability with Group Replication
MySQL High Availability with Group Replication
 
MySQL Query And Index Tuning
MySQL Query And Index TuningMySQL Query And Index Tuning
MySQL Query And Index Tuning
 
MySQL 5.7 + JSON
MySQL 5.7 + JSONMySQL 5.7 + JSON
MySQL 5.7 + JSON
 

Similar to Conference slides: MySQL Cluster Performance Tuning

MySQL 5.6 - Operations and Diagnostics Improvements
MySQL 5.6 - Operations and Diagnostics ImprovementsMySQL 5.6 - Operations and Diagnostics Improvements
MySQL 5.6 - Operations and Diagnostics Improvements
Morgan Tocker
 
Mysql User Camp : 20th June - Mysql New Features
Mysql User Camp : 20th June - Mysql New FeaturesMysql User Camp : 20th June - Mysql New Features
Mysql User Camp : 20th June - Mysql New Features
Tarique Saleem
 

Similar to Conference slides: MySQL Cluster Performance Tuning (20)

MySQL Cluster 7.3 Performance Tuning - Severalnines Slides
MySQL Cluster 7.3 Performance Tuning - Severalnines SlidesMySQL Cluster 7.3 Performance Tuning - Severalnines Slides
MySQL Cluster 7.3 Performance Tuning - Severalnines Slides
 
RDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and PatternsRDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and Patterns
 
20141011 my sql clusterv01pptx
20141011 my sql clusterv01pptx20141011 my sql clusterv01pptx
20141011 my sql clusterv01pptx
 
MySQL Cluster Performance Tuning - 2013 MySQL User Conference
MySQL Cluster Performance Tuning - 2013 MySQL User ConferenceMySQL Cluster Performance Tuning - 2013 MySQL User Conference
MySQL Cluster Performance Tuning - 2013 MySQL User Conference
 
Vote NO for MySQL
Vote NO for MySQLVote NO for MySQL
Vote NO for MySQL
 
NoSQL and MySQL
NoSQL and MySQLNoSQL and MySQL
NoSQL and MySQL
 
Apache Ignite: In-Memory Hammer for Your Data Science Toolkit
Apache Ignite: In-Memory Hammer for Your Data Science ToolkitApache Ignite: In-Memory Hammer for Your Data Science Toolkit
Apache Ignite: In-Memory Hammer for Your Data Science Toolkit
 
MySQL no Paypal Tesla e Uber
MySQL no Paypal Tesla e UberMySQL no Paypal Tesla e Uber
MySQL no Paypal Tesla e Uber
 
Intro to Azure SQL database
Intro to Azure SQL databaseIntro to Azure SQL database
Intro to Azure SQL database
 
MySQL HA
MySQL HAMySQL HA
MySQL HA
 
Solving performance problems in MySQL without denormalization
Solving performance problems in MySQL without denormalizationSolving performance problems in MySQL without denormalization
Solving performance problems in MySQL without denormalization
 
Akiban Technologies: Renormalize
Akiban Technologies: RenormalizeAkiban Technologies: Renormalize
Akiban Technologies: Renormalize
 
Akiban Technologies: Renormalize
Akiban Technologies: RenormalizeAkiban Technologies: Renormalize
Akiban Technologies: Renormalize
 
Breakthrough performance with MySQL Cluster (2012)
Breakthrough performance with MySQL Cluster (2012)Breakthrough performance with MySQL Cluster (2012)
Breakthrough performance with MySQL Cluster (2012)
 
Slides: Introducing the new ClusterControl 1.2.10 for MySQL, MongoDB and Post...
Slides: Introducing the new ClusterControl 1.2.10 for MySQL, MongoDB and Post...Slides: Introducing the new ClusterControl 1.2.10 for MySQL, MongoDB and Post...
Slides: Introducing the new ClusterControl 1.2.10 for MySQL, MongoDB and Post...
 
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
 
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...
 
MySQL 5.6 - Operations and Diagnostics Improvements
MySQL 5.6 - Operations and Diagnostics ImprovementsMySQL 5.6 - Operations and Diagnostics Improvements
MySQL 5.6 - Operations and Diagnostics Improvements
 
Mysql User Camp : 20th June - Mysql New Features
Mysql User Camp : 20th June - Mysql New FeaturesMysql User Camp : 20th June - Mysql New Features
Mysql User Camp : 20th June - Mysql New Features
 
Mysql User Camp : 20-June-14 : Mysql New features and NoSQL Support
 Mysql User Camp : 20-June-14 : Mysql New features and NoSQL Support Mysql User Camp : 20-June-14 : Mysql New features and NoSQL Support
Mysql User Camp : 20-June-14 : Mysql New features and NoSQL Support
 

More from Severalnines

Webinar slides: How to Migrate from Oracle DB to MariaDB
Webinar slides: How to Migrate from Oracle DB to MariaDBWebinar slides: How to Migrate from Oracle DB to MariaDB
Webinar slides: How to Migrate from Oracle DB to MariaDB
Severalnines
 
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControl
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControlWebinar slides: How to Automate & Manage PostgreSQL with ClusterControl
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControl
Severalnines
 
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
Severalnines
 
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
Severalnines
 
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
Severalnines
 
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQLWebinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Severalnines
 
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDB
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDBWebinar slides: Migrating to Galera Cluster for MySQL and MariaDB
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDB
Severalnines
 
Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?
Severalnines
 
Webinar slides: Designing Open Source Databases for High Availability
Webinar slides: Designing Open Source Databases for High AvailabilityWebinar slides: Designing Open Source Databases for High Availability
Webinar slides: Designing Open Source Databases for High Availability
Severalnines
 

More from Severalnines (20)

Cloud's future runs through Sovereign DBaaS
Cloud's future runs through Sovereign DBaaSCloud's future runs through Sovereign DBaaS
Cloud's future runs through Sovereign DBaaS
 
Tips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloudTips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloud
 
Working with the Moodle Database: The Basics
Working with the Moodle Database: The BasicsWorking with the Moodle Database: The Basics
Working with the Moodle Database: The Basics
 
SysAdmin Working from Home? Tips to Automate MySQL, MariaDB, Postgres & MongoDB
SysAdmin Working from Home? Tips to Automate MySQL, MariaDB, Postgres & MongoDBSysAdmin Working from Home? Tips to Automate MySQL, MariaDB, Postgres & MongoDB
SysAdmin Working from Home? Tips to Automate MySQL, MariaDB, Postgres & MongoDB
 
(slides) Polyglot persistence: utilizing open source databases as a Swiss poc...
(slides) Polyglot persistence: utilizing open source databases as a Swiss poc...(slides) Polyglot persistence: utilizing open source databases as a Swiss poc...
(slides) Polyglot persistence: utilizing open source databases as a Swiss poc...
 
Webinar slides: How to Migrate from Oracle DB to MariaDB
Webinar slides: How to Migrate from Oracle DB to MariaDBWebinar slides: How to Migrate from Oracle DB to MariaDB
Webinar slides: How to Migrate from Oracle DB to MariaDB
 
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControl
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControlWebinar slides: How to Automate & Manage PostgreSQL with ClusterControl
Webinar slides: How to Automate & Manage PostgreSQL with ClusterControl
 
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
Webinar slides: How to Manage Replication Failover Processes for MySQL, Maria...
 
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
Webinar slides: Backup Management for MySQL, MariaDB, PostgreSQL & MongoDB wi...
 
Disaster Recovery Planning for MySQL & MariaDB
Disaster Recovery Planning for MySQL & MariaDBDisaster Recovery Planning for MySQL & MariaDB
Disaster Recovery Planning for MySQL & MariaDB
 
MariaDB Performance Tuning Crash Course
MariaDB Performance Tuning Crash CourseMariaDB Performance Tuning Crash Course
MariaDB Performance Tuning Crash Course
 
Performance Tuning Cheat Sheet for MongoDB
Performance Tuning Cheat Sheet for MongoDBPerformance Tuning Cheat Sheet for MongoDB
Performance Tuning Cheat Sheet for MongoDB
 
Advanced MySql Data-at-Rest Encryption in Percona Server
Advanced MySql Data-at-Rest Encryption in Percona ServerAdvanced MySql Data-at-Rest Encryption in Percona Server
Advanced MySql Data-at-Rest Encryption in Percona Server
 
Polyglot Persistence Utilizing Open Source Databases as a Swiss Pocket Knife
Polyglot Persistence Utilizing Open Source Databases as a Swiss Pocket KnifePolyglot Persistence Utilizing Open Source Databases as a Swiss Pocket Knife
Polyglot Persistence Utilizing Open Source Databases as a Swiss Pocket Knife
 
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ...
 
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQLWebinar slides: An Introduction to Performance Monitoring for PostgreSQL
Webinar slides: An Introduction to Performance Monitoring for PostgreSQL
 
Webinar slides: Our Guide to MySQL & MariaDB Performance Tuning
Webinar slides: Our Guide to MySQL & MariaDB Performance TuningWebinar slides: Our Guide to MySQL & MariaDB Performance Tuning
Webinar slides: Our Guide to MySQL & MariaDB Performance Tuning
 
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDB
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDBWebinar slides: Migrating to Galera Cluster for MySQL and MariaDB
Webinar slides: Migrating to Galera Cluster for MySQL and MariaDB
 
Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?
 
Webinar slides: Designing Open Source Databases for High Availability
Webinar slides: Designing Open Source Databases for High AvailabilityWebinar slides: Designing Open Source Databases for High Availability
Webinar slides: Designing Open Source Databases for High Availability
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Conference slides: MySQL Cluster Performance Tuning

  • 1. Performance Tuning of MySQL Cluster Santa Clara, April 2012 Johan Andersson Severalnines AB johan@severalnines.com Cell +46 73 073 60 99 Copyright Severalnines 2012
  • 2. 2 Agenda  Scaling and Partitioning  Designing a Scalable System  Insert Performance Tuning  Query Tuning  Random tricks  Disk Data Tuning Copyright Severalnines 2012
  • 3. 3 Here is ... Access Layer App App Server Server MYSQL MYSQL STORAGE LAYER DATA DATA NODE NODE P0 P1 Node group 0 Copyright Severalnines 2012
  • 4. 4 It can scale linearly ... Access Layer App App App App App App App App Server Server Server Server Server Server Server Server MYSQL MYSQL MYSQL MYSQL MYSQL NDBAPI NDBAPI NDBAPI STORAGE LAYER STORAGE LAYER STORAGE LAYER DATA DATA DATA DATA DATA DATA NODE NODE NODE NODE NODE NODE P0 P1 P2 P3 P4 P5 Node group 0 Node group 1 Node group 2 Copyright Severalnines 2012
  • 5. if you find the bottlenecks  A lot of CPU is used on the data nodes  Probably a lot of large index scans and full table scans are used.  A lot of CPU is used on the mysql servers  Probably a lot of GROUP BY/DISTINCT or aggregate functions.  Hardly no CPU is used on either mysql or data nodes  Probably low load  Time is spent on network (a lot of “ping pong” to satisfy a request).  System is running slow in general  Disks (io util), queries, swap (should never happen) Copyright Severalnines 2012
  • 6. and if you know how  Adding mysqlds – trivial – if the mysqld is the bottleneck  BUT! Adding data nodes  More data nodes does not automatically give better performance ● Latency may increase for a single query ● Total throughout will be improved  How to get both? Copyright Severalnines 2012
  • 7. 7 Designing a Scalable System  Define the most typical Use Cases  List all my friends, session management etc etc.  Optimize everything for the typical use case  Keep it simple  Complex access patterns does not scale  Simple access patterns do ( Primay key and Partitioned Index Scans )  Note! There is no parameter in config.ini that affects performance – only availability.  Everything is about the Schema and the Queries.  Tune the mysql servers (sort buffers etc) as you would for innodb. Copyright Severalnines 2012
  • 8. 8 Simple Access  PRIMARY KEY lookups are HASH lookup O(1)  INDEX searches a T-tree and takes O(log n) time.  In 7.2 JOINs are ok, but in 7.1 you should try to avoid them. Copyright Severalnines 2012
  • 9. 9 Data Access Access Layer App App App App App App App App Server Server Server Server Server Server Server Server MYSQL MYSQL MYSQL MYSQL MYSQL NDBAPI NDBAPI NDBAPI STORAGE LAYER DATA DATA DATA DATA DATA DATA NODE NODE NODE NODE NODE NODE P0 P1 P2 P3 P4 P5 Node group 0 Node group 1 Node group 2 Copyright Severalnines 2012
  • 10. 10 Data Access  One Request can hit all Partitions  Sub-optimal and won't scale Copyright Severalnines 2012
  • 11. 11 Data Access Access Layer App App App App App App App App Server Server Server Server Server Server Server Server MYSQL MYSQL MYSQL MYSQL MYSQL NDBAPI NDBAPI NDBAPI STORAGE LAYER DATA DATA DATA DATA DATA DATA NODE NODE NODE NODE NODE NODE P0 P1 P2 P3 P4 P5 Node group 0 Node group 1 Node group 2 Copyright Severalnines 2012
  • 12. 12 Data Access  One Request hits one partition  Scales!  The number of Partitions (data nodes) does not matter!  Partitioning is important! Copyright Severalnines 2012
  • 13. 13 Partitioning  MySQL Cluster auto-partitions based on the Primary Key  Data is spread randomly  If possible better to Partition on a part of the Primary Key  CREATE TABLE user_friends( userid, friendid , somedata, PRIMARY KEY (userid, friendid)) PARTITION BY KEY(userid)  All records with userid=X will be stored in the same partition!  Ultra important for MySQL Cluster 7.2 and Fast JOINs. Copyright Severalnines 2012
  • 14. 14 Partitioning  EXPLAIN PARTITIONS <query>  Tells you what partitions you touch.  Also verify with: mysql> show global status like 'ndb_pruned_scan_count’; +-----------------------+-------+ | Variable_name | Value | +-----------------------+-------+ | Ndb_pruned_scan_count | 1 | +-----------------------+-------+  Increases when Partition Pruning could be used. Copyright Severalnines 2012
  • 15. 15 Insert Performance  Scaling Inserts  Option 1) Batch INSERTS if you can ● An insert batch of 10 records will perform 10x faster than 10 single inserts! ● INSERT INTO t1 VALUES (<record1>), (<record2>), …,(<recordN>)  Option 2) Many threads (parallelism)  Or a combo of both  Dumpfiles or LOAD DATA INFILEs  Chunk them up and load in parallel on several mysqlds Copyright Severalnines 2012
  • 16. 16 Insert Performance  INSERTs in a table with AUTO_INCREMENT  MySQL Server query Data nodes for an auto_increment – The mysqld can hold a range of autoincs (cache) – Before an INSERT, and autoinc must be fetched from either Data node (slow) or on the cache (fast)  ndb_autoincremet_prefetch_sz sets the cache size and it affects insert perf: – ndb_autoincrement_prefetch_sz=1: 1211.91TPS – ndb_autoincrement_prefetch_sz=256: 3471.71TPS – ndb_autoincrement_prefetch_sz=1024: 3659.52TPS Copyright Severalnines 2012
  • 17. 17 Insert Performance  ndb_batch_size can also be important with LOAD DATA INFILE or dumps  SET GLOBAL|SESSION NDB_BATCH_SIZE = 16M  You may get LongMessageBuffer Overload ● Increase it in config.ini to 32M or 48M  Also REDO logs may get overloaded if your disks are too slow and/or the REDO is too small. Copyright Severalnines 2012
  • 18. 18 Query Performance  Queries needs to be tuned as “usual”:  Slow query / general log  From a monitoring system (like ClusterControl)  + a methodology Copyright Severalnines 2012
  • 19. Query Performance  Slow query log  set global slow_query_log=1;  set global long_query_time=0.01;  set global log_queries_not_using_indexes=1;  General log (if you don’t get enough info in the Slow Query Log)  Activate for a very short period of time (30-60seconds) – intrusive  Can fill up disk very fast – make sure you turn it off.  set global general_log=1;  Use Severalnines ClusterControl  Includes a Cluster-wide Query Monitor.  Query frequency, EXPLAINs, lock time etc.  Performance Monitor and Manager. Copyright Severalnines 2012
  • 20. Data Types  BLOB/TEXT columns are stored in an external hidden table.  First 255B are stored inline in main table  Reading a BLOB/TEXT requires two reads  One for reading the Main table + reading from hidden table  Change to VARBINARY/VARCHAR if:  Your BLOB/TEXTs can fit within an 8052B record  (record size is currently 8052 Bytes)  Reading/writing VARCHAR/VARBINARY is less expensive Note 1: BLOB/TEXT are also more expensive in Innodb as BLOB/TEXT data is not inlined with the table. Thus, two disk seeks are needed to read a BLOB. Note 2: Store images, movies etc outside the database on the filesystem. Copyright Severalnines 2012
  • 21. Data Types  Example CREATE TABLE `t1_blob` ( `id` int(11) NOT NULL AUTO_INCREMENT, `data1` blob, `data2` blob, PRIMARY KEY (`id`) )ENGINE=ndbcluster  Performance (8 threads, one mysqld, two data nodes):  data1 and data2 as BLOBs: 5844TPS  data1 and data2 as VARBINARY: 19206TPS  ~3x Copyright Severalnines 2012
  • 22. Denormalize  Tables sharing the same PRIMARY KEY can be denormalized.  Table T1: <UID, SOME_DATA>  Table T2: <UID, SOME_OTHER_DATA  SELECT * from T1,T2 WHERE T1.UID=T2.UID and T2.UID=1 requires two roundtrips.  Starting with MySQL Cluster 7.2 only one roundtrip is needed,.  Denormalize  Table T12: <UID,SOME_DATA, SOME_OTHER_DATA>  Improvement: 2X in throughput Copyright Severalnines 2012
  • 23. Query Tuning < 7.2  Don't trust the OPTIMIZER in MySQL Cluster 7.1 and earlier  Statistics gathering is non-existing  Optimizer thinks there are only 10 rows to examine in each table!  You have to do a lot of  FORCE INDEX / STRAIGH_JOIN to get queries run the way you want. Copyright Severalnines 2012
  • 24. Query Tuning < 7.2  Classic example: if you have two similar indexes:  index(a)  index(a,ts) on the following table CREATE TABLE `t1` ( `id` int(11) NOT NULL AUTO_INCREMENT, `a` bigint(20) DEFAULT NULL, `ts` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP, PRIMARY KEY (`id`), KEY `idx_t1_a` (`a`), KEY `idx_t1_a_ts` (`a`,`ts`)) ENGINE=ndbcluster DEFAULT CHARSET=latin1 Copyright Severalnines 2012
  • 25. Query Tuning < 7.2 mysql> explain select * from t1 where a=2 and ts='2011-10-05 15:32:11'; +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+ | 1 | SIMPLE | t1 | ref | idx_t1_a,idx_t1_a_ts | idx_t1_a | 9 | const | 10 | Using where | +----+-------------+-------+------+----------------------+----------+---------+-------+------+-------------+  Use FORCE INDEX(..) ... mysql> explain select * from t1 FORCE INDEX (idx_t1_a_ts) where a=2 and ts='2011-10-05 15:32:11; +| 1 | SIMPLE | t1 | ref | idx_t1_a_ts | idx_t1_a_ts | 13 | const,const | 10 | Using where | 1 row in set (0.00 sec)  ..to ensure the correct index is picked!  The difference can be 1 record read instead of any number of records! Copyright Severalnines 2012
  • 26. 26 Query Tuning in 7.2  ANALYZE TABLE  Must be performed periodically to rebuild index stats  EXPLAIN EXTENDED/PARTITIONS  Make sure the explain show “Child of JOIN pushed down”  This means that the Fast JOIN of NDB could be used  SHOW WARNINGS; ● Shows why a Query was not pushed down. Copyright Severalnines 2012
  • 27. Ndb_cluster_connection_pool  Problem:  A Sendbuffer on the connection between mysqld and the data nodes is protected by a Mutex.  Connection threads in MySQL must acquire Mutex and the put data in SendBuffer.  Many threads gives more contention on the mutex  Must scale out with many MySQL Servers.  Workaround:  Ndb_cluster_connection_pool (in my.cnf) creates more connections from one mysqld to the data nodes  Threads load balance on the connections gives less contention on mutex which in turn gives increased scalabilty  Less MySQL Servers needed to drive load!  www.severalnines.com/cluster-configurator allows you to specify the connection pool.  >70 % improvement. Copyright Severalnines 2012
  • 28. Ndb_cluster_connection_pool  Gives atleast 70% better performance and a MySQL Server that can scale beyond four database connections.  Set Ndb_cluster_connection_pool=2x<CPU cores>  It is a good starting point  One free [mysqld] slot is required in config.ini for each Ndb_cluster_connection.  4 mysql servers,each with Ndb_cluster_connection_pool=8 requires 32 [mysqld] in config.ini Copyright Severalnines 2012
  • 29. Disk Data Tuning  Disk Data Tables  Un-indexed columns → tablespace on disk  Indexed columns → DataMemory  DiskPageBufferMemory (DPBM) – LRU page cache  Like innodb_buffer_pool  Should be big as possible  If data not in DPBM DiskPage  Go to TS and fetch (Slow) IndexMemory DataMemory Buffer Memory  If data is DPBM REDO LOG  Return page (faster) UNDO LOG Tablespace Copyright Severalnines 2012
  • 30. Disk Data Tuning  DiskPageBufferMemory – Hit ratio (derived from ndbinfo.diskpagebuffer): – 1000*page_requests_direct_return/ (page_requests_direct_return + page_requests_wait_io+ page_requests_wait_queue) – 998 is good (like in innodb). – DiskPageBufferMemory=2048M is a good start DiskPage IndexMemory DataMemory Buffer Memory REDO LOG UNDO LOG Tablespace Copyright Severalnines 2012
  • 31. Disk Data Tuning  UNDO LOG – Always overseen but can be extended overtime – Set it to 50% of the REDO log size : ● 0.5 x NoOfFragmentLogFiles x FragmentLogFileSize  Undo buffer (specd in CREATE LOGFILE GROUP) – 32M to 64M (like the RedoBuffer) – SharedGlobalMemory=512M DiskPage IndexMemory DataMemory Buffer Memory REDO LOG UNDO LOG Tablespace Copyright Severalnines 2012
  • 32. More on Cluster  Severalnines Forum – http://support.severalnines.com/forums/20323398-mysql-cluster  Johan Andersson @ blogspot – http://johanandersson.blogspot.com  Configuration and Deployment – http://www.severalnines.com/cluster-configurator – ~20 min to deploy a 4 node cluster (288 seconds is the World Record)  Self-training – http://severalnines.com/mysql-cluster-training Copyright Severalnines 2012
  • 33. 33 Q&A Copyright Severalnines 2012
  • 34. 34 Thank you for your time! johan@severalnines.com Twitter: @severalnines Copyright Severalnines 2012