7. How a Startup Gets Started
● Pick something and go with it
● Make mistakes along the way
● Correct the mistakes you can
● Work around the ones you can’t
61. Router Architecture
Single mongos per client problems we encountered:
● thousands of connections to config servers
● config server CPU load
● configdb propagation delays
72. Router Architecture
Separate mongos tier advantages:
● greatly reduced number of connections to each mongod
● far fewer hosts talking to the config servers
● much faster configdb propagation
Disadvantages:
● additional network hop
● host failure has a larger effect
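For illustration, here is a minimal sketch of what that change looks like from the driver's point of view (pymongo; the hostnames and database names are hypothetical, not from the deck):

```python
# Sketch of the client-side difference between the two router architectures.
# Hostnames and namespaces are hypothetical.
from pymongo import MongoClient

# mongos-per-host: every application server talks to its own local router
local_client = MongoClient("mongodb://localhost:27017/")

# dedicated mongos tier: point the driver at several routers; the driver
# spreads requests across the reachable mongos hosts and fails over if one dies
tier_client = MongoClient(
    "mongodb://mongos-a1.internal:27017,"
    "mongos-a2.internal:27017,"
    "mongos-a3.internal:27017/"
)

# Queries look identical either way; only the routing path changes.
print(tier_client.appdata.crashes.find_one())
```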
82. Router Architecture - Evolve!
[Diagram: client application servers, each running a local mongos process, connected to a MongoDB cluster of replica sets of mongod servers]
Maybe at first, doing the mongos-per-host architecture is fine.
83. Router Architecture - Evolve!
[Same diagram: application servers with local mongos processes connected to the MongoDB cluster]
Maybe at first, doing the mongos-per-host architecture is fine. And it will probably remain fine for quite a while.
84. Router Architecture - Evolve!
[Diagram: client application servers connected to a dedicated mongos router tier, which connects to the MongoDB cluster of replica sets]
This is an area where you can and should be willing to adapt as you go (and as needed).
147. The Balancing Act
Why wouldn’t you run the balancer in the first place?
● great question
● for us, it’s because we deleted some old data at one point, and left a bunch of holes
○ we turned it off while deleting this data
○ and then were unable to turn it back on
● but maybe you start without it
● or maybe you need to turn it off for maintenance and forget to turn it back on
Obviously, don’t do this. But if you do, here’s what happens...
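For reference, here is roughly how the balancer gets toggled off and back on. This sketch uses the config.settings flag that 2.x-era clusters like ours relied on; newer versions also offer the balancerStop/balancerStart admin commands and the sh.stopBalancer()/sh.startBalancer() shell helpers. The hostname is hypothetical.

```python
# Sketch of turning the balancer off and back on via the config database flag
# (the approach used on 2.x-era clusters). Hostname is hypothetical.
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-a1.internal:27017/")

def stop_balancer():
    client.config.settings.update_one(
        {"_id": "balancer"}, {"$set": {"stopped": True}}, upsert=True
    )

def start_balancer():
    client.config.settings.update_one(
        {"_id": "balancer"}, {"$set": {"stopped": False}}, upsert=True
    )

def balancer_enabled():
    doc = client.config.settings.find_one({"_id": "balancer"}) or {}
    return not doc.get("stopped", False)

stop_balancer()    # ...do your deletes / maintenance...
start_balancer()   # and don't forget this part
print(balancer_enabled())
```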
190. The Balancing Act
So what can we do?
1. add IOPS
2. make sure your config servers have plenty of CPU (and IOPS)
3. slowly move chunks manually
4. approach a balanced state
5. hold your breath
6. try re-enabling the balancer
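A quick way to gauge how far from balanced you are is to count chunks per shard in the config database. A rough sketch (namespace and hostname are hypothetical; the "ns" field matches the older config schema):

```python
# Sketch: count chunks per shard to see how unbalanced the cluster is.
from collections import Counter
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-a1.internal:27017/")

counts = Counter(
    chunk["shard"]
    for chunk in client.config.chunks.find({"ns": "appdata.crashes"}, {"shard": 1})
)
for shard, n in counts.most_common():
    print(shard, n)
```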
191. The Balancing Act
How to manually balance:
1. determine a chunk on a hot shard
2. monitor effects on both the source and target shards
3. move the chunk
4. allow the system to settle
5. repeat
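A single manual move looks roughly like this (the shell helper is sh.moveChunk(); the namespace, shard-key value, and destination shard name below are hypothetical):

```python
# Sketch of one manual chunk move (shell equivalent: sh.moveChunk()).
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-a1.internal:27017/")

result = client.admin.command(
    "moveChunk",
    "appdata.crashes",                 # the sharded namespace
    find={"app_id": "some-hot-app"},   # any document inside the chunk to move
    to="rs4",                          # the (less loaded) destination shard
)
print(result)
# Watch iostat / mongostat on both shards and let things settle before repeating.
```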
193. Summary
● Design ahead of time
○ “NoSQL” lets you play it by ear
○ but some of these decisions will bite you later
● Be willing to correct past mistakes
○ dedicate time and resources to adapting
○ learn how to live with the mistakes you can’t correct
194. References
● MongoDB blog post (details on shard migration): http://blog.mongodb.org/post/77278906988/crittercism-scaling-to-billions-of-requests-per-day-on
● MongoDB webinar (details on manual chunk migrations): http://www.mongodb.com/presentations/webinar-back-basics-3-scaling-30000-requests-second-mongodb
● Documentation on mongos routers: http://docs.mongodb.org/master/core/sharded-cluster-query-routing/
● Documentation on the balancer: http://docs.mongodb.org/manual/tutorial/manage-sharded-cluster-balancer/
● Documentation on shard keys: http://docs.mongodb.org/manual/core/sharding-shard-key/
Crittercism: http://www.crittercism.com/ to learn more,
and http://www.crittercism.com/careers/ if you want to help us!
I’m Mike, I run Ops at Crittercism.
I’m going to tell you the story of how we’ve scaled to handle over 30k req/s using a storage strategy based on MongoDB
Between proposing this talk and now, we’ve actually grown some more, and now top 40-45k req/s on a daily basis
This is about 3.5B requests per day
This is really the story of learning as you go
I’ll tell you how Crittercism got started, some of the lessons we’ve learned along the way, and some advice we can share based on those experiences
some advice from our experience about things to do and things not to do
I’ll give you a brief overview of what we’re doing
some advice based on what we’ve learned related to router architecture
I’ll talk about some sharding considerations and the issues that can arise
I’ll tell you a story about the Mongo Balancer
I’ll be sure to leave time for Q&A
First let me tell you a bit about who we are and the problem we’re trying to solve
so they made a dating app, which shall remain unnamed
and it went over about as well as the dating scene in The Social Network
poor star rating, and they didn’t know why
So they made a “feedback widget”, and pivoted
September 2010 (from Wayback Machine)
Enable mobile app developers to allow their users to provide “criticism” of their apps (outside of the app store)
Not just a star rating
October 2011
added crash reports to help improve ratings
now we’re the ones helping you self-criticize
added live stats to see app performance in real-time
now they’re happy
the dating app didn’t pan out, but in the process of making it better, we’ve come to provide something that helps everybody improve their apps
today (2014) - what it’s evolved into
collecting tons of detailed analytics data - crash reports, groupings
Geo data launched in 2013 (just kidding, this is stored in postgres)
API & iPad app launched in 2014 - more aggregations of performance data (more ways to view it)
this guy feels overwhelmed at times
so how do we deal with all of this?
so what do we do with all of this data?
we started by setting up a db (mongo, of course)
we’ve used mongo from the start
why mongo? it has RDBMS-like characteristics, both OLTP and warehouse-like properties, lots of flexibility, and it scales
put an ingest API in front of it
collect user feedback from our feedback widget SDK
then we start storing crash data in mongodb, too
but what makes crash data more useful is when you have app load data as well
-> crash rate (which is a differentiating feature for us)
you start catching more errors, but you still want to know about them
so let’s add handled exceptions as well
we realized crash reporting was really the product, so we discontinued the feedback widget
and our volume kept going up, especially app loads
app loads are the highest-volume component here, so let’s count them in a memory-based data store (redis), and batch up the writes before persisting the data to mongo
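very roughly, the counting-and-batching idea looks like this (key names, collection names, and hosts are all hypothetical):

```python
# Rough shape of "count app loads in Redis, batch the writes into Mongo".
import redis
from pymongo import MongoClient

r = redis.Redis(host="redis.internal")
mongo = MongoClient("mongodb://mongos-a1.internal:27017/")

def record_app_load(app_id):
    # cheap in-memory increment on the hot ingest path
    r.incr("app_loads:%s" % app_id)

def flush_app_loads():
    # run periodically: fold the accumulated counts into MongoDB, one $inc per app
    for key in r.scan_iter("app_loads:*"):
        count = int(r.getset(key, 0) or 0)
        if count:
            app_id = key.decode().split(":", 1)[1]
            mongo.appdata.app_loads.update_one(
                {"app_id": app_id}, {"$inc": {"count": count}}, upsert=True
            )
```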
add user metadata as well, to help support desks
but that’s a different kind of data and a different volume and access pattern, so let’s add dynamodb into the mix
our volume keeps going up, so let’s cache this app data to make our responses faster
added APM, which introduced a lot of different data types and structures
so we added another ingest API and postgres into the mix (but obviously that’s not going to be part of this talk…)
so we’ve scaled to 40k/s by being willing to adapt incrementally, and willing to use whatever works / whatever it takes
2-year period
went from 700/s (60M/day)
to 40-45k/s (3.8B/day)
one of the biggest things we did to help ourselves scale was to consolidate the mongos routers
start with a sharded mongodb cluster
add your application servers
each application server has a local mongos process
each client process connects to a local mongos router
mongos routers talk to mongods to read and write data
mongos routes queries and returns results
the mongos knows where data resides thanks to the config servers, which keep track of the shard topology
(location of data throughout the cluster)
mongos routers talk to config servers as well, to maintain an updated version of the configdb
and the config servers also talk to the mongods
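if you want to see what the config servers are tracking, you can peek at the config database through any mongos; a rough sketch (hostname hypothetical, collection layout per the older config schema):

```python
# Sketch: peek at what the config servers track, through any mongos.
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-a1.internal:27017/")

print(list(client.config.shards.find()))   # the shards (replica sets) in the cluster
print(client.config.chunks.find_one())     # one chunk: namespace, min/max bounds, owning shard
print(client.admin.command("listShards"))  # the same shard list via an admin command
```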
now let’s zoom out a bit...
and you’re going to grow, so you’re going to add more and more application servers
and they’re all maintaining these connections between
their local mongos, the config servers, and the shard servers
(not showing all the lines here, but you get the idea)
all of this could mean your application is reading stale data, or can’t find the data it needs when it needs it (and maybe it has to retry, which means it’s now slower)
so we went from this...
to this
closer view
move the mongos routers to their own tier
be smart about how you route to them
(we use chef to keep it within the same AZ)
due to connection re-use from mongos to mongod
due to far fewer mongos processes
far fewer nodes for it to propagate to
be aware that this does introduce some disadvantages, too
we reduce this by keeping it in the same availability zone / data center
let’s look at what that implies
in the mongos-per-app-server setup, if one fails...
only that one application server is affected
but with a separate mongos tier, if one mongos router fails...
all app servers connected to it will be affected
so be aware of this, and take it into account
so maybe increase the number of mongos routers
(but still far fewer than you had before)
account for which % of your app servers going down you can tolerate
(also depends on what your driver allows you to do and how it behaves)
So it’s great to have aspects of your architecture that you can change over time.
But some things you can’t...
This is a fundamental design decision that will have huge implications for a long time, so think about it carefully.
Say you have 4 shards. Let’s say each of the World Cup teams has an app, and we shard by app_id.
Let’s distribute them evenly, as is likely to be the case.
Now, tomorrow the US and Germany are going to play each other
So those 2 apps are going to get heavy use, but they happen to be on the same shard, so uh-oh...
Now this shard isn’t happy
Higher load, more lock contention, slower response time for queries to this shard (which are your most common queries due to these apps’ popularity at this time)
So let’s add another shard (scale horizontally)...
That might help if we had more teams’ apps to add
Those new apps had somewhere to go, which is nice.
But this hasn’t helped our uneven access pattern at all.
So what else can we do? We can try scaling that shard vertically - by performing a migration procedure (see my blog post for details).
And hopefully it now cools off
But the next day there will be a different game... will those two teams’ apps be on different shards?
even if so, maybe now we have 2 pretty-hot shards instead of 1 super-hot one
so maybe you decide to just live with heterogeneous shard servers to manage (probably a much lesser evil than trying to re-shard)
We could shard on something other than app_id (for us, maybe that’d be crash_id, which is a randomly-generated hash)
and spread the data for each app across all shards
So now when the US vs Germany game happens tomorrow...
Now we’re reading a bit from many shards, rather than a lot from few shards
but now our queries will be a bit slower (due to having to read from many more shards)
so understand the trade-off
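for concreteness, declaring each of those two shard keys looks roughly like this (database, collection, and field names are hypothetical, and you’d pick exactly one per collection):

```python
# Sketch of the two shard-key choices discussed above.
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-a1.internal:27017/")
client.admin.command("enableSharding", "appdata")

# Option 1: shard by app_id -- each app's data stays together, so per-app
# queries hit a single shard, but one hot app means one hot shard.
client.admin.command("shardCollection", "appdata.crashes", key={"app_id": 1})

# Option 2: shard by a hashed, randomly distributed id -- load spreads evenly,
# but per-app queries now scatter-gather across every shard.
client.admin.command("shardCollection", "appdata.crashes", key={"crash_id": "hashed"})
```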
All of this is assuming that your cluster is balanced...
The balancer is a super-important part of a sharded mongo cluster… You should love it.
Start with an empty cluster, and start filling it with data
(we’ll denote “fullness” by going from green to red)
This is an example of what can happen when the balancer is not running
Okay, so now we have a very unbalanced cluster. 3 of our replica sets are very full, one is pretty full, and the newest one is hardly in use.
The balancer will see the full shards and one near-empty one, and will want to move a ton of chunks all at once, causing severe I/O strain on the system.
you’re going to be adding a lot of I/O to the system when you move chunks, and it still has to be able to perform its normal functions, so over-provision
updating the configdb (when you move chunks) puts load on your config servers, so make sure they’re ready to handle it
this is tedious and will take a LONG time (more detail in a minute)
gradually you’ll get to a happier place
take a deep breath before you...
be ready to turn it off and return to step 3 if needed, then try again
See MongoDB webinar I gave (in references) for details on this procedure
seems obvious, but not always the case
best-case scenario is to make all of the right choices up front… but you’re probably not going to do that. (though hopefully you can learn a bit from our experience and minimize the wrong choices you make).
the good news is MongoDB is still working for us, despite the headaches we’ve had to deal with.