Web Server Bottlenecks and Performance Tuning

Graham Dumpleton
PyCon – March 2012
Graham Dumpleton @GrahamDumpleton

"Starting my PyCon talk. Let's hope I don't lose my voice completely while doing this."

Follow along

✴http://www.slideshare.net/GrahamDumpleton
The big picture

Front end time: 3.1 seconds
Web application time: 0.15 seconds
Performance Golden Rule



    "80-90% of the end-user
 response time is spent on the
     frontend. Start there."

http://www.stevesouders.com/blog/2012/02/10/
        the-performance-golden-rule/
Application breakdown
Are benchmarks stupid?




http://nichol.as/benchmark-of-python-web-servers
Benchmarks as a tool


✴Web server benchmarks are of more
 value when used as an exploratory tool to
 understand how a specific system
 works, not to compare systems.
What about load tests?

✴Hitting a site with extreme load will only
 show you that it will likely fail under a
 denial of service attack.
✴A typical web server load test alone isn't
 going to help you understand how the web
 server is contributing to the failure.
What should you test?


✴You should use a range of purpose-built
 tests to trigger specific scenarios.
✴Use the tests to explore corner cases and
 not just the typical use case.
Environment factors


✴Amount of memory available.
✴Number of processors available.
✴Use of threads vs processes.
✴Python global interpreter lock (GIL).
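The GIL point is worth demonstrating: pure-Python CPU-bound work does not speed up across threads, which is why process counts matter for CPU-bound Python applications. A minimal sketch (the task size and thread count are arbitrary; timings are illustrative only):

```python
import time
from concurrent.futures import ThreadPoolExecutor

def busy(n):
    # pure-Python CPU-bound loop; the running thread holds the GIL
    total = 0
    for i in range(n):
        total += i * i
    return total

# Four CPU-bound tasks on four threads: because the GIL lets only one
# thread execute Python bytecode at a time, wall-clock time stays close
# to running the four tasks serially, despite the extra threads.
start = time.time()
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(busy, [200_000] * 4))
elapsed = time.time() - start
print(f"4 threads: {elapsed:.2f}s elapsed")
```

For I/O-bound work the picture reverses: the GIL is released during blocking I/O, so threads do provide useful concurrency while waiting on sockets or databases.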
Client impacts


✴Slow HTTP browsers/clients.
✴Browser keep-alive connections.
Application requirements




✴Need to handle static assets.
Use cases to explore

✴Memory used by web application.
✴Using processes versus threads.
✴Impacts of long running requests.
✴Restarting of server processes.
✴Startup costs and lazy loading.
Memory usage

[Chart: memory usage of the benchmarked servers, comparing a 1 process / 1000 threads configuration against 1 process / 1 thread.]

http://nichol.as/benchmark-of-python-web-servers
What affects memory use?


✴Web server base memory usage.
✴Web server per thread memory usage.
✴Application base memory usage.
 ✴Is application loaded prior to forking?
✴Per request transient memory usage.
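Per-request transient memory can be observed from inside the process with nothing but the standard library. A rough sketch (Unix only; note `ru_maxrss` is reported in kilobytes on Linux but bytes on macOS, and is a high-water mark, so it only ever grows):

```python
import resource

def peak_rss():
    # peak resident set size of this process so far
    # (kilobytes on Linux, bytes on macOS)
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss

before = peak_rss()
# simulate a request that builds a large transient response in memory
payload = ["x" * 1024 for _ in range(10_000)]  # roughly 10 MB of strings
after = peak_rss()
del payload
print(f"peak RSS grew by about {after - before} units")
```

Sampling this at the start and end of a request handler is a cheap way to spot which requests are responsible for a process's memory high-water mark.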
Processes Vs Threads

[Chart: number of processes (y-axis, 0-150) plotted against number of threads (x-axis, 0-150).]
Apache/mod_wsgi defaults

Configuration                             Max Processes   Threads
Apache (prefork) + mod_wsgi (embedded)         150            1
Apache (worker) + mod_wsgi (embedded)            6           25
Apache (prefork) + mod_wsgi (daemon)             1           15
Apache (worker) + mod_wsgi (daemon)              1           15
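The daemon-mode rows in the table come from mod_wsgi's `WSGIDaemonProcess` directive, which is also where you override the defaults. A sketch of the relevant Apache configuration (the application name and paths are placeholders):

```apache
# Daemon mode: a dedicated pool of 3 processes x 5 threads for this
# application, sized independently of how Apache scales its own workers.
WSGIDaemonProcess myapp processes=3 threads=5 display-name=%{GROUP}
WSGIProcessGroup myapp
WSGIScriptAlias / /srv/myapp/wsgi.py
```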
Other WSGI servers

Configuration             Max Processes   Threads
FASTCGI flup (prefork)         50            1
FASTCGI flup (threaded)         1            5
gunicorn                        1            1
uWSGI                           1            1
tornado                         1            1
Less than fair

[Chart: the default configurations plotted on the same processes (0-150) vs threads (0-150) axes: Apache (prefork) + mod_wsgi (embedded); FASTCGI flup (prefork); Apache (prefork/worker) + mod_wsgi (daemon); Apache (worker) + mod_wsgi (embedded).]
What to use?

✴Number of overall threads dictated by:
 ✴Number of concurrent users.
 ✴Response time for requests.
✴Processes preferred over threads, but:
 ✴Restricted by amount of memory.
 ✴Choice influenced by number of processors.
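The relationship between concurrent users, response time and thread count is just Little's law: average concurrency = throughput × response time. A quick sketch using numbers of the kind shown in the backlog slides (both values are assumptions):

```python
# Little's law: average number of in-flight requests equals
# arrival rate multiplied by time spent in the system.
throughput = 60       # requests per second (assumed)
response_time = 0.15  # seconds per request (assumed)

concurrent = throughput * response_time
print(f"about {concurrent:.0f} threads busy on average")
```

At 60 requests/second and ~150 ms responses, about 9 threads are busy on average, so a thread pool needs to be sized comfortably above that to absorb bursts without queueing.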
Thread utilisation

[Chart: number of busy threads (1-6) sampled over a 1 second interval.]
Request backlog

[Chart: at 60 requests per second with ~150 ms response times, a backlog occurred and queue time increased to 750 ms; thread utilisation jumped from 2.5 to 7.5 and maxed out at 9.]
Processes are better

[Chart: at 75 requests per second with ~100 ms response times, a backlog only started at higher throughput and queue time stayed mostly under 100 ms; thread utilisation only jumped from 2.5 to 7.5 at higher throughput and didn't actually reach 9.]
CPU bound

The bulk of the time is spent doing things within the process itself.
I/O wait

Waiting on responses from backend services makes up a significant proportion of the time.
Long running requests

✴Complex calculations.
✴Slow backend services.
✴Large file uploads.
✴Large responses.
✴Slow HTTP clients.
Varying request times




Average:   1385 ms
Minimum:   4.7 ms
Maximum:   20184 ms
Std Dev:   3896 ms
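With a distribution this skewed (standard deviation larger than the mean), percentiles tell you more than the average does. A sketch computing the same kind of summary from a sample of request times (the data here is made up):

```python
import statistics

# hypothetical request times in milliseconds for one transaction;
# a few very slow outliers drag the mean far above the median
times_ms = [5, 12, 80, 95, 110, 150, 160, 400, 1200, 9800]

mean = statistics.mean(times_ms)
median = statistics.median(times_ms)
# quantiles(n=100) returns 99 percentile cut points; index 94 is ~p95
p95 = statistics.quantiles(times_ms, n=100)[94]
print(f"mean {mean:.1f} ms, median {median:.1f} ms, p95 {p95:.1f} ms")
```

When the mean is an order of magnitude above the median, averaging hides the slow outliers that users actually notice; track the median and a high percentile side by side.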
Performance breakdown




Why is creating the connection to PostgreSQL
taking up 40% of the overall response time?
Slow HTTP clients

✴Add nginx as a front end to the WSGI server.
✴Brings the following benefits to the WSGI server.
 ✴Isolation from slow clients.
 ✴No need to handle keep-alive connections in the WSGI server.
 ✴Can offload serving of static files.
 ✴Can use X-Accel-Redirect for dynamically
    generated files.
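The X-Accel-Redirect technique works by having the WSGI application return a header instead of the file body; nginx intercepts the header and serves the file itself. A minimal sketch (the `/protected/` path is an assumption and must match an `internal` location in the nginx configuration):

```python
def application(environ, start_response):
    # After any auth/permission checks, hand the actual file transfer
    # off to nginx rather than streaming the bytes through Python.
    start_response('200 OK', [
        ('Content-Type', 'application/pdf'),
        ('X-Accel-Redirect', '/protected/report.pdf'),
    ])
    return [b'']
```

The WSGI process is tied up only for the permission check, not for however long a slow client takes to download the file.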
Request funnelling

[Diagram: requests funnel from the nginx front end, through the Apache workers, to the mod_wsgi daemons.]
Complete overload
Forced restarts
✴Triggers for restarts:
 ✴Manual restart to fix issues/configuration.
 ✴Maximum number of requests reached.
 ✴Reloading of new application code.
 ✴Individual requests block/timeout.
✴Restarts can make things worse.
Auto scaling
✴Apache/mod_wsgi embedded mode.
 ✴Apache prefork MPM defaults.
  ✴Initial 1 / Maximum 150
 ✴Apache worker MPM defaults.
  ✴Initial 2 / Maximum 6
✴Auto scaling can make things worse.
Pre-load everything

✴Start maximum processes up front.
✴Pre-load your web application when the
 process starts rather than lazily loading
 it on the first request.
✴Keep processes persistent in memory
 and avoid unnecessary restarts.
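With gunicorn, for example, pre-loading maps directly to a flag: `--preload` imports the application in the master process before forking, so workers share the loaded code via copy-on-write and the first request hits an already-warm process. A sketch (the module path is a placeholder):

```shell
# import the application in the master before forking 4 workers
gunicorn --workers 4 --preload mysite.wsgi:application
```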
Horizontal scaling

✴Using more servers is fine.
✴Load balance across dedicated hosts.
✴Or add additional hosts as required.
✴Ensure, though, that when adding more
 hosts, you have preloaded the web
 application before directing traffic to them.
Monitoring is key

✴Treat your server as a black box and you
 will never know what is going on inside.
Server monitoring

✴Open source tools.
 ✴Monit
 ✴Munin
 ✴Cacti
 ✴Nagios
Python web tools
✴Django debug toolbar.
 ✴Only useful for debugging a single request
  in a development setting.
✴Sentry.
 ✴Useful for capturing runtime errors, but
  performance issues don't generate
  exceptions.
New Relic APM
Apache/mod_wsgi
Summing up

✴Use benchmarks to explore a specific
 system, not to compare different systems.
✴Don't trust the defaults of any server, you
 need to tune it for your web application.
✴Monitor your live production systems.
✴New Relic for really deep introspection.
Try New Relic

✴ Graham.Dumpleton@gmail.com
✴ http://www.slideshare.net/GrahamDumpleton

✴ Find out more about New Relic:
  ✴ http://newrelic.com
✴ Extended Pro Trial for PyCon attendees:
  ✴ http://newrelic.com/30
✴ Come work for New Relic:
  ✴ http://newrelic.com/about/jobs

Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

PyCon US 2012 - Web Server Bottlenecks and Performance Tuning

  • 1. Web Server Bottlenecks and Performance Tuning Graham Dumpleton PyCon – March 2012 Graham Dumpleton @GrahamDumpleton Starting my PyCon talk. Let's hope I don't lose my voice completely while doing this.
  • 5. Web application Web application Front end time 0.15 seconds 3.1 seconds
  • 6. Performance Golden Rule "80-90% of the end-user response time is spent on the frontend. Start there." http://www.stevesouders.com/blog/2012/02/10/the-performance-golden-rule/
  • 9. Benchmarks as a tool ✴Web server benchmarks are of more value when used as an exploratory tool to understand how a specific system works, not to compare systems.
  • 10. What about load tests? ✴Hitting a site with extreme load will only show you that it will likely fail under a denial of service attack. ✴Your typical web server load test isn't alone going to help you understand how a web server is contributing to it failing.
  • 11. What should you test? ✴You should use a range of purpose built tests to trigger certain scenarios. ✴Use the tests to explore corner cases and not just the typical use case.
  • 12. Environment factors ✴Amount of memory available. ✴Number of processors available. ✴Use of threads vs processes. ✴Python global interpreter lock (GIL)
  • 13. Client impacts ✴Slow HTTP browsers/clients. ✴Browser keep alive connections.
  • 14. Application requirements ✴Need to handle static assets.
  • 15. Use cases to explore ✴Memory used by web application. ✴Using processes versus threads. ✴Impacts of long running requests. ✴Restarting of server processes. ✴Startup costs and lazy loading.
  • 16. Memory usage: chart comparing memory use of 1 process / 1000 threads against 1 process / 1 thread. http://nichol.as/benchmark-of-python-web-servers
  • 17. What affects memory use? ✴Web server base memory usage. ✴Web server per thread memory usage. ✴Application base memory usage. ✴Is application loaded prior to forking? ✴Per request transient memory usage.
  • 18. Processes Vs Threads: chart with number of processes (0–150) on the vertical axis and number of threads (0–150) on the horizontal axis.
  • 19. Apache/mod_wsgi defaults (Configuration – Max Processes / Threads): Apache (prefork) + mod_wsgi (embedded): 150 / 1; Apache (worker) + mod_wsgi (embedded): 6 / 25; Apache (prefork) + mod_wsgi (daemon): 1 / 15; Apache (worker) + mod_wsgi (daemon): 1 / 15
  • 20. Other WSGI servers (Configuration – Max Processes / Threads): FASTCGI flup (prefork): 50 / 1; FASTCGI flup (threaded): 1 / 5; gunicorn: 1 / 1; uWSGI: 1 / 1; tornado: 1 / 1
  • 21. Less than fair: chart plotting default configurations on the processes (0–150) vs threads (0–150) axes – Apache (prefork) + mod_wsgi (embedded), FASTCGI flup (prefork), Apache (prefork/worker) + mod_wsgi (daemon), and Apache (worker) + mod_wsgi (embedded).
  • 22. What to use? ✴Number of overall threads dictated by: ✴Number of concurrent users. ✴Response time for requests. ✴Processes preferred over threads, but: ✴Restricted by amount of memory. ✴Choice influenced by number of processors.
  • 23. Thread utilisation: chart of requests spread across 6 threads over a 1 second interval.
  • 24. Request backlog: at 60 requests per second a backlog occurred and queue time increased to 750 ms; thread utilisation jumped from 2.5 to 7.5 and maxed out at 9.
  • 25. Processes are better: at 75 requests per second, backlog only started at higher throughput and queue time stayed mostly under 100 ms; thread utilisation only jumped from 2.5 to 7.5 at higher throughput and didn't actually reach 9.
  • 26. CPU bound Bulk of time is from doing things within the process itself
  • 27. I/O wait Waiting on responses from backend services a significant proportion of time
  • 28. Long running requests ✴Complex calculations. ✴Slow backend services. ✴Large file uploads. ✴Large responses. ✴Slow HTTP clients.
  • 29. Varying request times Average: 1385 ms Minimum: 4.7 ms Maximum: 20184 ms Std Dev: 3896 ms
  • 30. Performance breakdown Why is creating the connection to PostgreSQL taking up 40% of overall response time?
  • 31. Slow HTTP clients ✴Add nginx as a front end to the WSGI server. ✴Brings the following benefits to the WSGI server. ✴Isolation from slow clients. ✴No need to handle keep alive in the WSGI server. ✴Can offload serving of static files. ✴Can use X-Accel-Redirect for dynamically generated files.
  • 32. Request funnelling: nginx front end → Apache workers → mod_wsgi daemons
  • 34. Forced restarts ✴Triggers for restarts: ✴Manual restart to fix issues/configuration. ✴Maximum number of requests reached. ✴Reloading of new application code. ✴Individual requests block/timeout. ✴Restarts can make things worse.
  • 35. Auto scaling ✴Apache/mod_wsgi embedded mode. ✴Apache prefork MPM defaults. ✴Initial 1 / Maximum 150 ✴Apache worker MPM defaults. ✴Initial 2 / Maximum 6 ✴Auto scaling can make things worse.
  • 36. Pre load everything ✴Start maximum processes up front. ✴Pre load your web application when the process starts and not lazily loaded on the first request. ✴Keep processes persistent in memory and avoid unnecessary restarts.
  • 37. Horizontal scaling ✴Using more servers is fine. ✴Load balance across dedicated hosts. ✴Or add additional hosts as required. ✴Ensure though that if adding more hosts that you have preloaded the web application before directing traffic to it.
  • 38. Monitoring is key ✴Treat your server as a black box and you will never know what is going on inside.
  • 39. Server monitoring ✴Open source tools. ✴Monit ✴Munin ✴Cacti ✴Nagios
  • 40. Python web tools ✴Django debug toolbar. ✴Only useful for debugging a single request in a development setting. ✴Sentry. ✴Useful for capturing runtime errors, but performance issues don't generate exceptions.
  • 43. Summing up ✴Use benchmarks to explore a specific system, not to compare different systems. ✴Don't trust the defaults of any server, you need to tune it for your web application. ✴Monitor your live production systems. ✴New Relic for really deep introspection.
  • 44. Try New Relic ✴ Graham.Dumpleton@gmail.com ✴ http://www.slideshare.net/GrahamDumpleton ✴ Find out more about New Relic: ✴ http://newrelic.com ✴ Extended Pro Trial for PyCon attendees: ✴ http://newrelic.com/30 ✴ Come work for New Relic: ✴ http://newrelic.com/about/jobs

Editor's Notes

  1. \n
  2. If you want to follow along with the slides on your laptop as the talk goes on, you can view them on slideshare.net. Just search for my name.\n
  3. Before we talk about web server performance tuning, it is important to step back and look at the bigger picture. Although newbies especially have an obsession with trying to find the fastest web server, reality is that things are more complicated than that. Systems have many moving parts. The actual per request latency introduced by a web server is very small in relation to time spent in other parts of the system. \n
  4. As far as a user is concerned, the main delays they will notice are those resulting from how long it takes their web browser to render the page returned by the web application. This will be followed up by network delays when talking to the web application, and when grabbing down static assets from media servers or content data networks.\n
  5. The time spent in the web application is therefore a small percentage of the time as perceived by the user for rendering a web page being served. Any time delays introduced by the web server will be a much smaller percentage again.\n
  6. Steve Souders summarizes this disparity between front end time and web application time in what he calls the "Performance Golden Rule". This is that "80-90% of the end-user response time is spent on the frontend". If you are after an easy win for improving end user satisfaction with your web site, the front end is where you should start.\n
  7. Although the big immediate gains may be won in the front end, the web application still presents lots of opportunity to further reduce response times through means other than fiddling with the web server. Improvements can be had in the application code, but also in databases or backend services used by the web application.\n
  8. Is running benchmarks on web servers a complete waste of time then? The answer is yes and no. The sort of benchmarks usually published on web sites to compare web servers are of little value. They generally only serve to give newbies a false sense of security over any decision they make as to which web server to use. Worse is that people forever reference them as gospel truth when they can be far from it.\n
  9. The main reason that the typical web server benchmark is useless is that it tests only a single narrow idealised use case. Web servers are implemented using different architectures and using different code. You are better off choosing a web server that you believe has the features you require and then use benchmarks to help explore the behaviour of that system.\n
  10. Often the documented benchmarks you find are nothing more than a hello world program. The test then consists of running it at maximum request throughput with some arbitrary number of concurrent users. This does not mirror what real traffic a public facing web server would receive. It certainly doesn't show what causes the server to fail as load increases, just that it will.\n
  11. What should you test then? There are many different use cases one could test and how any one performs can be dictated by the architecture of the system, how the code was written and how the system is configured when the test is run. The more interesting tests are those which deliberately go out to trigger specific problems. This is because it is the corner cases that are usually going to cause an issue rather than the typical use case.\n
  12. What sort of factors can come into play and affect performance? These are varied and can arise from the hardware or virtualised system being used. They can derive from the configuration you use for the specific web server, but also can be influenced by how the Python language interpreter works. To make it hard, these can all interplay with each other in unexpected ways.\n
  13. Some things can be out of your control altogether, such as the type of web browser and what type of network the traffic between you and the user has to traverse. Very few published benchmarks try to account for these issues in a realistic way.\n
  14. Requirements as dictated by your own web application or how you decided to architect your overall system can also contribute. Such as whether you try and use the same server to serve up static assets.\n
  15. To illustrate how some of these different factors can come into play, I will go through a few specific use cases that present issues in practice and where possible relate them to those factors. These include memory usage, use of processes vs threads, impacts of long running requests, restarting of server processes and startup costs.\n
  16. A simple place to get started is memory usage. This is always a hot topic of contention with benchmarks. It isn't hard to find people claiming that Apache is a big bloated memory hog. This benchmark in particular is representative of a poorly chosen Apache configuration. Of course it will use more memory if you configure it to have 1000 threads. If servers tested aren't set up in a comparable way, you can hardly expect it to be a fair comparison.\n
  17. Actually estimating the overall amount of memory used is not a difficult exercise, it is after all a simple formula which takes into consideration the number of processes, the base memory used by the web server, memory for each additional thread and the application itself. Things get more complicated when one considers per request transient memory, but ignoring that, one can easily visualise what you are dealing with.\n
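The simple formula described in that note can be sketched in Python. Every per-component figure below is an illustrative assumption, not a measurement; substitute values observed for your own server and application.

```python
# Rough memory model for a multi-process, multi-threaded WSGI server.
# All default figures are illustrative assumptions, not measurements.

def total_memory_mb(processes, threads_per_process,
                    server_base_mb=5.0,   # web server base usage per process (assumed)
                    per_thread_mb=0.5,    # per-thread overhead (assumed)
                    app_base_mb=80.0):    # loaded Python web application (assumed)
    """Estimate resident memory across all worker processes."""
    per_process = server_base_mb + app_base_mb + threads_per_process * per_thread_mb
    return processes * per_process

# One process with 1000 threads vs. five processes with 15 threads each.
print(total_memory_mb(1, 1000))  # 585.0
print(total_memory_mb(5, 15))    # 462.5
```

Even with these toy numbers the point stands: the application's base memory is duplicated per process, so process count dominates, while extra threads within an existing process are comparatively cheap.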
  18. In short, adding more processes is going to see memory usage grow quicker than adding more threads to existing processes. Although some of that per process base memory usage is the web server, the majority of it will in the end be your fat Python web application. To blame a web server for using too much memory is plain silly when your web application could be using up to 50 times as much memory. The issue is really about what configuration you chose to set the web server up with.\n
  19. What usually happens is that people will blindly use whatever the defaults are for a server. For fat Python web applications which use a lot of memory this can be disastrous. Apache with its prefork MPM can for example dynamically create up to 150 processes. That is potentially 150 copies of your fat Python web application. So of course it will use a lot of memory.\n
  20. Those servers which are generally seen as fairing best as far as memory usage are those whose default configurations use only a single process and single thread. Guess what, if you configure Apache that way then the amount of memory it uses will not be much different. Granted, it does help to also strip unneeded modules out of Apache that you don't use to really get the best from it. \n
  21. So don't start things off by using whatever the default processes/threads configurations are, especially if looking at memory usage. Do so and you can easily get the wrong impression. Also don't pick arbitrary values when you have no idea whether it is reasonable. At this scale a configuration with 1000 threads will not even fit on the chart, would almost be in the next room, and again in the red zone.\n
  22. How many processes and threads should you use then? The total number of threads across all processes is dictated by the number of overlapping concurrent requests. How much overlap there is depends on response times and throughput. Processes are preferred over threads, but constrained by memory. The optimal number can also be dictated by how many processors are available. Gunicorn, for example, recommends using 2 to 4 times as many worker processes as you have CPU cores.\n
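Gunicorn's documentation gives (2 x cores) + 1 as a starting point in this spirit; a sketch of that rule of thumb, with the multiplier exposed so it can be pushed toward 4 for more I/O-bound workloads:

```python
import multiprocessing

def suggested_workers(cores=None, factor=2):
    """Starting point in the style of gunicorn's (2 x cores) + 1 heuristic.

    Raise factor toward 4 for I/O-heavy workloads, but treat the result
    as a first guess to tune from live measurements, not a final answer.
    """
    cores = cores or multiprocessing.cpu_count()
    return factor * cores + 1

print(suggested_workers(cores=4))  # 9
```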
  23. One can get a feel for how many threads you will need by looking at thread utilisation. That is, how much do the requests take up of the potential capacity. In this example, by adding up the green areas representing the requests coming in over time, we have here a thread utilisation of about 2.0. This means that if all requests were serialised, we would need only two threads. Requests don't arrive in such an orderly fashion though, so we need more threads to ensure they aren't delayed.\n
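The utilisation figure in that note is just Little's law: average concurrency equals throughput multiplied by average response time. A minimal sketch:

```python
def thread_utilisation(throughput_rps, avg_response_s):
    """Little's law: average number of requests in flight (busy threads)."""
    return throughput_rps * avg_response_s

# 20 requests/second at a 100 ms average response time keeps, on
# average, about 2 threads busy.  Because arrivals are bursty rather
# than orderly, provision headroom well above this average.
print(thread_utilisation(20, 0.100))
```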
  24. Because response times are generally quite short, it is actually surprising how few threads you can get away with. If the number of threads is too low and response times or throughput grows though, then thread utilisation will increase. Eventually what happens then is that requests will start to backup as they wait for available threads and queueing time will increase. This will add to the delays that end users see in their total page load time.\n
  25. If we add processes rather than threads we can delay the onset of such problems. The reason that processes work better is that the Python global interpreter lock (GIL) effectively serialises execution within distinct threads of a single process. Adding more processes though obviously means more memory. This has nothing to do with which server you use; it is a choice bound by how much memory you have available.\n
  26. If you are memory constrained, finding the right balance and what you can get away with in order to still reduce memory usage is a tricky problem. It is all made harder when you have no idea what is going on inside of your web application. If a web application has a heavy bias towards CPU bound activity within the process, then you are forced towards the direction of needing more processes.\n
  27. If your web application is making lots of call outs to backend services and so threads are blocked waiting on I/O more of the time than not, you can get away with using more threads because the threads aren't competing as much with each other for use of the CPU within the same process. If you have no idea though what your web application is doing, this judgement is going to be a hit and miss affair as far as tuning the processes/threads balance.\n
  28. To make such judgements even harder you also have long running requests to contend with. These can arise due to issues in your own code or backend services, but also due to how much data you are moving around and how slow the HTTP clients are. The basic problem here is that a long running request, because it ties up a thread, will reduce the maximum throughput you could achieve during that period of time.\n
  29. The unpredictability of request times means you need to always ensure you have a good amount of extra capacity in the number of processes/threads allocated. Don't provide sufficient head room and when a number of long running requests coincide you will suddenly find thread availability drops, requests can start backlogging and overall response times as seen by the user will increase.\n
  30. Where your application code or backend service is slow, you obviously need to work out why. Sometimes issues can come from places you least expect them. For example, especially with Django, watch out for how long PostgreSQL database connections take. One thing you can consider in this case is a local external connection pooler such as pgbouncer. \n
  31. If you're using Apache/mod_wsgi or gunicorn, stick nginx in front of it and proxy requests through to your WSGI server. This will make your WSGI server perform better as you will be isolated from slow clients. The threads in the backend will be tied up for less time, meaning lower thread utilisation, thus allowing you to handle a higher throughput with less resources. You can also offload tasks such as static file serving to nginx, which is going to do a better job of it anyway.\n
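The X-Accel-Redirect offload mentioned above amounts to setting a response header from the WSGI application; nginx intercepts it and serves the file itself, freeing the worker thread almost immediately. A sketch, where '/protected/report.pdf' is a hypothetical path that would need a matching `internal` location in the nginx configuration:

```python
def application(environ, start_response):
    """Hand file delivery off to nginx instead of streaming from the worker.

    The '/protected/report.pdf' internal location is hypothetical; nginx
    must be configured to map it to the real file on disk.
    """
    start_response('200 OK', [
        ('Content-Type', 'application/pdf'),
        ('X-Accel-Redirect', '/protected/report.pdf'),
    ])
    # Body is empty; nginx replaces it with the file contents.
    return [b'']
```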
  32. When introducing a front end, do be careful though of the funnelling effect, especially if the number of concurrent requests that can be handled reduces at each step. If your web application backlogs, users may give up, but requests are still queued and have to be handled. Your web application wastes time and may have trouble catching up with the backlog. It is perhaps better to set up servers so requests time out with a 503 before getting to your web application if you can.\n
  33. Worst case scenario here is a complete overload where the server never really recovers for an extended period or until you can shutdown the server. Request timeouts within the web application where supported can help a bit, but only to throw out long running requests. As already mentioned, you really need to stop the requests getting to the web application if there is no longer a point handling them. Options here vary and solutions available to avoid it aren't always great.\n
  34. You might actually think that doing a restart will solve a problem with backlogged requests. You have to be careful here as well though. For some servers, the listener socket can be preserved, so any backlog there isn't actually cleared. Further, when performing a restart, new processes have to be created and application loaded again. This can take time and cause more requests to backlog. So choose carefully when you restart. To totally reset, it is better to do a full shutdown and clear the backlog.\n
  35. For fat Python web applications with a large startup cost, server configurations which allow for auto scaling can also compound problems. When under load and you get a further throughput spike the server can decide to start more processes. This slows the system down temporarily, causing backlog and if it takes a long time to start processes, the server could decide to start even more processes, increasing system load again, blowing out memory and overloading your whole system.\n
  36. To avoid unexpected surprises, you are better off starting up the maximum number of processes you expect to require or can fit in available memory with your web application loaded. Ensure you pre load your web application when processes start and not lazily when first request arrives. Do everything possible to keep the processes in memory all the time, avoiding restarts. Especially don't use options to restart when some maximum request count is reached.\n
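The eager-versus-lazy distinction in that note can be shown in miniature; `expensive_startup()` is a hypothetical stand-in for imports, configuration loading, cache warming and so on:

```python
def expensive_startup():
    """Hypothetical stand-in for slow application initialisation."""
    return {"ready": True}

# Eager: runs at module import time, i.e. at process start (and before
# forking where the server supports it), so the first request pays nothing.
RESOURCES = expensive_startup()

def eager_view():
    return RESOURCES

# Lazy: the unlucky first request after every (re)start absorbs the
# full startup delay instead.
_lazy = None

def lazy_view():
    global _lazy
    if _lazy is None:
        _lazy = expensive_startup()
    return _lazy
```

Under load, the lazy variant is what turns a routine restart into a burst of slow responses, which is why the note recommends preloading and keeping processes persistent.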
  37. Because the suggestion is that you should preconfigure the server to its maximum capacity at the outset, it does limit the vertical scaling you can do at least within the confines of the same hardware. Next step therefore is horizontal scaling. Keep in mind the same issues about preloading. You don't want to bring on new hosts and direct traffic to them, only for the first requests sent to it to be delayed while the application loads.\n
  38. No matter how you set your system up, if problems do arise, the only way you are going to start to be able to understand what went wrong when it does all crash in a heap is through monitoring. If you treat your system as a black box, how will you know what is going on inside? One thing is for sure, all those benchmarks you may have run to find out what the fastest web server was are not going to help you one bit.\n
  39. Server monitoring tools, although useful, only show you the effect of the problem on the overall system. They don't necessarily provide you that insight of what is going on inside of your web application as they still largely treat your web application and web server like a black box. A deeper level of introspection is required.\n
  40. When we talk about finding out what is happening inside of your Python web application, the options have been limited. Tools such as Django debug toolbar, or the Python profiler are only suited to a development environment. Sentry can be used in production to capture errors, but performance problems aren't going to generate nice exceptions for you.\n
  41. This historical lack of good tools for knowing what is going on inside of your Python web application is why I am loving my current day job. If you had managed to miss it, I am now working at New Relic. New Relic performance monitoring provides the ability to monitor the front end, your web application and the underlying server. I am bringing all that goodness to the world of the Python web. New Relic gives you that deep introspection required to know what is going on.\n
  42. I am of course also the author of mod_wsgi. Being able to get New Relic working with Python means I have been able to use the reporting it provides to delve quite deep into the behaviour of mod_wsgi under different situations. The results have been quite revealing. One of the areas it has helped in understanding is the funnelling effects when using daemon mode. I'll admit there is room for improvement and I will be trying to address some issues in mod_wsgi 4.0.\n
  43. Summing things up. Pick a web server and architecture which seems to meet your requirements, then use benchmarks to evaluate its behaviour. Don't use benchmarks simply to try and compare different systems. Don't trust server defaults. Configure and tune your whole stack based on the results you get from live production monitoring. Try using New Relic for really deep introspection of what is going on in all parts of your system.\n
  44. So, if you are doing Python web application development, do consider giving New Relic a try. If you're not sure, New Relic does provide a free trial period where you can try out all the features it has. Even when the trial ends, a free Lite subscription is available which still provides lots of useful information. If you want to work at New Relic then come talk to us. Right now we are looking for a Python developer in Portland. While you think about how cool that might be, we should have time for questions.\n