SlideShare a Scribd company logo
1 of 91
Drupal’s page loading performance in 2010

Wim Leers ~ http://wimleers.com/

Drupal.org, IRC, Twitter, LinkedIn: wimleers
Bachelor thesis


“Improving Drupal’s page loading performance”


• Promotor: Prof. dr. Wim Lamotte
• Co-promotor: dr. Peter Quax
• Mentors: Stijn Agten & Maarten Wijnants
Goal

• Faster web sites


     • Speed = satisfaction = more & happier visitors = more revenue




Source: http://www.slideshare.net/stubbornella/designing-fast-websites-presentation, Nicole Sullivan, Yahoo!
Goal

• Faster web sites


     • Speed = satisfaction = more & happier visitors = more revenue


• Google: +0.5s → -20% searches




Source: http://www.slideshare.net/stubbornella/designing-fast-websites-presentation, Nicole Sullivan, Yahoo!
Terminology: page loading performance




              90%                10%




                    CSS, JS, images …
                    HTML
Drupal’s page loading performance

• One of the most effective measures:
   “Use a CDN”
Drupal’s page loading performance

• One of the most effective measures:
   “Use a CDN”


• Drupal: not yet possible! Possible now! :)
Terminology: CDN
Key properties of a CDN
Key properties of a CDN

• Geographical spread
Key properties of a CDN

• Geographical spread


• Pull versus Push

                                          Pull
                                                           F
                          transfer
                                      automatically       We
                          protocol



                           +         virtually no setup



                           –           no flexibility
Key properties of a CDN

• Geographical spread


• Pull versus Push

                                     Pull                 Push
                                                      FTP, SFTP, rsync,
                     transfer
                                 automatically       WebDAV, Amazon S3
                     protocol
                                                             …


                        +       virtually no setup        flexibility



                        –         no flexibility            setup
Key properties of a CDN

• Geographical spread


• Pull versus Push


• Lock-in                            Pull                 Push
                                                      FTP, SFTP, rsync,
                     transfer
                                 automatically       WebDAV, Amazon S3
                     protocol
                                                             …


                        +       virtually no setup        flexibility



                        –         no flexibility            setup
Profiling: Episodes
Profiling: Episodes

• Measures “episodes” during page loading
Profiling: Episodes

• Measures “episodes” during page loading

• Real measurements: JS in browser, for each visitor

• No simulation!
Episodes module
Episodes module

• Drupal module that offers Episodes integration
Episodes module

• Drupal module that offers Episodes integration
Episodes Server module
Episodes Server module

• Drupal module that visualizes the collected measurements
Daemon: File Conveyor
Daemon: File Conveyor

1. Configuration: simple XML file
Daemon: File Conveyor

1. Configuration: simple XML file

2. Detection: instantaneous
Daemon: File Conveyor

1. Configuration: simple XML file

2. Detection: instantaneous

3. Processing: store image more efficiently … — extensible!
Daemon: File Conveyor

1. Configuration: simple XML file

2. Detection: instantaneous

3. Processing: store image more efficiently … — extensible!

4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible!
Daemon: File Conveyor

1. Configuration: simple XML file

2. Detection: instantaneous

3. Processing: store image more efficiently … — extensible!

4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible!

5. Result: SQLite DB with CDN URLs
Daemon: File Conveyor

1. Configuration: simple XML file

2. Detection: instantaneous

3. Processing: store image more efficiently … — extensible!

4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible!

5. Result: SQLite DB with CDN URLs




    Project homepage: http://fileconveyor.org/ — code on GitHub
Demo
Daemon’s capabilities: scenario 1

Company wants to switch from CDN provider X to Amazon S3.
- CDN X: FTP
- Amazon S3: custom protocol
Daemon’s capabilities: scenario 1

Company wants to switch from CDN provider X to Amazon S3.
- CDN X: FTP
- Amazon S3: custom protocol


• Setup:


  • CDN X → Amazon S3


  • File Conveyor
Daemon’s capabilities: scenario 1

Company wants to switch from CDN provider X to Amazon S3.
- CDN X: FTP
- Amazon S3: custom protocol


• Setup:


   • CDN X → Amazon S3


   • File Conveyor


• Alternative: write a lot of code
Daemon’s capabilities: scenario 2

U.S. company expands to South-Korea
Daemon’s capabilities: scenario 2

U.S. company expands to South-Korea


• Setup:


  • North-American CDN


  • Static file server in South-Korea


  • File Conveyor + language-/subdomain-based logic to pick CDN/server
Daemon’s capabilities: scenario 2

U.S. company expands to South-Korea


• Setup:


  • North-American CDN


  • Static file server in South-Korea


  • File Conveyor + language-/subdomain-based logic to pick CDN/server


• Alternative: global CDN or slower web site in South-Korea
CDN integration module
CDN integration module

• Drupal core patch for Drupal 6, committed to Drupal 7 &   6
CDN integration module

• Drupal core patch for Drupal 6, committed to Drupal 7 &   6

• Basic mode: Origin Pull CDN
CDN integration module

• Drupal core patch for Drupal 6, committed to Drupal 7 &   6

• Basic mode: Origin Pull CDN

• Advanced mode: any CDN (File Conveyor)
Test case
Test case
Test case
Bachelor thesis: conclusion
Bachelor thesis: conclusion

1. Speed of web site very important
Bachelor thesis: conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance
Bachelor thesis: conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN
Bachelor thesis: conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration
It doesn’t end here!
What else exists …
What else exists …

High-traffic Drupal sites:
What else exists …

High-traffic Drupal sites:
What else exists …

High-traffic Drupal sites:




  “Pressflow is a distribution of Drupal with integrated performance,
         scalability, availability, and testing enhancements.”
What else is coming …
What else is coming …

1. CDN integration 2
What else is coming …

1. CDN integration 2

2. ESI
What else is coming …

1. CDN integration 2

2. ESI

3. Master thesis
CDN integration 2

 Goals
CDN integration 2

 Goals


 1. Significant expansion of functionality
CDN integration 2

 Goals
                                            Obsoleting modules:
 1. Significant expansion of functionality     Parallel & Simple CDN
                                            New mode: Media Mover
CDN integration 2

 Goals
                                            Obsoleting modules:
 1. Significant expansion of functionality     Parallel & Simple CDN
                                            New mode: Media Mover


 2. Awesome UX
CDN integration 2

 Goals
                                            Obsoleting modules:
 1. Significant expansion of functionality     Parallel & Simple CDN
                                            New mode: Media Mover


 2. Awesome UX                                  Bojhan Somers
CDN integration 2

 Goals
                                                Obsoleting modules:
 1. Significant expansion of functionality         Parallel & Simple CDN
                                                New mode: Media Mover


 2. Awesome UX                                      Bojhan Somers



 3. Excellent docs + high-quality screencasts
CDN integration 2

 Goals
                                                Obsoleting modules:
 1. Significant expansion of functionality         Parallel & Simple CDN
                                                New mode: Media Mover


 2. Awesome UX                                      Bojhan Somers



 3. Excellent docs + high-quality screencasts



 4. Streamlined CDN-specific UI
CDN integration 2

 Goals
                                                Obsoleting modules:
 1. Significant expansion of functionality         Parallel & Simple CDN
                                                New mode: Media Mover


 2. Awesome UX                                      Bojhan Somers



 3. Excellent docs + high-quality screencasts



 4. Streamlined CDN-specific UI
CDN integration 2
ESI




      Source: http://www.trygve-lie.com/blog/entry/esi_explained_simple
ESI




      Source: http://www.trygve-lie.com/blog/entry/esi_explained_simple
ESI




      Based on: http://www.trygve-lie.com/blog/entry/esi_explained_simple
ESI
ESI

• ESI module: http://drupal.org/project/esi
ESI

• ESI module: http://drupal.org/project/esi

• In development
ESI

• ESI module: http://drupal.org/project/esi

• In development

•             developers are actively participating
ESI

• ESI module: http://drupal.org/project/esi

• In development

•               developers are actively participating

    • David Strauss (Four Kitchens)
ESI

• ESI module: http://drupal.org/project/esi

• In development

•               developers are actively participating

    • David Strauss (Four Kitchens)

    • Josh Koenig (Chapter Three)
ESI

• ESI module: http://drupal.org/project/esi

• In development

•               developers are actively participating

    • David Strauss (Four Kitchens)

    • Josh Koenig (Chapter Three)



• Note: not every CDN supports ESI!
Master thesis


“Web Performance Optimization: Analytics”



• Promotor: Prof. dr. Jan van den Bussche
Master thesis
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
 and visualize them.
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:

  • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:

  • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”

  • “http://d.o/project/esi has slowly loading CSS”
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:

  • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”

  • “http://d.o/project/esi has slowly loading CSS”

  • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser
    Internet Explorer 6 or 7”
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:

  • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”

  • “http://d.o/project/esi has slowly loading CSS”

  • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser
    Internet Explorer 6 or 7”


• Challenges: complex data mining, distributed computing, storage, performance
Master thesis

• An application that can automatically extract conclusions out of Episodes logs
  and visualize them.

• e.g.:

  • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”

  • “http://d.o/project/esi has slowly loading CSS”

  • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser
    Internet Explorer 6 or 7”


• Challenges: complex data mining, distributed computing, storage, performance

    See http://wimleers.com/tags/master-thesis
Conclusion
Conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration
Conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration

5. CDN integration module 2 → more awesome!
Conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration

5. CDN integration module 2 → more awesome!

6. ESI → big potential
Conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration

5. CDN integration module 2 → more awesome!

6. ESI → big potential

7. Master thesis → more insight
Conclusion

1. Speed of web site very important

2. Episodes → monitor page loading performance

3. CDN integration module → best method to integrate Drupal with a CDN

4. File Conveyor → flexible CDN integration

5. CDN integration module 2 → more awesome!

6. ESI → big potential

7. Master thesis → more insight


    This talk will be available at http://wimleers.com/talk/fosdem-2010
Need Drupal page loading performance help?

  Contact me: http://wimleers.com/contact




  This talk will be available at http://wimleers.com/talk/fosdem-2010

More Related Content

Viewers also liked

Master Thesis Overview Remco de Kramer
Master Thesis Overview Remco de KramerMaster Thesis Overview Remco de Kramer
Master Thesis Overview Remco de Kramer
Remco de Kramer
 
APAMPA MASTER THESIS PRESENTATION
APAMPA MASTER THESIS PRESENTATIONAPAMPA MASTER THESIS PRESENTATION
APAMPA MASTER THESIS PRESENTATION
Olatunji Apampa
 
N.A.C. Hartman; 465392; Master Thesis Marketing Management
N.A.C. Hartman; 465392; Master Thesis Marketing ManagementN.A.C. Hartman; 465392; Master Thesis Marketing Management
N.A.C. Hartman; 465392; Master Thesis Marketing Management
Nick Hartman
 
System edukacyjny w polsce stary power point
System edukacyjny w polsce   stary power pointSystem edukacyjny w polsce   stary power point
System edukacyjny w polsce stary power point
angelus85
 
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
Wouter Verbeek
 
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
Sebastian Guth
 
Bachelor Thesis - Internationalisation & Foreign Market Entry Mode
Bachelor Thesis - Internationalisation & Foreign Market Entry ModeBachelor Thesis - Internationalisation & Foreign Market Entry Mode
Bachelor Thesis - Internationalisation & Foreign Market Entry Mode
Tobias Lukaschek
 
Thesis Defense_Mirka Heitland_2014
Thesis Defense_Mirka Heitland_2014Thesis Defense_Mirka Heitland_2014
Thesis Defense_Mirka Heitland_2014
Mirka Heitland
 

Viewers also liked (17)

Final Presentation
Final PresentationFinal Presentation
Final Presentation
 
Master Thesis Overview Remco de Kramer
Master Thesis Overview Remco de KramerMaster Thesis Overview Remco de Kramer
Master Thesis Overview Remco de Kramer
 
My Master thesis
My Master thesisMy Master thesis
My Master thesis
 
Master thesis
Master thesisMaster thesis
Master thesis
 
APAMPA MASTER THESIS PRESENTATION
APAMPA MASTER THESIS PRESENTATIONAPAMPA MASTER THESIS PRESENTATION
APAMPA MASTER THESIS PRESENTATION
 
N.A.C. Hartman; 465392; Master Thesis Marketing Management
N.A.C. Hartman; 465392; Master Thesis Marketing ManagementN.A.C. Hartman; 465392; Master Thesis Marketing Management
N.A.C. Hartman; 465392; Master Thesis Marketing Management
 
Global marketing - entry strategies
Global marketing - entry strategiesGlobal marketing - entry strategies
Global marketing - entry strategies
 
System edukacyjny w polsce stary power point
System edukacyjny w polsce   stary power pointSystem edukacyjny w polsce   stary power point
System edukacyjny w polsce stary power point
 
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
Condition monitoring for track circuits: A multiple-model approach (MSc. Thesis)
 
Why do new ventures in e-commerce fail?
Why do new ventures in e-commerce fail?Why do new ventures in e-commerce fail?
Why do new ventures in e-commerce fail?
 
Thesis writing clinic 2014
Thesis writing clinic 2014Thesis writing clinic 2014
Thesis writing clinic 2014
 
Exposé zur Master Thesis
Exposé zur Master ThesisExposé zur Master Thesis
Exposé zur Master Thesis
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
Master Thesis Sebastian Guth - Developing a competitor analysis canvas model ...
 
Bachelor Thesis - Internationalisation & Foreign Market Entry Mode
Bachelor Thesis - Internationalisation & Foreign Market Entry ModeBachelor Thesis - Internationalisation & Foreign Market Entry Mode
Bachelor Thesis - Internationalisation & Foreign Market Entry Mode
 
emerging market entry strategies
emerging market entry strategiesemerging market entry strategies
emerging market entry strategies
 
Thesis Defense_Mirka Heitland_2014
Thesis Defense_Mirka Heitland_2014Thesis Defense_Mirka Heitland_2014
Thesis Defense_Mirka Heitland_2014
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Drupal's page loading performance in 2010

  • 1. Drupal’s page loading performance in 2010 Wim Leers ~ http://wimleers.com/ Drupal.org, IRC, Twitter, LinkedIn: wimleers
  • 2. Bachelor thesis “Improving Drupal’s page loading performance” • Promotor: Prof. dr. Wim Lamotte • Co-promotor: dr. Peter Quax • Mentors: Stijn Agten & Maarten Wijnants
  • 3. Goal • Faster web sites • Speed = satisfaction = more & happier visitors = more revenue Source: http://www.slideshare.net/stubbornella/designing-fast-websites-presentation, Nicole Sullivan, Yahoo!
  • 4. Goal • Faster web sites • Speed = satisfaction = more & happier visitors = more revenue • Google: +0.5s → -20% searches Source: http://www.slideshare.net/stubbornella/designing-fast-websites-presentation, Nicole Sullivan, Yahoo!
  • 5. Terminology: page loading performance 90% 10% CSS, JS, images … HTML
  • 6. Drupal’s page loading performance • One of the most effective measures: “Use a CDN”
  • 7. Drupal’s page loading performance • One of the most effective measures: “Use a CDN” • Drupal: not yet possible! Possible now! :)
  • 10. Key properties of a CDN • Geographical spread
  • 11. Key properties of a CDN • Geographical spread • Pull versus Push Pull F transfer automatically We protocol + virtually no setup – no flexibility
  • 12. Key properties of a CDN • Geographical spread • Pull versus Push Pull Push FTP, SFTP, rsync, transfer automatically WebDAV, Amazon S3 protocol … + virtually no setup flexibility – no flexibility setup
  • 13. Key properties of a CDN • Geographical spread • Pull versus Push • Lock-in Pull Push FTP, SFTP, rsync, transfer automatically WebDAV, Amazon S3 protocol … + virtually no setup flexibility – no flexibility setup
  • 15. Profiling: Episodes • Measures “episodes” during page loading
  • 16. Profiling: Episodes • Measures “episodes” during page loading • Real measurements: JS in browser, for each visitor • No simulation!
  • 18. Episodes module • Drupal module that offers Episodes integration
  • 19. Episodes module • Drupal module that offers Episodes integration
  • 21. Episodes Server module • Drupal module that visualizes the collected measurements
  • 23. Daemon: File Conveyor 1. Configuration: simple XML file
  • 24. Daemon: File Conveyor 1. Configuration: simple XML file 2. Detection: instantaneous
  • 25. Daemon: File Conveyor 1. Configuration: simple XML file 2. Detection: instantaneous 3. Processing: store image more efficiently … — extensible!
  • 26. Daemon: File Conveyor 1. Configuration: simple XML file 2. Detection: instantaneous 3. Processing: store image more efficiently … — extensible! 4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible!
  • 27. Daemon: File Conveyor 1. Configuration: simple XML file 2. Detection: instantaneous 3. Processing: store image more efficiently … — extensible! 4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible! 5. Result: SQLite DB with CDN URLs
  • 28. Daemon: File Conveyor 1. Configuration: simple XML file 2. Detection: instantaneous 3. Processing: store image more efficiently … — extensible! 4. Syncing: supports many protocols (FTP, Amazon S3 …) — extensible! 5. Result: SQLite DB with CDN URLs Project homepage: http://fileconveyor.org/ — code on GitHub
  • 29. Demo
  • 30. Daemon’s capabilities: scenario 1 Company wants to switch from CDN provider X to Amazon S3. - CDN X: FTP - Amazon S3: custom protocol
  • 31. Daemon’s capabilities: scenario 1 Company wants to switch from CDN provider X to Amazon S3. - CDN X: FTP - Amazon S3: custom protocol • Setup: • CDN X → Amazon S3 • File Conveyor
  • 32. Daemon’s capabilities: scenario 1 Company wants to switch from CDN provider X to Amazon S3. - CDN X: FTP - Amazon S3: custom protocol • Setup: • CDN X → Amazon S3 • File Conveyor • Alternative: write a lot of code
  • 33. Daemon’s capabilities: scenario 2 U.S. company expands to South-Korea
  • 34. Daemon’s capabilities: scenario 2 U.S. company expands to South-Korea • Setup: • North-American CDN • Static file server in South-Korea • File Conveyor + language-/subdomain-based logic to pick CDN/server
  • 35. Daemon’s capabilities: scenario 2 U.S. company expands to South-Korea • Setup: • North-American CDN • Static file server in South-Korea • File Conveyor + language-/subdomain-based logic to pick CDN/server • Alternative: global CDN or slower web site in South-Korea
  • 37. CDN integration module • Drupal core patch for Drupal 6, committed to Drupal 7 & 6
  • 38. CDN integration module • Drupal core patch for Drupal 6, committed to Drupal 7 & 6 • Basic mode: Origin Pull CDN
  • 39. CDN integration module • Drupal core patch for Drupal 6, committed to Drupal 7 & 6 • Basic mode: Origin Pull CDN • Advanced mode: any CDN (File Conveyor)
  • 44. Bachelor thesis: conclusion 1. Speed of web site very important
  • 45. Bachelor thesis: conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance
  • 46. Bachelor thesis: conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN
  • 47. Bachelor thesis: conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration
  • 50. What else exists … High-traffic Drupal sites:
  • 51. What else exists … High-traffic Drupal sites:
  • 52. What else exists … High-traffic Drupal sites: “Pressflow is a distribution of Drupal with integrated performance, scalability, availability, and testing enhancements.”
  • 53. What else is coming …
  • 54. What else is coming … 1. CDN integration 2
  • 55. What else is coming … 1. CDN integration 2 2. ESI
  • 56. What else is coming … 1. CDN integration 2 2. ESI 3. Master thesis
  • 58. CDN integration 2 Goals 1. Significant expansion of functionality
  • 59. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover
  • 60. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover 2. Awesome UX
  • 61. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover 2. Awesome UX Bojhan Somers
  • 62. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover 2. Awesome UX Bojhan Somers 3. Excellent docs + high-quality screencasts
  • 63. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover 2. Awesome UX Bojhan Somers 3. Excellent docs + high-quality screencasts 4. Streamlined CDN-specific UI
  • 64. CDN integration 2 Goals Obsoleting modules: 1. Significant expansion of functionality Parallel & Simple CDN New mode: Media Mover 2. Awesome UX Bojhan Somers 3. Excellent docs + high-quality screencasts 4. Streamlined CDN-specific UI
  • 66. ESI Source: http://www.trygve-lie.com/blog/entry/esi_explained_simple
  • 67. ESI Source: http://www.trygve-lie.com/blog/entry/esi_explained_simple
  • 68. ESI Based on: http://www.trygve-lie.com/blog/entry/esi_explained_simple
  • 69. ESI
  • 70. ESI • ESI module: http://drupal.org/project/esi
  • 71. ESI • ESI module: http://drupal.org/project/esi • In development
  • 72. ESI • ESI module: http://drupal.org/project/esi • In development • developers are actively participating
  • 73. ESI • ESI module: http://drupal.org/project/esi • In development • developers are actively participating • David Strauss (Four Kitchens)
  • 74. ESI • ESI module: http://drupal.org/project/esi • In development • developers are actively participating • David Strauss (Four Kitchens) • Josh Koenig (Chapter Three)
  • 75. ESI • ESI module: http://drupal.org/project/esi • In development • developers are actively participating • David Strauss (Four Kitchens) • Josh Koenig (Chapter Three) • Note: not every CDN supports ESI!
  • 76. Master thesis “Web Performance Optimization: Analytics” • Promotor: Prof. dr. Jan van den Bussche
  • 78. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them.
  • 79. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.:
  • 80. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.: • “http://d.o/ is slow in Belgium, for users of the ISP Telenet”
  • 81. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.: • “http://d.o/ is slow in Belgium, for users of the ISP Telenet” • “http://d.o/project/esi has slowly loading CSS”
  • 82. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.: • “http://d.o/ is slow in Belgium, for users of the ISP Telenet” • “http://d.o/project/esi has slowly loading CSS” • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser Internet Explorer 6 or 7”
  • 83. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.: • “http://d.o/ is slow in Belgium, for users of the ISP Telenet” • “http://d.o/project/esi has slowly loading CSS” • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser Internet Explorer 6 or 7” • Challenges: complex data mining, distributed computing, storage, performance
  • 84. Master thesis • An application that can automatically extract conclusions out of Episodes logs and visualize them. • e.g.: • “http://d.o/ is slow in Belgium, for users of the ISP Telenet” • “http://d.o/project/esi has slowly loading CSS” • “http://d.o/project/cdn has slowly loading JS for visitors that use the browser Internet Explorer 6 or 7” • Challenges: complex data mining, distributed computing, storage, performance See http://wimleers.com/tags/master-thesis
  • 86. Conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration
  • 87. Conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration 5. CDN integration module 2 → more awesome!
  • 88. Conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration 5. CDN integration module 2 → more awesome! 6. ESI → big potential
  • 89. Conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration 5. CDN integration module 2 → more awesome! 6. ESI → big potential 7. Master thesis → more insight
  • 90. Conclusion 1. Speed of web site very important 2. Episodes → monitor page loading performance 3. CDN integration module → best method to integrate Drupal with a CDN 4. File Conveyor → flexible CDN integration 5. CDN integration module 2 → more awesome! 6. ESI → big potential 7. Master thesis → more insight This talk will be available at http://wimleers.com/talk/fosdem-2010
  • 91. Need Drupal page loading performance help? Contact me: http://wimleers.com/contact This talk will be available at http://wimleers.com/talk/fosdem-2010

Editor's Notes

  1. A content delivery network (CDN) is a collection of webservers that are distributed across multiple locations, to deliver data more efficiently to the end user. Typically, the server with the shortest distance to the end user is selected.
  2. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  3. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  4. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  5. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  6. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  7. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  8. - geographical spread: more spread is better, but more expensive - lock-in: * push: supporting all transfer protocols is too hard * pull: not every CDN supports pulling (and unlikely that the same features are supported)
  9. - Best of +- 10 evaluated tools and commercial services
  10. - Best of +- 10 evaluated tools and commercial services
  11. - Best of +- 10 evaluated tools and commercial services
  12. Result: no switching cost → no lock-in!
  13. Result: no switching cost → no lock-in!
  14. Result: no switching cost → no lock-in!
  15. Result: faster web site, same setup cost, lower maintenance cost
  16. Result: faster web site, same setup cost, lower maintenance cost
  17. Result: faster web site, same setup cost, lower maintenance cost
  18. - own web site - > 100.000 visitors per month, >150 countries - makes it a good test case to show the effect of CDN integration - far less images than the average web site, thus less HTTP requests, thus harder to show the difference - Europe + Russia: server in Belgium, rest of the world: CDN with servers in North America - uses ip2country module to determine country of user
  19. - information in the screenshots generated by the Episodes Server module - > 2.7 million episodes measured over > 260k page views - before June 21: large variability, especially globally - starting June 21: CDN integration active - starting June 22: also CSS & JS of delicious widget with CDN integration - very clear downward trend starting June 21, even stronger starting June 22 (partially thanks to caching in the browser)