HOW TO PAINLESSLY MIGRATE LARGE PAGES WITH MILLIONS OF PAGES.
The methodology and technical aspects of the migration of large parties: The process of technical configuration, tools, examples of traps, and more.
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
Big Site Migrations- Michal Magdziarz, CEO of DeepCrawl
1. URL Platform Migration
for BIG Websites
Michal Magdziarz
DeepCrawl
CEO & Co-founder
How to avoid a disaster
2. Success Influencers
There are two main factors that affect a success of a migrations
• The quality of redirections – short & long term effect
• The quality of the new website – long term effect
3. Success Influencers
Today we will focus on the first element so the traffic immediately after
the migration doesn’t look like this.
4. Table of Contents
• Migration Process
• Technical Server Setup & Apache RewriteRules
• Migration/Redirections Testing
• Practical/Pitfalls
13. Migration Process - Priority URLs list
Divide your Test URL list based on priority:
• Top 5000 - covering 70% of traffic
• Top 10K - covering 90% of traffic
• Top 100K - covering 95% of traffic
Based on page templates
• Top 10k products
• Top 10k categories
• Top 10k search results
• etc.
Based on rule type - Sample URLs per each rule
14. Migration Process - Page Relevancy & Quality
NEW PLATFORMLEGACY PLATFORM
151 results 23 results
33. Testing – Other potential issues
• Changes to Mobile configuration e.g. Dedicated Mobile
Consolidation to Responsive
• Changes to International hreflang setup
• Changes to Schema tags
• Changes to Titles & Descriptions
• Crawlability & Authority Distribution
• Multidomain Migrations
• Etc.
35. Practical/Pitfalls
Practical/Pitfalls
• Robots.txt – disallow for a URL in the Redirect Chain
• Don't forget to remove your test domain from rewrite rules
• Test the set up in the live ENV
• 301 vs 302 vs canonical
• Performance - one to one vs rule based
• Other: PPC, Social, Missing content
36. Pitfalls - disallow of a URL in a Redirect Chain
domain.com/legacy-url
>> 301 >>
domain.com/redirected-to-url
>> 301 >>
domain.com/redirected-to-disallowed-url
>> 301 >>
domain.com/new-url (200 status)
../robots.txt
User-agent: *
Disallow: *disallowed*
37. Practical - 301 vs 302
• Straight 301s are for the braves
• My preference is to go for 302s for 2 weeks before committing to 301
• This allows for any last minute corrections
• Please see below the result of robots.txt issue and straight 301s below
• 70% drop in traffic in 1st week, the website has never fully recovered
38. Pitfalls - Performance – one-to-one vs. rule based
• A big ecommerce shop received a poorly written one-to-one
RewiteRules
• The RewiteRule file contained 100K rules!!
• This resulted in such poor performance that the RewriteRules were
switched off as they were slowing the site significantly
• The lack of redirections resulted in a massive drop in traffic
• The 100K RewriteRules were rewritten into:
• 17 - RewriteRules
• 10 - RewriteCond
• 4 - RewriteMaps
• The traffic was partially recovered
40. Pitfalls – Don't forget to remove your test domain
Rewrite
Rules
Redirection
Test Server
Live
Server
LIVE
Traffic
Live
Server
Rewrite
Rules
LIVE
Traffic!
41. Thanks for your time
FREE Trial:
https://www.deepcrawl.com/sem-krk