How do the machines know
   what Tasty Wheat
       tasted like?
           Mouse – The Matrix
Short SEO History
     • Web1.0
     • Web2.0
     • Web3.0
Genesis
• A story of the Internet, by
• Solving the most important problems
• Greatly influenced by one man…
Tim Berners‐Lee

“the World Wide Web is Berners-Lee's
alone. He designed it. He loosed it on the
world. And he more than anyone else has
fought to keep it open, nonproprietary and
free.”
                         Time Magazine, 1999
The Problem
• Where can I find the information?

       “Our ineptitude in getting at the record is
       largely caused by the artificiality of the
       systems of indexing.”
                        The Atlantic Monthly, 1945
Archie, 1990
• Indexed file names and
• Returned results based on pattern matching
Web1.0
•   Means HTML
•   Is born in 1991, with the help of
•   Tim Berners‐Lee (TBL), who also founded
•   WWW Consortium (W3C) at MIT, and also
•   Created WWW Virtual Library – the 1st catalog
Yahoo Directory, 1994
•   Vertical = categories... is like
•   “Show me all the stuff and I’ll handle it”
•   Manually indexed stuff, which was
•   OK for starters, but…
•   Websites quickly grew in number and
•   Y! started charging money for one listing
•   Increasingly more money...
WebCrawler, 1994
•   First SE to fully search text
•   Bought by AOL, then
•   Sold to Excite, which
•   Went bankrupt, and
•   WebCrawler ended up bought by InfoSpace
Other “Search Engines”
•   1994, reaches 60mil pages in ‘96
•   1995, bought by Overture, bought by Y!
•   1996, meta search, bought by Lycos
•   1997, bought by IAC/InterActiveCorp
•   1999, bought by Overture, meaning Y!
Shopping fun, right?
, 1998
• Open Directory Project
• Each listing is checked and certified by a
  volunteer
• The main source for Google Directory
Current State of Search Industry
Web1.0 Problems
• SE couldn’t understand text, so
• They said “why don’t you implement some
  meta tags (description & keywords) so we can
  get a glimpse of what you’re saying”
• The relevancy of a page with respect to a
  keyword was determined by a few factors, so
• It was very easy to abuse and spam, therefore
• Search Results had poor quality
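To see why this was so easy to abuse, here is a minimal sketch. It assumes a hypothetical engine that scores a page purely by how often the query appears in its keywords meta tag. The name `naive_score` and both sample pages are made up for illustration, and no real engine's formula is implied.

```python
from html.parser import HTMLParser


class MetaKeywords(HTMLParser):
    """Collects the content of <meta name="keywords" ...> tags."""

    def __init__(self):
        super().__init__()
        self.keywords = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "keywords":
            content = attrs.get("content") or ""
            self.keywords += [k.strip().lower() for k in content.split(",") if k.strip()]


def naive_score(html, query):
    """Hypothetical relevance: how many meta keywords equal the query."""
    parser = MetaKeywords()
    parser.feed(html)
    return parser.keywords.count(query.lower())


# Two made-up pages: one honest, one stuffing the same keyword.
honest = '<html><head><meta name="keywords" content="pizza, restaurant"></head></html>'
spammy = '<html><head><meta name="keywords" content="pizza, pizza, pizza, pizza"></head></html>'

print(naive_score(honest, "pizza"))
print(naive_score(spammy, "pizza"))
```

Under this assumed scoring the stuffed page wins simply by repeating itself, which is the kind of abuse the slide describes.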
Web2.0
• Is coined by... Tim O’Reilly, yet
• TBL later said that “web2.0” is a stupid,
  meaningless term and that he thought of it
  first in ’96 anyway
Web2.0 means
•   Google, which grew apart because of
•   PageRank (1998) invented by
•   Larry & Sergey who adapted the algo from
•   An MIT professor who had developed
•   A nasty mathematical formula for positioning
    keywords in a 3d space model based on the
    relevancy that one kw holds … whatever
PageRank actually means
•   That a link is a vote and
•   Not all links are created equal, so
•   It matters who links to you
•   Just like in our real life society
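The "link is a vote" idea can be sketched as a short power iteration. This is a simplified textbook version of PageRank, not Google's production algorithm. The damping value 0.85, the iteration count and the toy link graph are assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration.

    links maps each page to the list of pages it links to.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with a small baseline score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote equally among the pages it links to.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # A page with no outgoing links spreads its vote evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical toy web: everyone links to "hub", and "hub" links only to "a".
web = {
    "a": ["hub"],
    "b": ["hub"],
    "hub": ["a"],
    "lonely": ["hub"],
}
ranks = pagerank(web)
for page, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(f"{page}: {score:.3f}")
```

In this toy graph "a" and "b" each have at most one incoming link, yet "a" ends up well ahead of "b" because its single link comes from the highly ranked "hub": it matters who links to you.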
• Read the content of pages really well, just that
• Pages were crappy:
  – Non‐standard coding
  – Ugly tech (like applets)
  – Senseless IA
• So Google said: “don’t do evil and try to nicely
  format the info, according to W3C standards”
  (remember TBL)
Enter the SEO
SEO
• Is a multitude of practices aimed at facilitating
  the indexing of pages by search engines
• Evolves as the ranking algorithm changes, and
• Of course, the algorithm is kept secret.
SEO actually means
           Courtesy of Kelly Ishikawa
SEO actually means
• An on‐going battle between bots & SEO guys
• Now 100+ factors influence ranking
• And I’d like to take the time to talk about each
  one of them in the following…
Just kidding
My SEO Cheat Sheet
• Consider:
  1.   Page Titles
  2.   URLs (mod_rewrite)
  3.   Anchor Text
  4.   Website Architecture (IA)
  5.   Link Title & Alt Images
  6.   Relevant content (text)
  7.   Sitemap.xml
  8.   Hosting
  9.   Freshness
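As an illustration of item 7, here is a minimal sketch that builds a Sitemap.xml with Python's standard library. The namespace is the one defined by the sitemaps.org protocol. The function name `build_sitemap`, the URLs and the dates are placeholders invented for the example.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Return sitemap XML (as a string) for (url, last_modified) pairs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "lastmod").text = lastmod
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical pages; example.com is a placeholder domain.
pages = [
    ("http://example.com/", "2008-02-15"),
    ("http://example.com/seo-cheat-sheet", "2008-02-10"),
]
print(build_sitemap(pages))
```

The `lastmod` element also touches item 9 (Freshness), since it tells a crawler when a page last changed.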
Resources



     Matt Cutts Blog




Mihai’s SEO Cheat Sheet :D
Web2.0 Problems
•   © for pictures, articles, books, etc
•   PPC fraud
•   Privacy
•   Search Engine SPAM
•   Link bombing
•   Paid links
•   But more important...
Web2.0 Problems
• SE still don’t understand what the $#%@
  you’re talking about
• Crawling a website’s interface to extract info is
  almost insane
Web3.0
• Means semantic web
• Attention migrates from syntax/formatting to
  semantics and
• Meta Data (data about the data) becomes...
Web3.0
Resource Description Framework & Microformats
Resource Description Framework
•   Usually written as XML (RDF/XML)
•   RDF = Subject + Predicate + Object
•   S + P + O creates a Triple which
•   Can describe almost anything in the universe
•   Triples are connectable (eg: FOAF)
•   RDFa = XHTML + RDF (W3C compliant)
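A rough sketch of the triple idea, using plain Python tuples rather than a real RDF library. The FOAF namespace is the real vocabulary prefix. The people, the example.com URIs and the helper names are hypothetical.

```python
# Each triple is (subject, predicate, object). The example.com URIs are
# placeholders; FOAF is the actual namespace of the Friend of a Friend vocabulary.
FOAF = "http://xmlns.com/foaf/0.1/"

triples = {
    ("http://example.com/alice", FOAF + "name", "Alice"),
    ("http://example.com/alice", FOAF + "knows", "http://example.com/bob"),
    ("http://example.com/bob", FOAF + "name", "Bob"),
    ("http://example.com/bob", FOAF + "knows", "http://example.com/carol"),
    ("http://example.com/carol", FOAF + "name", "Carol"),
}


def objects(subject, predicate):
    """All objects o such that (subject, predicate, o) is in the graph."""
    return {o for s, p, o in triples if s == subject and p == predicate}


def friends_of_friends(person):
    """Follow foaf:knows twice: the object of one triple is the subject of the next."""
    result = set()
    for friend in objects(person, FOAF + "knows"):
        result |= objects(friend, FOAF + "knows")
    return result


for uri in friends_of_friends("http://example.com/alice"):
    print(objects(uri, FOAF + "name"))
```

Triples connect because the object of one statement can be the subject of another, which is what lets separate descriptions join into one graph.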
Microformats
•   hCalendar 
•   hCard
•   rel‐tag
•   VoteLinks
•   XFN
•   Geo
•   hResume
•   hReview
•   etc
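To show what a machine gains from a microformat, here is a minimal sketch that reads an hCard out of ordinary HTML. The class names `vcard`, `fn`, `org` and `tel` come from the hCard microformat. The parser itself is a simplified assumption (it expects each property element to contain only text), and the contact data is invented.

```python
from html.parser import HTMLParser


class HCardParser(HTMLParser):
    """Very small hCard reader: picks up fn, org and tel inside class="vcard"."""

    PROPERTIES = {"fn", "org", "tel"}

    def __init__(self):
        super().__init__()
        self.cards = []
        self.current_property = None

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if "vcard" in classes:
            # A new card starts at every element marked class="vcard".
            self.cards.append({})
        for name in classes:
            if name in self.PROPERTIES and self.cards:
                self.current_property = name

    def handle_data(self, data):
        if self.current_property and data.strip():
            self.cards[-1][self.current_property] = data.strip()

    def handle_endtag(self, tag):
        self.current_property = None


# Hypothetical markup; the person and organisation are placeholders.
html = """
<div class="vcard">
  <span class="fn">Jane Doe</span>,
  <span class="org">Example Org</span>
</div>
"""
parser = HCardParser()
parser.feed(html)
print(parser.cards)
```

The page still renders as normal text for people, while the class names give a program the structure without guessing from layout.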
Case Study
SPARQL
• SPARQL Protocol and RDF Query Language
• Standardized on 15th Jan 08 (1 month ago) and
• Endorsed by?... TBL

  “Trying to use the Semantic Web without
   SPARQL is like trying to use a relational
            database without SQL”
                        TBL
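To make the SQL comparison concrete, here is a toy sketch of what a SPARQL SELECT over triple patterns does: it finds every way to bind the `?variables` so that all patterns match stored triples. This is not a SPARQL engine. The terms are plain strings instead of URIs, and the data, predicate names and function names are invented for illustration.

```python
def match(pattern, triple, bindings):
    """Try to extend bindings so that pattern equals triple; return None on failure."""
    bindings = dict(bindings)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable must keep the same value everywhere it appears.
            if bindings.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return bindings


def select(patterns, triples):
    """Return every variable binding that satisfies all patterns."""
    solutions = [{}]
    for pattern in patterns:
        next_solutions = []
        for bindings in solutions:
            for triple in triples:
                extended = match(pattern, triple, bindings)
                if extended is not None:
                    next_solutions.append(extended)
        solutions = next_solutions
    return solutions


# Hypothetical data, written as (subject, predicate, object).
triples = [
    ("alice", "knows", "bob"),
    ("bob", "worksWith", "carol"),
    ("carol", "recommends", "Luigi's"),
    ("dave", "recommends", "Burger Place"),
]

# Roughly: SELECT ?place WHERE { alice knows ?f . ?f worksWith ?c . ?c recommends ?place }
query = [
    ("alice", "knows", "?f"),
    ("?f", "worksWith", "?c"),
    ("?c", "recommends", "?place"),
]
print([s["?place"] for s in select(query, triples)])
```

Only the recommendation reachable through alice's contacts is returned. The query states the shape of the answer and leaves the join to the engine, much as SQL does for tables.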
Potential
•   With SPARQL you skip the presentation layer
•   You can query ad‐hoc any API, so
•   You don’t need to crawl in advance, therefore
•   Information will be as fresh as it gets
And possibilities
• Query: “I can has pizza?”
• Returns:
  – A friend of yours (XFN ‐ Facebook)
  – has a colleague (FOAF ‐ LinkedIN) who
  – said that they make good pizza (hReview ‐ yelp) at
  – a restaurant nearby (geo – Gmaps)
  – Tip: U2 in concert today (hCalendar ‐ upcoming)
Perhaps now we can see
• Why Social Networking Communities are
  worth so much, even though most of them 
  don’t have a revenue model
  – Facebook
  – LinkedIN
  – Meebo
  – Beebo 
  – Pipu...
• They/We are the databases of the future
Thanks!

“Most of the right choices in SEO come from
  asking: What’s the best thing for the user?”
                                                                        Matt Cutts




                                   Mihai Gheza
    Creative Commons Attribution‐Noncommercial‐Share Alike 3.0 Unported License.

SEO for the Semantic Web

  • 1. How do the machines know what Tasty Wheat tasted like? Mouse – The Matrix
  • 2. Short SEO History Short SEO History • Web1 0 Web1.0 • Web2.0 • Web3.0
  • 3. Genesis • A story of the Internet by A story of the Internet, by • Solving the most important problems l i fl db • Greatly influenced by one man…
  • 4. Tim Berners‐Lee Tim Berners Lee “the World Wide Web is Berners-Lee's alone. He designed it. He loosed it on the g world. And he more than anyone else has fought to keep it open, nonproprietary and free.” Time Magazine, 1999 Time Magazine 1999
  • 5. The Problem The Problem • Where can I find the information? Where can I find the information? “Our ineptitude in getting at the record is largely caused by the artificiality of the systems of indexing ” indexing. The Atlantic Monthly, 1945
  • 6. Archie, 1990 Archie, 1990 • Indexed file names and Indexed file names and • Returned results based on pattern matching
  • 8. Web1.0 • Means HTML Means HTML • Is born in 1991, with the help of • Tim Berners‐Lee (TBL), who also founded i ( ) h l f d d • WWW Consortium (W3C) at MIT, and also • Created WWW Virtual Library – the 1st catalog
  • 9. Yahoo Directory, 1994 Yahoo Directory, 1994 • Vertical = categories is like Vertical = categories... is like • “Show me all the stuff and I’ll handle it” • Manually indexed stuff, which was ll i d d ff hi h • OK for starters, but… • Websites quickly grew in number and • Y! started charging money for one listing Y! started charging money for one listing • Increasingly more money...
  • 10.
  • 11. ,1994 • First SE to fully search text First SE to fully search text • Bought by AOL, then • S ld Sold to Excite, which i hi h • Excite went bankrupt and • WebCrawler ends up bought by InfoSpace
  • 12. Other  Search Engines Other “Search Engines” • 1994, reaches 60mil pages in  96 1994 reaches 60mil pages in ‘96 • 1995, bought by Overture, bought by Y! • 1996, meta search, bought by Lycos 996 h b h b • 1997, bought by IAC/InterActiveCorp • 1999, bought by Overture, meaning Y!
  • 14. , 1998 , 1998 • Open Directory Project Open Directory Project • Each listing is checked and certified by a  volunteer • The main source for Google Directory
  • 16. Web1.0 Problems • SE couldn’t understand text, so • They said “why don’t you implement some meta tags (description & keywords) so we can get a glimpse of what you’re saying” • The relevancy of a page with respect to a keyword was determined by a few factors, so • It was very easy to abuse and spam, therefore • Search Results had poor quality
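As an illustration of the "glimpse" those meta tags gave a search engine, the sketch below reads the description and keywords out of a page using Python's standard `html.parser`. The sample page, the `MetaTagParser` class and the `read_meta` helper are assumptions made for this example, not how any particular engine worked.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects the content of <meta name="description"> and <meta name="keywords">."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = (attributes.get("name") or "").lower()
        if name in ("description", "keywords"):
            self.meta[name] = attributes.get("content") or ""


# Made-up page used only for this example.
PAGE = """<html><head>
<title>Pizza Place</title>
<meta name="description" content="Wood-fired pizza in the city centre">
<meta name="keywords" content="pizza, restaurant, wood-fired">
</head><body>...</body></html>"""


def read_meta(html):
    """Return (description, list of keywords) as declared by the page itself."""
    parser = MetaTagParser()
    parser.feed(html)
    raw_keywords = parser.meta.get("keywords", "")
    keywords = [k.strip() for k in raw_keywords.split(",") if k.strip()]
    return parser.meta.get("description", ""), keywords


description, keywords = read_meta(PAGE)
```

Everything returned here is self-declared by the page author, which is why this signal was so easy to abuse.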
  • 18. Web2.0 • Is coined by... Tim O’Reilly, yet • TBL later said that “web2.0” is a stupid, meaningless term and that he thought of it first in ’96 anyway
  • 19. Web2.0 means Google • which stood apart because of • PageRank (1998) invented by • Larry & Sergey who adapted the algo from • An MIT professor who had developed • A nasty mathematical formula for positioning keywords in a 3d space model based on the relevancy that one kw holds … whatever
  • 20. PageRank actually means • That a link is a vote and • Not all links are created equal, so • It matters who links to you • Just like in our real life society
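The "a link is a vote, but not all votes are equal" idea can be sketched with the textbook power-iteration form of PageRank. This is a simplified illustration, not Google's production algorithm: the damping factor of 0.85, the iteration count and the tiny link graph are assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration over a dict: page -> list of linked pages."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page starts each round with the small "random jump" share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote equally among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its vote over everyone.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Made-up graph: "x" and "y" each have exactly one inbound link,
# but "x" is linked from the popular "hub" and "y" from the unknown "lonely".
LINKS = {
    "a": ["hub"],
    "b": ["hub"],
    "lonely": ["y"],
    "hub": ["x"],
    "x": ["hub"],
    "y": ["hub"],
}

rank = pagerank(LINKS)
```

In this graph `rank["x"]` comes out well above `rank["y"]` even though both have a single inbound link, which is the "it matters who links to you" point.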
  • 21. Google • Read the content of pages really well, just that • Pages were crappy: – Non‐standard coding – Ugly tech (like applets) – Senseless IA • So Google said: “don’t do evil and try to nicely format the info, according to W3C standards” (remember TBL)
  • 23. SEO • Is a multitude of practices aimed at facilitating the indexing of pages by search engines • Evolves as the ranking algorithm changes, and • Of course, the algorithm is kept secret.
  • 24. SEO actually means Courtesy of Kelly Ishikawa
  • 25. SEO actually means • An on‐going battle between bots & SEO guys • Now 100+ factors influence ranking • And I’d like to take the time to talk about each one of them in the following…
  • 27. My SEO Cheat Sheet • Consider: 1. Page Titles 2. URLs (mod_rewrite) 3. Anchor Text 4. Website Architecture (IA) 5. Link Title & Image Alt 6. Relevant content (text) 7. Sitemap.xml 8. Hosting 9. Freshness
  • 28. Resources • Matt Cutts Blog • Mihai’s SEO Cheat Sheet :D
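For item 7 on the cheat sheet, a sitemap can be generated with nothing more than Python's standard library. The sketch below follows the sitemaps.org 0.9 schema; the URLs, dates and the `build_sitemap` helper are placeholders invented for the example.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Build sitemap XML from (url, last-modified date) pairs and return it as a string."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "lastmod").text = lastmod  # W3C date format, e.g. YYYY-MM-DD
    return ET.tostring(urlset, encoding="unicode")


# Placeholder pages; note the readable URLs (item 2 on the list).
PAGES = [
    ("https://www.example.com/", "2008-02-15"),
    ("https://www.example.com/seo-cheat-sheet", "2008-02-10"),
]

sitemap_xml = build_sitemap(PAGES)
```

The resulting string would typically be saved as `sitemap.xml` at the site root; the `lastmod` values are also one way to signal freshness (item 9).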
  • 29. Web2.0 Problems • © for pictures, articles, books, etc • PPC fraud • Privacy • Search Engine SPAM • Link bombing • Paid links • But more important...
  • 30. Web2.0 Problems • SE still don’t understand what the $#%@ you’re talking about • Crawling a website’s interface to extract info is almost insane
  • 32. Web3.0 • Means semantic web • Attention migrates from syntax/formatting to semantics and • Meta Data (data about the data) becomes...
  • 33. Web3.0 • Resource Description Framework & Microformats
  • 34. Resource Description Framework • A data model, often written as XML (RDF/XML) • RDF = Subject + Predicate + Object • S + P + O creates a Triple which • Can describe almost anything in the universe • Triples are connectable (e.g. FOAF) • RDFa = XHTML + RDF (W3C compliant)
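A rough way to picture triples and how they connect is to model each one as a plain (subject, predicate, object) tuple. The sketch below is only an illustration of the data model: `foaf:knows` and `foaf:name` are real FOAF properties, but the `ex:` identifiers, the data and the helper functions are invented, and real RDF would use full URIs and an RDF library.

```python
# Each triple is (subject, predicate, object). The "ex:" people are made up.
TRIPLES = {
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:bob", "foaf:knows", "ex:carol"),
    ("ex:alice", "foaf:name", "Alice"),
    ("ex:bob", "foaf:name", "Bob"),
    ("ex:carol", "foaf:name", "Carol"),
}


def match(subject=None, predicate=None, obj=None):
    """Return sorted triples matching the given parts; None acts as a wildcard."""
    return sorted(
        t
        for t in TRIPLES
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    )


def friends_of_friends(person):
    """Follow foaf:knows twice: triples connect because one's object is another's subject."""
    direct = {o for _, _, o in match(person, "foaf:knows")}
    second = {o for friend in direct for _, _, o in match(friend, "foaf:knows")}
    return sorted(second - direct - {person})


fof = friends_of_friends("ex:alice")
```

Here `ex:bob` is the object of one triple and the subject of another, which is what lets separate statements join into a graph.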
  • 35. Microformats • hCalendar • hCard • rel‐tag • VoteLinks • XFN • Geo • hResume • hReview • etc
  • 37. SPARQL • SPARQL Protocol and RDF Query Language • Standardized on 15th Jan 08 (1 month ago) and • Endorsed by?... TBL “Trying to use the Semantic Web without SPARQL is like trying to use a relational database without SQL” TBL
  • 38. Potential • With SPARQL you skip the presentation layer • You can query ad‐hoc any API, so • You don’t need to crawl in advance, therefore • Information will be as fresh as it gets
  • 39. And possibilities • Query: “I can has pizza?” • Returns: – A friend of yours (XFN ‐ Facebook) – has a colleague (FOAF ‐ LinkedIn) who – said that they make good pizza (hReview ‐ yelp) at – a restaurant nearby (geo – Gmaps) – Tip: U2 in concert today (hCalendar ‐ upcoming)
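The pizza query is essentially a chain of triple patterns joined on shared variables, which is the kind of matching a SPARQL engine performs. The sketch below imitates that join in plain Python over a small in-memory list; it is not SPARQL and makes no network calls. All names, predicates and data are hypothetical stand-ins for what XFN, FOAF, hReview and geo sources might expose.

```python
# Made-up triples; the comment on each line names the source it stands in for.
TRIPLES = [
    ("me", "friend", "ana"),            # XFN
    ("ana", "colleague", "dan"),        # FOAF
    ("ana", "colleague", "eva"),        # FOAF
    ("dan", "recommends", "luigis"),    # hReview
    ("eva", "recommends", "farpizza"),  # hReview
    ("luigis", "near", "me"),           # geo
]


def solve(patterns, triples, binding=None):
    """Return every variable binding that satisfies all patterns ("?x" marks a variable)."""
    binding = binding or {}
    if not patterns:
        return [binding]
    first, rest = patterns[0], patterns[1:]
    results = []
    for triple in triples:
        candidate = dict(binding)
        for term, value in zip(first, triple):
            if term.startswith("?"):
                # Bind the variable, or reject if it is already bound to something else.
                if candidate.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            # This triple fits the first pattern; continue with the remaining ones.
            results.extend(solve(rest, triples, candidate))
    return results


# "I can has pizza?" written as four joined patterns.
QUERY = [
    ("me", "friend", "?friend"),
    ("?friend", "colleague", "?colleague"),
    ("?colleague", "recommends", "?place"),
    ("?place", "near", "me"),
]

answers = solve(QUERY, TRIPLES)
```

Only the chain through `dan` and `luigis` survives, because `farpizza` has no "near me" triple; no page was crawled or scraped to get there.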
  • 40. Perhaps now we can see • Why Social Networking Communities are worth so much, even though most of them don’t have a revenue model – Facebook – LinkedIn – Meebo – Bebo – Pipu... • They/We are the databases of the future
  • 41.
  • 42. Thanks! “Most of the right choices in SEO come from asking: What’s the best thing for the user?” Matt Cutts • Mihai Gheza • Creative Commons Attribution‐Noncommercial‐Share Alike 3.0 Unported License.