The basics of SEO are technical accessibility, relevance, quality, and authority. Or: can it be crawled, does it meet a keyword need, and is it trustworthy? In each of these areas, we need to build on solid foundational understanding, and find the areas where advanced understanding will give us an edge. Will’s recent research has shown common gaps in understanding, and highlighted interesting advanced topics. In this wide-ranging session, he guarantees you’ll learn something, and you’ll come away with training guidelines for the basics.
52. Luckily released their
parser with source:
“The library is slightly
modified (i.e. some internal
headers and equivalent
symbols) production code
used by Googlebot”
53. /*static*/ absl::string_view RobotsMatcher::ExtractUserAgent(
absl::string_view user_agent) {
// Allowed characters in user-agent are [a-zA-Z_-].
const char* end = user_agent.data();
while (absl::ascii_isalpha(*end) || *end == '-' || *end == '_') {
++end;
}
return user_agent.substr(0, end - user_agent.data());
}
Source: open source robots.txt parser
54. /*static*/ absl::string_view RobotsMatcher::ExtractUserAgent(
absl::string_view user_agent) {
// Allowed characters in user-agent are [a-zA-Z_-].
const char* end = user_agent.data();
while (absl::ascii_isalpha(*end) || *end == '-' || *end == '_') {
++end;
}
return user_agent.substr(0, end - user_agent.data());
}
Source: open source robots.txt parser
55. // Allowed characters in user-agent
are [a-zA-Z_-].
Source: open source robots.txt parser
62. Mozilla/5.0 AppleWebKit/537.36
(KHTML, like Gecko; compatible;
Googlebot/2.1;
+http://www.google.com/bot.html)
Chrome/W.X.Y.Z Safari/537.36
Source: updating the user agent of Googlebot
63. Mozilla/5.0 AppleWebKit/537.36
(KHTML, like Gecko; compatible;
Googlebot/2.1;
+http://www.google.com/bot.html)
Chrome/W.X.Y.Z Safari/537.36
Source: updating the user agent of Googlebot
The “token”
132. Thanks to @KaneJamison for the reference link
Collection
frequency (cf)
Document
frequency (df)
try 10,422
insurance
133. Thanks to @KaneJamison for the reference link
Collection
frequency (cf)
Document
frequency (df)
try 10,422
insurance 10,440
134. Thanks to @KaneJamison for the reference link
Collection
frequency (cf)
Document
frequency (df)
try 10,422 8,760
insurance 10,440
135. Thanks to @KaneJamison for the reference link
Collection
frequency (cf)
Document
frequency (df)
try 10,422 8,760
insurance 10,440 3,997
136. Thanks to @KaneJamison for the reference link
Inverse
collection
frequency (icf)
Inverse
document
frequency (idf)
try 0.000096 0.000114
insurance 0.000096 0.000250
137. Thanks to @KaneJamison for the reference link
Inverse
collection
frequency (icf)
Inverse
document
frequency (idf)
try 0.000096 0.000114
insurance 0.000096 0.000250
192. ● Citicorp Center
● Spider web
● Ice cream
● Lightning
● Robot
● Gate
● St. John’s
● Who wants to be a millionaire
● Fire
● Long tail
● Lisa Schneider
● Confused
● Hay stack
● Graph
● Cookies
● London