Machine Learning is becoming a more and more important part of everything Google does, but can seem quite inaccessible to learn about.
This presentation doesn't try to teach you how to do ML, but focuses instead on showing you the types of problems that ML can address, how Google have used it previously, and how they might use it in the future.
23. What we can do is
identify spammy or
non-spammy
attributes.
24. Are there adverts on the page?
Are there lots of spelling mistakes?
Is there little text content?
Are there Calls To Action in ALL CAPS?
Some Possible Spam Signals
26. List of pages we’ve
manually classified.
List of attributes that we
believe are important to
classifying pages.
27. adverts
on page?
more than 5
spelling
mistakes?
less than 200
words of
content?
CTA in ALL
CAPS?
site A Y Y Y Y Spam Site
site B N N Y Y Good Site
site C Y N N N Spam Site
site D N Y N Y Spam Site
site E N Y N N Good Site
Example Data
33. adverts
on page?
more than 5
spelling
mistakes?
less than 200
words of
content?
CTA in ALL
CAPS?
site A Y Y Y Y Spam Site
site B N N Y Y Good Site
site C Y N N N Spam Site
site D N Y N Y Spam Site
site E N Y N N Good Site
Example Data