2. Hi, I’m Nick Eubanks
❏ First business (painting) at 19 years old. $280k in revenue
during summer of 2003.
❏ Launch first digital company in 2008 (atomni), built CMS for
rapid deployment of Microsites. Sold.
❏ Build custom CMS for reviews in Japan. Angel funded. Sold.
❏ Built and sold lead-gen websites in medical, legal, and SEO.
❏ Joined TrafficSafetyStore.com in May 2012. Grew from
~$3M to well north of ~$20 Million
❏ Currently split time between NK Tech and IFTF.
Nick Eubanks
Founder and CEO, I’m From The Future
Founder, TrafficThinkTank.com
Executive Director, NK Tech
Owner, ADDHero.com
@nick_eubanks
3. We All Want More Traffic to Our Website
@nick_eubanks
6. Understanding Search Engines
❏ Process of information retrieval; NLP
❏ Training machines to “understand” text
❏ Identifying patterns; topics
❏ Identifying importance; term frequency
@nick_eubanks
7. In Comes TF*IDF
❏ Term frequency x inverse document Frequency
❏ Process for machine-based information retrieval
❏ Calculates weights based on frequency of terms used throughout entire document set
@nick_eubanks
8. What does that look like...
❏ Defining term “weights” based on the
frequency of use through the
document
❏ Removal of common “stop” words like
the, and, is, because, and so on
❏ Excluding common HTML elements
such as words used in header, footer,
sidebars, and navigational elements
@nick_eubanks
9. Topic Identification
❏ Identify themes across terms to group
into “topics”
❏ Ahrefs has some of this
❏ Much better with trained corpora
using tf*idf
❏ Can do manually if you have the
patience
❏ Use topics to design your content
calendars
@nick_eubanks
10. Building Relevance
❏ Process:
Keyword research > Topic
Identification > Theme
❏ Pillar content = Top-level theme
❏ Build topic content in clusters
❏ Example:
https://www.hubspot.com/marketing
-statistics
@nick_eubanks
11. Imagine this Scenario
❏ I gave you a stack of 1,000 documents
❏ Could you answer these questions:
❏ How would you organize them?
❏ What if I asked what they were
about?
❏ What if I asked for all the documents
about watermelons?
❏ How would you provide this
information?
@nick_eubanks
12. You would need to create a topic model
❏ To do this, you could use a set of different
colored highlighters
❏ Then scan the documents looking for
words that are related to each other, and
highlight them
❏ Then you would create an index
❏ You would come up with a topic for
each set of related terms
❏ You would annotate which pages
terms relevant to that topic
appeared on
@nick_eubanks
18. What are Word Vectors
❏ Simply, a vector of weights
❏ Word2Vec
❏ Pre-trained
❏ King - man + woman = queen
❏ Spatial relations where proximity
represents relevance
@nick_eubanks
19. Term Clustering into
Topics using LDA
❏ Tf*idf has evolved - a look at latent dirichlet
allocation [LDA]
❏ LDA is a generative statistical model
❏ Ability to generate probabilities; this is
where it gets interesting...
@nick_eubanks
20.
21. Topic Clustering
❏ Grouping topics into themes
❏ Where keyword research becomes
important
❏ Building corpora of documents
representative of relevant topics
❏ Scraping Google really helps with
this
@nick_eubanks
22. How Does This Get You More SEO Traffic?
@nick_eubanks
23. What if you knew...
❏ The terms Google was expecting to find
❏ The topics Google was expecting to find
❏ The concepts that should be linked to
❏ The concepts and content that your page
should be linked from?
@nick_eubanks
24. Not “Keyword Density”
❏ Dial in the term weights from tf*idf for a
specific document
❏ Stuff in a bunch of keywords because you
see them weighted heavily in the top
ranking pages
❏ Google is much, much smarter than this
❏ Look for terms that represent topics
❏ Use those topics to perform additional
keyword research
❏ Explore the pages currently ranking, and
analyze their
❏ Internal links (both directions)
❏ External links (both directions)
Don’t Do
@nick_eubanks