SEO Automation for Expansive Websites #SMX #14A

Michael Nguyen at SMX West
Michael Nguyen, director of SEO at Connexity/Shopzilla, on stage at SMX West 2015

This SMX West 2015 mini-session titled “SEO Platforms for Large Scale Websites” is a presentation grounded in the belief that you can automate everything for SEO and directly integrate it to your products. This is especially important for large websites. Michael Nguyen, director of SEO at Connexity/Shopzilla, is our presenter.

SEO for Large Sites

What is large? Too big to fit in your head. Examples are networks of sites and sites with thousands or millions of pieces of content. Enterprise sites and long-tail businesses are also considered large.

Large sites have big problems. Good things about large sites include domain authority and a lot of content. But problems include:

  • Too many products!
  • Duplicate content and canonical issues
  • Crawl efficiency
  • Shifting inventory
  • Shifting search demand

This is a discovery optimization problem. How do you make it easy for users and search engines to get to the content you care about? The solution is to organize content, surface content and improve the content value.

Checklist if you’ve inherited a large site and run into these problems:

  • Identify valuable content: Figure out the pages/products that are valuable.
  • Keyword research: Look for natural language and not jargon.
  • Site architecture: Manually create a taxonomy with the help of info architects and keyword research.
  • Content for category pages: Manually create content for landing pages to merchandise products/content.
  • Flag dupe content: Audit content and map duplicate pages to a canonical URL.
  • Deal with stale inventory: Audit content/products for removal and canonicalization. If it’s temporary, keep it up; if it’s permanent, redirect or 404.
  • Deal with pagination and facets: Create rules for managing crawl and technical SEO. Control the crawl.
  • Promote high quality content: Remove the junk, market the good and be selective.
  • Build category linking support: Link to categories and subcategories in all key areas. Flatten the crawl architecture by linking across deep pages. Remember links come in from everywhere.

You’re done! … Not so fast. If you’re doing the whole process by hand, you find you run into changes as you go. Change is constant. Inventory and business changes. Not a one-time deal. Doing it manually doesn’t scale for large operations. Many variables to manage. The answer is automation. Scale your operation with technology and process.

How to Automate

1. Identify routine tasks. Figure out how often you do a task. Start small. Automate very small tasks to start.
2. Improve operational efficiency. Focus on speeding up processes.
3. Test and validate. When you start to trust an automated system, check to make sure it’s doing the job right.
4. Keep in mind the bigger picture.


Great SEO Platforms

Characteristics of a great SEO platform:

  • Make the core product better
  • Enable testing and experimentation
  • Leverage big data
  • Combine data and expert intuition
  • Dynamic content management
  • Related searches
  • Page scoring: If you can evaluate content with KPIs, you can direct users to pages that matter the most. This is business-specific scoring.
  • Duplicate content classification: Ensure that you only promote your most useful content.
  • Backlink classification: Easily audit a large number of backlinks.

Get to Good Enough

You can’t ever get to perfection. You’ll never automate yourself out of a job. Get to good enough first. Try to get value out of every step. APIs are your friend; you don’t always have to build things in-house. Offload tasks and data gathering (AuthorityLabs, GA/GWT, Deepcrawl, Botify). Utilize your internal search engine to understand search engine concepts like keyword data. Leverage data and search science, like books on informational retrieval and open source tools (sentiment analysis, spam filters, duplicate content classifiers; he gives Mahout, Hadoop, HBase as examples). Build a feedback loop into your system to generate data, make judgments, have it continually understand what’s going on and loop it back into the system.

This is the future of SEO.

Virginia Nussey is the director of content marketing at MobileMonkey. Prior to joining this startup in 2018, Virginia was the operations and content manager at Bruce Clay Inc., having joined the company in 2008 as a writer and blogger.

See Virginia's author page for links to connect on social media.

Comments (0)
Filed under: SEO — Tags: ,
Still on the hunt for actionable tips and insights? Each of these recent SEO posts is better than the last!
Bruce Clay on May 8, 2024
The Always Up-to-Date SEO Checklist
Bruce Clay on April 4, 2024
What Makes an SEO Program Successful?
Bruce Clay on April 3, 2024
How Can I Improve My Website Rankings Through SEO?


Your email address will not be published. Required fields are marked *

Serving North America based in the Los Angeles Metropolitan Area
Bruce Clay, Inc. | PO Box 1338 | Moorpark CA, 93020
Voice: 1-805-517-1900 | Toll Free: 1-866-517-1900 | Fax: 1-805-517-1919