Last Updated on August 28, 2016 by Alex Miller
At PosiRank, we’ve been teaching the importance of addressing low quality pages for many years now. Why? Because it’s just SO effective at boosting rankings.
1 Sentence Summary: Google HATES indexing crappy pages in their index and they’ll punish you for it. Plus, these low quality pages dilute the link equity to your higher quality pages. Not good.
Ok, any skepticism should be totally removed now… 🙂
Here’s The Truth:
If you have low quality pages in the index (this mostly covers “thin content” pages and/or pages with duplicate content) this means those specific pages, as well as your entire site, will have a reduced quality score.
Yes, your entire site gets affected by those other “selfish” pages. Not good.
In other words, Google might look at your site and say:
“Ok, this site has 100 pages – 60 of them are great (lots of high quality unique content) but 40 of them are pretty terrible (low quality pages).
Consequently, let’s give them a quality score of say…5/10.”
But – if you fixed a bunch of those 40 pages and reduced down to say 5 low quality pages, your quality score overall might shoot up from a 5/10 –> 8/10.
What’s cool is that as you start to fix these pages (more detail below on this), you’ll not only see those pages perform a lot better – but your site in general will rank higher across the board.
All of your pages will inflate, to some degree.
This is because:
a) Your quality score increases
b) Your sites internal link equity increases to your remaining indexed pages (vs being diluted)
Pretty neat, eh?
What makes this even better is that Google will notice these improvements (and reward you) within usually 2-3 weeks of crawling your site and seeing the changes.
What Results Can I Expect?
An almost impossible question to answer, but on a page-specific basis, you could easily see ranking increases of 25-400%+.
Consider how drastic an improvement it is to turn a 150 word article into a 3k word high quality, super in-depth resource that helps users. That’s easily going to be a 200-400%+ gain.
The more pages you fix, the more your site as a whole will benefit – meaning all of your pages will boost. You can’t do this all in one evening, but build it into your schedule to regularly fix a handful of pages each day / week (or whatever you can do). The results make the exercise addictive 🙂
A friend of mine came to me for some SEO advice about a year ago.
He had a 400+ page site in the travel space. I noticed he had about 70 tag pages indexed. ALL we did initially was to de-index those tag pages (after confirming they were getting no traffic).
A 25% increase in traffic within weeks which added another ~80 visitors / day. Not bad for quickly getting rid of a bunch of junk pages! (and there’s so much more that can be done than just de-indexing tag pages).
Unless you don’t like the idea of serious ranking increases in under 2 weeks, then please read on folks!
Step 1 – Understand What An Indexed, “Low Quality” Page Actually Is
Important: We are only interested in pages that are indexed in Google.
This means that you can in theory put anything on your site that isn’t high quality, just as long as it’s no-indexed in Google. Of course, don’t feed your visitors from the garbage can 🙂 But my point is that it’s the indexed pages that cause harm to your rankings.
An example of publishing a “low quality” page (but keeping it no-indexed) is sharing with your visitors a relevant 3rd party press release online that they might find interesting and then posting it on your site. That’s absolutely fine as long as it’s set to “noindex”. By doing so, you won’t cause any harm to your rankings.
9 Main Types of Low Quality Pages (To Find)
|PAGE TYPE||WHY THEY ARE A PROBLEM (AND SHOULD BE ADDRESSED!)|
|CATEGORY PAGES||If you think about a standard category page, it’s just a list of posts on your site combined with a few lines of text taken directly from those articles.|
Nothing on these category pages is unique, maybe apart from a few words up top. This is “internal duplicate content” and that’s why they’re problematic and should be addressed.
|TAG PAGES||Same reason as “Category Pages” (it's all duplicate content)|
|ARCHIVE PAGES||Same reason as “Category Pages” (it's all duplicate content)|
|AUTHOR PAGES||Same reason as “Category Pages” (it's all duplicate content)|
|OUTDATED PAGES||For example, pages that cover topics which are no longer relevant to your audience. This of course includes old pages from previous versions of the site (we often see this!).|
|THIN CONTENT PAGES||Assuming you’ve categorized the previous page type (old & stale), then you’ll likely find a LOT of pages that are thin on the amount of content that they have.|
|DUPLICATE CONTENT PAGES||As you know, Google hates pages that have duplicate content on them – whether that’s internal DC (same content repeated throughout your site) or external DC (content taken from other sites). They both cause serious harm to the quality score of a page.|
|ADMIN PAGES, PDF'S & “Behind The Scenes” URL’s||For large sites, you’ll be amazed at these sorts of pages that shouldn’t be in the index. Be careful with PDF’s because Google do crawl them and if the content is already on your site (or published first elsewhere) then it’ll hurt your entire domain due to a downgrade in quality score.|
|PAGES FROM OLD SITE VERSIONS (inc. “Dev” pages)||We’ve seen far too many cases where an entirely duplicate site will be on a subdomain which was used when a site was designed, or even as a “staging” site.|
Step 2 – Find These Indexed, Low Quality Pages in Google
Tip! Once you start to identify pages that need addressing, I recommend organizing those URL’s in a basic spreadsheet or document. Put each URL under its respective category (see above for types of LQ pages). This organization will be important for your sanity!
OK – now that you’re hopefully familiar with what you’re looking for, let’s dig into a few methods of finding them:
Recommended Options (you’ll probably need to use 1-2 of these methods)
Method 1: Site Operator Search (Great for seeing all indexed pages in Google – works well for smaller sites)
Simply go to Google.com (or whichever Google TLD is relevant to your site) and type in:
This will show all of the pages that are indexed in Google. This is the most manual way of doing it and for sites of moderate size (under 500 pages or so), I like this method.
It’s especially good for sites under 100 pages and avoids you having to fire up a tool to download all the pages.
Tip: First thing you should do is go to the last results page for this query and click on the link shown below (the message will look similar to this):
Why? Simply because we want to see everything that Google is indexing so nothing is missed out.
Method 2: Use Tools to Download All of Your Pages
Below are a selection of 3 tools that you can use (there are definitely more out there) to download all of your indexed pages.
Of course, once you’ve downloaded all the pages you can start to filter through them and categorize those appropriately.
Here are the 3 suggested tools:
Method 3: Find Very Low Traffic Pages (High Chance of being “Low Quality”)
This is a neat method to effortlessly find very low traffic volume pages for your site that need help.
Go to your analytics and filter by pages that received less than 10 visitors from Search in the last 30-60 days. That’s a great way to find them.
Warning: This WON’T find pages that get zero traffic – make sure you use one of the other methods to find those pages as they will be reducing your quality score.
And – Here’s How To Find Pages with Duplicate Content!
I wrote up a handy guide on how to use these 2 tools to hunt down duplicate content so you can get them fixed up!
Step 3 – Fix These “Problem Pages”
Here Are The Options Available:
1. Improve The Quality of The Page (highly recommended)
(You’ll only want to spend the effort boosting pages that you actually want to rank in Google. If you don’t need them anymore, go with option 2).
For the most part, these page types below are those you’ll want to look at improving.
* “Thin” content pages & articles (that are still relevant) – add more content to bolster these pages!
* Duplicate content pages & articles (that are still relevant) – remove the duplicate content and replace with fresh, unique and relevant content!
My Thoughts on Category Pages
Instead of instantly deciding to deindex category pages (which I would for TAG pages by the way), categories pages are often a LOT more visual and should be worked on and kept in the index.
For example, most ecommerce stores rely on their category pages to guide the user to where they want to go. These are pages you want to rank!
But, it’s likely that if you were to look at the traffic metrics of your category pages right now that it wouldn’t be great news. In 95% of cases, this will be because there is no unique, descriptive content on that category page which helps Google understand what it’s about.
What I’d therefore recommend is adding relevant content to the bottom (or the top, or both!) of your category pages so that you boost their quality and get them ranked. Instead of a page filled with duplicate content, help Google understand your category page and make the effort to produce original content.
You should be pointing links into your category pages so that it funnels through the rest of your site equally, and so your category pages probably aren’t far from performing well. Go ahead and get busy writing and you’ll see some excellent results.
2. De-index The Page
Critical! Before you decide to de-index ANY page, you must check the traffic that page is getting first in your analytics.
Google’s algorithm certainly isn’t perfect and there are times when low quality pages will be ranking (somewhat) in the spotlight and are getting search traffic.
So, if you just go ahead and de-index that page you’ll of course lose the traffic that page had. Don’t make that mistake!
It’s exciting when you see a low quality pages ranking fairly well because you just know that if you enhance the quality, it’ll do even better (and likely a LOT better).
Therefore – please login to your analytics (Google Analytics, Clicky etc) and find the traffic stats of any page before de-indexing.
If a low quality page (such as a category page, tag page etc) IS indeed getting traffic then I would recommend either:
1 – Boosting The Quality (see option 1 above)
2 – Consider redirecting the link equity to a more relevant page (see option 3)
Choosing to de-index a page applies mostly to the following page types:
* Archive Pages
* Author Pages
* Tag Pages
* Old and “stale” blog posts and articles
* Admin pages, PDF’s, “behind the scenes” URL’s
* Pages from previous site versions (including “Dev” pages)
* Duplicate content pages & articles (see my example above which spoke about a scenario where you might want to have a duplicated press release on your site because it’s something your audience would enjoy. But, if it’s indexed you’ll need to deindex it so that it’s not reducing your sites’ quality score and aggravating “The Panda”.
3. 301 Redirects
More often than – especially with larger sites that have more pages indexed – you’ll find 2 or 3 pages that cover the same topic during your site audit.
What I like to sometimes consider in these scenarios is 301 redirecting the inferior pages to the page that Google is ranking highest / giving the most exposure to.
This cleans up a duplicate content issue (or an issue where Google is confused at which page to rank) and it’ll boost the authority of the strongest page, due to the additional link equity being funneled its way.
Of course, you can take it a step further and follow Option 1 to take it up another couple of levels.
Conversely, if you want to keep let’s say 3 pages that talk about the same topic, you’ll want to ensure that each page addresses a different angle / sub-topic.
For example, don’t have 3 pages on your site talking about Link Building strategies unless they are each covering completely different topics of link building. (e.g. Link building for local SEO campaigns; link building for Ecommerce sites; link building for YouTube videos).
Ultimately, when it comes to onsite optimization – we’ve seen that there’s generally two groups of people:
Group 1: People that find onsite stuff really tedious, and therefore just try to make up the difference with stronger “offsite”.
Group 2: People that find onsite stuff really tedious, but realize its importance in in the SEO equation. Much like an aircraft, they realize that too much baggage will suppress (or even prevent) lift, and so they simply do what’s necessary to ensure their rankings can take flight.
I realize that I’ve been beating this topic to death lately – but the reason why is because this is truly the crux of whether or not a campaign will succeed. We see this every day…
I’d strongly encourage you to join the 2nd Group. You don’t have to “enjoy” onsite optimization. (No sane person does).
But the results are worth the temporary pain. And, furthermore – you don’t have to bear the load all by yourself. There’s plenty of on-demand options for you in the PosiRank platform to (greatly) reduce your workload.
But for the love of results… take this seriously. It’s usually THE deciding factor on whether organic marketing is viable.
Chief Explainer of Onsite Importance