Rand Fishkin still doesn’t understand NoFollow

by Michael Martinez on June 11, 2008

UPDATE: Rand let me know he had revised his post to clear up the ambiguity about the effect of using rel=’nofollow’ in his remark.

I hate doing this but enough people follow SEOmoz that if Rand Fishkin posts something wrong, the chances are pretty good that a lot of people will start repeating the erroneous idea and then we have yet another war against SEO ignorance to wage.


Begin Emily Latilla Voice – Never Mind


Today Rand wrote: “Nofollowing a link from a page makes it pass more juice and ranking power through the links that are followed.”

Really? So, what we should infer from this remark is that of three possible permutations, only the third (in the list given below) will produce optimum outbound PageRank for your links:

  1. You place six followable outbound links on your page
  2. You place five followable outbound links on your page
  3. You place five followable outbound links on your page and one NOfollowed outbound link on your page

If we’re to believe Rand, option 3 gives your five outbound links more juice than option 2. Is that really what he meant to say? I hope not.

Using rel=’nofollow’ on some of your outbound links in no way gives more juice or power to your other outbound links. All it does is tell the search engines that honor nofollow to not count those links when evaluating links. Masking outbound links from search engines is indeed, as Rand points out, an old black hat SEO trick. It’s also about as effective as pouring a glass of clear water into the ocean.

There are many billions of Web documents in the major search engines’ indexes. Some of those documents pass PageRank-like value to other document. Some don’t. In the overall scheme of things, you’re not in any position to influence PageRank-like value distributions by using or not using rel=’nofollow’. You’ll do better in the long run by including valuable expressions in your meta tags.


End Emily Latilla Voice


Rand also wrote: “…to say that no one should ever [use rel='nofollow'] because it doesn’t work, or because they’re too dumb to figure out how to do it properly sounds like you’re either in denial or don’t want competition.”

Sounds to me like Rand doesn’t want to let go of a failed idea. Supposedly (I cannot find the blog post where I read this because at the time I didn’t think anyone would be bold enough to say something pro-nofollow so soon after the conference) someone asked an SMX Advanced audience (last week) how many of them had tried sculpting PageRank with NoFollow. Most people in the room raised their hands. They were then asked how many of them felt it helped. Most people in the room dropped their hands.

If you really could work some benefit from using “rel=’nofollow’”, an identifiable trend would have emerged by now. Despite the fact that people like me, Shari Thurow, and others pointed out just how much of a waste of time it would be for people to implement nofollow on internal links, many of you tried it anyway. Most of you appear to have reached the conclusion that it didn’t do anything for you.

It doesn’t take much of an intuitive leap to see that if you cannot improve your PageRank by telling search engines to ignore your own content, you’re not going to improve it by telling search engines to ignore other people’s content.

That’s called PageRank Hoarding, which won the Most Stupid SEO Idea of the Year for 2005 (PageRank Sculpting won Most Stupid SEO Idea of the Year for 2007). You cannot hoard PageRank for the same reason you cannot sculpt it: you cannot see it, measure it, or control it.

You don’t get to decide which of your pages obtain PageRank and you don’t get to decide which of your pages pass PageRank.

I think it’s also interesting that Rand put words into Matt Cutts’ mouth by writing “Matt Cutts said All Links to a Domain Help Every Page on that Domain Rank Better”.

I’ve read several reports about Matt’s comments and Matt Cutts said no such thing. Of course, Rand admits in his follow up comments that he is putting words into Matt’s mouth but that’s just morally and ethically wrong in my opinion.

In fact, statistically Rand only has about at most a small chance of being right. Matt reportedly said “The original PageRank was purely a page-level document.” (Source: SmallBusinessSEM Blog – Cf. previous link)

If Matt was implying that PageRank now works differently, here are some possible variations:

  1. PageRank could be assigned on a page-zone basis
  2. PageRank could be weighted on a page-zone basis
  3. An Ask-style context-sensitive PageRank might have replaced the old PageRank
  4. PageRank might now only be a “Main Index page-level document”
  5. PageRank might be filtered by neighborhood or Web zone

I can think of many more ways to get away from the original PageRank distribution model, several more likely (based on patent applications and technical papers) than Rand’s assumption that there must some sort of domain-level PageRank.

There is no statistical evidence for domain-level PageRank assignment. Nor is there any anecdotal evidence for it. There is plenty of speculation, of course.

Can you test for domain-level PageRank? Well, first of all, you have to be able to measure PageRank. Since we have no way of measuring PageRank, we have to find it the same way that astronomers find black holes — we have to look at PageRank’s effects on the Web space around it. How do you do that?

Example 1: There is a fairly well-known wiki-style site that links out freely to other sites. I have obtained some links from that wiki-style site for various sites. Some of the links appear to pass value. Some of the links appear NOT to pass value. All of the destination sites pass value through their own links. Conclusion: During the period of observation (lasting several months), the wiki-style domain did not pass value through all of its outbound links (which were on separate pages).

This is a strong indication that no domain-level PageRank was working, but it’s not proof.

Example 2: I have several value-passing domains that all link to a specific destination (call it MyTestSite). Links on the destination page don’t appear to pass value. My value-passing domains consistently help me launch new content into the major search engines’ indexes. The new content consistently obtains the ability to pass value through its own links. But MyTestSite doesn’t pass value through its links. Why?

Conclusion: Simply pointing links from trusted domains to a new domain does not guarantee the new domain will be trusted.

This is a strong indication that no domain-level PageRank is at work, either for the linking domains or the destination domain.

Example 3: There are several popular social media sites that people use for link placement. I have created profiles on some of those social media sites to test my theory of “child inheritance” (which I discussed in On The Googleness of Being in early 2005). I believe that “child inheritance” works in a limited fashion and has more to do with trust than with PageRank. Still, if there were such a thing as domain-level PageRank, one would expect that a profile page on a PageRank-rich domain should have some minimal impact in search results.

My test profiles did not even appear in search engines’ indexes for many weeks. When they eventually did, their links accomplished nothing. Now, more than a year later, those links still accomplish nothing.

Conclusion: Simply creating pages with outbound links on link-rich sites does not create any value for you.

This is a strong indication that no domain-level PageRank is at work, but it’s not proof.

However, if we assume for the sake of discussion that Google does have some sort of domain-level PageRank, they don’t have to use it for weighting links or adjusting relevance scores in search results. That is, a domain-level PageRank-like mechanism could be used to establish trust thresholds, to identify neighborhoods, to create filtering processes that help Google figure out where Web spam is, where new fountains of legitimate content have sprung up, etc. That is, by mapping relationships between domains, Google can analyze the Web without actually using that analysis to directly impact its linking valuations or search results.

Some people actually crawl the Web and collect terabytes of data without publishing that data. These home-crawl indexes are neither professionally designed nor professionally managed. These home-crawl indexes are used for “analysis” that is, in my opinion, shoddy at best.

One of the most common mistakes people make with their home-crawl analysis is that they treat their database as if it is similar to the major search engines’ database. This is about as stupid an idea as using Yahoo! SiteExplorer to do Google link analysis.

Another common mistake is that the home-crawl analysts don’t refresh their snapshots. Since they don’t have the resources to recrawl the Web once a month, they just lump all their old Web site images together with their most recent Web site images. What they end up analyzing is an Elephant Man metaphor for the Web that bears no resemblance to the real Web.

Some home-crawl analysts try to duplicate seed-based crawling. That is, they may start crawling the Web from Yahoo!’s directory or the Open Directory Project in the hope of duplicating the major search engines’ crawl patterns. However, without access to the major search engines’ filters, penalty lists, and ban lists the home-crawl crowd cannot help but index radically different versions of the Web from what the major search engines index.

Now, doing your own crawl is not necessarily a bad thing as long as you don’t try to second-guess what the major search engines are doing based on your own experience. If you become so bloated with home-crawl arrogance that you start making sweeping generalizations on the basis of your unsubstantiated analysis, you risk saying some pretty stupid things.

I seriously doubt we’ll ever get the SEO pundits to stop advocating the use of “rel=’nofollow’”. Every time someone claims to have accomplished real results with it, desperate people in the audience will go away and think, “If it worked for him it can work for me”.

The truth is that if you’ve concluded that using “rel=’nofollow’” has somehow helped you, then you’ve missed something BIG. Nofollow is not a solution. It’s just a band-aid that covers up a problem.

{ 11 comments… read them below or add one }

edelabar 06.11.08 at 9:38 am

I’m not going to necessarily disagree with you, but looking at the original idea of pagerank I think I’d have to agree with Rand.

http://citeseer.ist.psu.edu/cache/papers/cs/7144/http:zSzzSzwww-db.stanford.eduzSz~backrubzSzpageranksub.pdf/page98pagerank.pdf

So, assuming the calculation of page rank is still close to the way this document describes it, and nofollow links are not counted as links, if a page has 5 followable links and 1 nofollow link, it should pass equal juice to example two which has only five followable links and 0 nofollow links.

Michael Martinez 06.11.08 at 10:35 am

You missed my point (OR I just did a poor job of presenting it). Using “nofollow” on a page is no different from not putting the link on the page to begin with. But the way Rand stated it in his post today, it sounds like using nofollow on a 6th link makes the other 5 links more valuable than if there were only 5 links on the page.

And that makes no sense whatsoever. I hope that is not what Rand intended to say.

Based on what has been published about PageRank, the 5 links should carry equal weight in both option 2 and option 3. They get no boon from being accompanied by a 6th nofollowed link.

It would require an immense number of links for us to test a hypothesis like that. Perhaps no fewer than 1,000,000 linking pages would be required. It’s not worth the effort, as the difference in PageRank that is distributed across 5 links versus 6 links is absolutely microscopic.

The whole idea of trying to manage PageRank by limiting the number of value-passing links on your page is completely ridiculous. You might as well try herding microbes.

edelabar 06.11.08 at 10:44 am

“Using “nofollow” on a page is no different from not putting the link on the page to begin with.”

That’s basically my point. I can definitely see cases where providing the link causes a better user experience, but just because you provide it doesn’t mean you need to share your juice with it. Too many SEOs forget that there’s a user looking at these pages too.

DangerMouse 06.11.08 at 10:55 am

I agree thats Rand’s comment was poorly phrased, however I’m suprised that you’ve jumped upon such a minor error with such vigour (imo it’s clearly a case of a loose sentence rather than the meaning you suggest). Generally I agree with your points about ‘PR sculpting’ being an imperfect science (and in many cases not neccessary) and as a result you cannot demonstrate clear ROI from the activity, however should the limited information available on PR be accurate surely you must concede that ‘no-follow’ theoretically can have some effect on the flow of PR style authority around a sites internal architecture?

seomexfan 06.11.08 at 11:02 am

Michael.

You did make your point crystal clear. Even me, a Mexican with English as a second language understood your point.

I think that what happens here, it´s that several guys in SEO think that Rand´s assumptions and comments (specially when Rand makes a refference to Matt Cutts) are “All-Mighty” Trues and Facts. However, this cannot be more far from true.

This morning, when I red Rand´s Post about this subject (Seven Short SEO Scribbles), to be honest I was kind of confused, and I thought I misunderstood Rand´s point, maybe because of my poor English, so I had to re-read it twice and more carefully. But, it didn´t make any sense to me that a page with X number of followable links and Y number of nofollow links, will pass more Link-Juice (PR) to those followable links.

Michael, thank you for making a clarification about this issue.

By the way, You made me laugh with this “You might as well try herding Microbes.”

Saludos.

Carlos 06.11.08 at 2:03 pm

I certainly agree that rel=nofollow is a bad choice for anyone with long-term ranking in mind, but opening post with a semantic attack hurts this posts total value.

The use of neutered links is old-school. Why are people switching to nofollow? Probably laziness, it takes less time than javascripting. Or maybe they just spontaneously forgot the other technology options to make the major search bots turn a blind eye.

You use the example of changing 6 links to 5 links being a minimal, that is probably not the case (5 way redistribution of 17% of total value). The redistribution at small numbers is likely far greater than most nofollow schemes are working with, since they are usually nofollowing about 12 links on pages that have between 75-200 links. For example Zappos nofollows 39 out of 193 links on their homepage (~20%). So Zappos is redistributing 20% of their total value across 154 links. And as you point out total value is an unknown.

Michael Martinez 06.11.08 at 2:07 pm

DangerMouse: “… however should the limited information available on PR be accurate surely you must concede that ‘no-follow’ theoretically can have some effect on the flow of PR style authority around a sites internal architecture?”

Michael: If you’re referring to the Google Toolbar PR values (which most people in the SEO community equate with a putative logarithmic equation), Googlers have already told us that by the time the PR data is published it’s out-of-date. Unless one uses a very broad, flexible, and unrealistic definition for “accurate”, I don’t see how one can say we have any accurate information to work with.

If you’re referring to something else, it will help me if you’re a little more specific. I admit to being entirely ignorant about any other published PageRank data (except for the valuations that have been shared through the Google Directory — but I believe someone recently reported those valuations were modified to conform to the Toolbar’s reported data).

There really is no way to determine which pages have PageRank, what the PageRank is, and which links are passing PageRank. You can sort of guess — based on crawl rates and the passing of anchor text — which pages probably have some decent PageRank, but how do you reliably quantify that information?

DangerMouse 06.12.08 at 1:35 am

My meaning was a little unclear then, I was referring to the now out of date PR algo, where I thought it was generally accepted that the a page may distribute some of this mysterious ‘value’ to other pages via outbound links. I agree completely with your point that it is not possible to accurately quantify what ‘value’ a page currently holds and what ‘value’ it shares.

deInternetMarketeer 06.14.08 at 3:41 am

Using nofollow to try to sculpt pagerank is saying to Google : Hey, i’m trying to influence your rankings.

I’m wondering how long it will take before we have nofollow penalties …

But to be honest, nofollow should completely be dropped by Google. It’s not normal that 1 (!) search engine decides how websites need to be builded.

Especially not if you have to use it to help ‘big billion Google’ to make their SERP’s.
Crawl, calculate and make the rankings. That’s what they have to do, not saying how websites need to be builded to rank in their search engine.

seo_scientist 06.15.08 at 1:41 pm

while i agree that it is not something that will make or break a site (well, maybe break :) ), i think that on a theoretical level, yes, if a site has 6 outgoing links and one is nofollowed, then the other 5 will pass more linkjuice than if there was no nofollow. The question about whether it would be the same amount of linkjuice if there were 5 outgoing links in the first place is a legit one and should be answered empirically, but my gut feeling says that it should be the same.
Since not placing all the links that you do not want to leech linkjuice from your ranking pages is not a viable option for the majority of websites, nofollow would be the way to achieve the same result.

Again, I am not prescribing this as something that is utterly important for all websites, or even for a majority of websites, I am discussing it only on a theoretical level.

Another thing, I think that everyone would be much better off if this discussion was limited to mentioning Rand only for the quoting purposes. Otherwise, it could be perceived (IMHO) that one of the purposes of the [i]ad hominem[/i] arguments are taking a shot at the big dog…

And you really should try herding microbes. It actually works wonderfully. You can order them into tiny spots on a plate, when each spot represents a different population of microbes, according to the gene you have inserted into a plasmid that they received. Pretty easy to do, once you get a gist of it… ;)

John H. Gohde 06.17.08 at 8:53 am

The entire idea behind using rel=’nofollow’ was to prevent spam. Google intended it to be used by software developers only. And, it is just as obvious for me to see that SEO bloggers with their never ending need to write new content started to quote Google out of context in order to justify their nonsense. And, just as predictably all the other SEO bloggers blindly repeated this rel=’nofollow’ nonsense due to their herd mentality.

Gives me a chuckle and plenty of opportunity for writing new posts on just how obviously stupid the entire notion really is.