Linking patterns: Why link-heavy pages win, lose, or draw
Posted by Michael Martinez on September 18, 2007 in Link Theory
How many links per page is okay?
John Biundo made an interesting point in Ten Lessons From SES San Jose. Let me quote him for you:
2. Who says you should only have 200 links per page?
This is something that has passed as wisdom since I’ve been following SEO, and it’s always made some sense to me. Indeed, when asked directly, representatives from Ask and Google seemed at first to agree. Their common response was effectively “gosh, if you’re designing pages with more than 200 links on them, you have to ask yourself whether you’re doing the user any favors.â€When reminded that user sitemaps often have more than 200 links, both vendors said, in effect, “yeah, well, that’s probably OK.†Both acknowledged that there’s no hard limit, but that with some notable exceptions, pages with lots and lots of links could start to look like spam. I take the usability comment to heart, and in general, will continue to advise my clients to try to design pages that users can easily understand and navigate, which will naturally tend to produce pages with fewer than a couple of hundred outbound links.
Usability is not defined by how many links you have on a page or how much scrolling the user has to endure to find a specific link. That’s more of an accessibility issue and that is why you want your site search tools to be easy to find and use (regrettably, my site search plans usually fall short of perfection).
What is the chief difference between an HTML sitemap with 200 links and a reciprocal links page with 40-60 links? The answer to that question should be obvious. But how many of you would want to say, “The sitemap links to relevant content”?
However, a coherent Web site (like an eclectic blog, for example) can have several hundred pages of content on as many topics. The HTML sitemap would not be linking to “relevant content”, would it?
Analyzing link and URL structure for meaning
How many of you would look at the URL structure of the links on the HTML sitemap and say, “They’re all pointing to the same host”? Does linking to the same host really make that big a difference to a search engine? For example, what if the host is home to 100 separate user accounts, each distinguished in its URL structure only by a tilde (~)?
There was a time when URL theorists argued that sub-directories designated by a tilde were probably treated as separate hosts by search engines. Those were the pre-Google dominance days. How do you think such URL structures work now?
If the majority of a site’s content is scattered across sub-domains, as in an information portal that uses sub-domains for each city, state, or topic category, should all those sub-domains be treated as a single host by the search engines? Do you think they do treat them that way?
So then why should search engines care if you place 300 links on a Web page? In fact, why should anyone care? There is no sure-fire algorithmic way to ensure that every URL on a domain (with or without all its sub-domains) really belongs to the same “host” or “site”. That’s a limitation of search engine technology, not of Web technology.
What are Web sites and who cares?
The Web is designed to be fluid, tolerant, and pervasive. It is intended to connect everything that can be connected. “Web sites” are afterthoughts and because they are afterthoughts they are not easy to define. If you and I cannot agree on what constitutes a Web site, then how are the search engines — who only want to document the Web content we create — to figure out what constitutes a Web site?
The link between accessible Web sites and optimization
Now let’s go back to that accessibility issue. If we blunder through our Web design, confusing search engines with blurred distinctions between Web sites, hosts, and domains, what happens to the people who are dependent upon those guides we are expected to place in our page design for their convenience?
Many SEOs know there is a connection between strong on-page optimization and good accessibility. But people have been hard pressed to explain why, other than to say that the easier it is to find what you are looking for, the easier it is to rank for what people want to find.
Nonetheless, accessibility doesn’t help us to distinguish between Web sites. Hence, accessibility is not a standard we can use to measure the boundaries of our content. Those boundaries, however, are important because they establish rules of behavior for search engines, people, and accessibility software.
What is strong linkage?
When I speak of on-site optimization I usually say you want “tight linkage”, “strong internal linkage”, “good linkage”, etc. I mean more than just putting a lot of links between your pages, however. You need to create meaningful links. Drawing connections between relevant pages is one way to create meaningful links. Putting descriptive anchor text in your links is another way to construct them.
How do URLs help Web site structure?
But if you step back and consider your naming conventions, you’ll see that embedding keywords in URLs strips you of some of that tight internal linkage. That isn’t to say that I wouldn’t want to use Grace Park’s name in a Grace Park URL. Rather, I want to make sure that the whole URL (xenite.org/features/grace-park/) tells both me and my visitors something intuitive about the content. In this case, it’s a “feature”.
Spend enough time browsing Xenite and you’ll find that we have F.A.Q. pages, directories, dedicated mini-sites, and feature article sections. Some of those feature article sections are larger than many business Web sites. Some of those feature article sections are occasionally updated with new information or content but their original design is intended to be comprehensive. They aren’t really designed for growth.
The mini-site sections, on the other hand, are indeed ready for more content. They look different, act different, feel different from the feature article sections. Xenite has become a slurry of Web content because it is host to many different types of Web sites, but those sites all promote Xenite itself and some of them heavily cross-promote other parts of Xenite.
How does linking work for superlarge sites?
Of course, not everyone wants to create a Xenite.Org. In fact, you can study superlarge content sites that are so homogenous there is no mistaking what they are: Web forums. Popular Web forums accumulate hundreds of thousands or millions of posts as the years role by. Amazingly, search engines can find some of the oldest posts on those popular forums while struggling to find newer posts on small forums.
What is the difference between a small forum and a superlarge forum?
One somewhat obvious distinction is the link profile of the superlarge forum. It’s undoubtedly pulling in a lot of links from other sites if it has hundreds of thousands of posts. People link to those discussions from other forums, from blogs, sometimes from news stories, and sometimes from directories. Social media sites will also link to forum discussions (in fact, some social media sites won’t link to root URLs — you must deep link).
A relatively inactive forum or new forum doesn’t have much of a link profile. Its internal linkage won’t help it much unless it uses static HTML pages. Do you know why? The reason may not be obvious to everyone.
The linking differences between forums and blogs
Forum software, unlike blog software, is designed to hide content. Blog software is designed to make content more visible, often by replicating that content (Blogger is an exception — they now bury old content). Forum software designers make it almost impossible to keep old discussions crawlable. They scroll off the active discussions list after a day, a week, or a month (you can usually configure how long the posts remain visible).
Some forum software offers “search engine-friendly archives” that don’t really offer any advantage over live forum posts. Old archive pages may hang around longer than the live posts but archive pages tend to strip out all formatting.
Forums tend to have distinctive link patterns. Their administrative functions often have more links than their index pages and discussion pages. Every post usually has a collection of “QUOTE”, “REPLY”, “START THREAD”, etc. links to incidental functions that really don’t need to be crawled or indexed. Blogs lack these kinds of navigational link structures.
So another distinction between a blog page and a forum page is that the forum page may be honeycombed with duplicate links to incidental functions. Some blogs can be covered with blogrolls but many blogs only show their blogrolls on their front pages. The heavy internal linkage common to modern forum packages is both annoying and distracting. But that heavy linkage actually improves usability while both improving and diminishing accessibility and it also impedes efficient crawling.
Ambiguous linkage: How to compensate for ambiguous link structures
Forum software tends to create ambiguous linkage so it should not surprise anyone that forums need a lot of external links in order to be well-indexed. It’s not that those external links qualify pages (a lot of external links may actually point to incidental function pages — do you know why?). It’s just that people will intentionally link to actual discussions more often than they’ll intentionally link to incidental functions. Statistically, you’ll get a lot of older content crawled and reindexed when there are static external links pointing to that older content.
Some forum operators will bump old discussions to bring them back up to the top. I’ve done that occasionally myself. Bumping threads helps improve their visibility for both people and search engines. But you cannot bump a blog post. Once the post is made it’s archived and stored and forgotten. The blog mostly lives through its front page. If you don’t allow your blog to create individual archive page for each post, you’ll eventually lose granularity in your visibility because people don’t have anything to link to.
Deep-linking to individual blog posts thus helps your friends and clients even though you may seem to be helping confuse the search engines. Many people choose to promote their collective archive pages (weekly or monthly) rather than individual blog posts. If you can attract links to those weekly or monthly archives that’s okay. If you don’t get links to them, you have a problem. But some older blogs now have dozens, even hundreds of weekly archive links in their margins. Unlike blogrolls, archive rolls may be carried from page to page on a blog.
Is it okay to put a lot of links on a page?
The bottom line here is that putting large numbers of links on your pages is not usually a problem. The search engines may say they don’t like that but the truth is that they have rewarded millions of Web sites with intensive crawling for having large numbers of links on their pages. There are legitimate concerns about how automated reciprocal linking programs may get some people into trouble, but it’s the fact of high volume reciprocation — where the reciprocal links are concentrated into one or only a few pages — that gets sites into trouble most often.
Some reciprocal linking programs have caused trouble for other reasons, but generally speaking the more you rely on reciprocal linkage the less likely you are to rely on better linking practices anyway. A powerful linking strategy doesn’t need to depend on external links. In fact, if external links are integral to your linking strategy, you have a relatively weak linking strategy.
Does link baiting help, hinder, or matter?
There may be people who quickly ask, “What about link baiting?” Link baiting is a perfectly valid Web marketing strategy, but it’s not search engine optimization and most Web sites are not suitable for link baiting anyway. As far as search engine optimization is concerned, link baiting is inefficient. You have to create quality content but superlarge content sites depend on volume more than they depend on quality.
Link building for volume content works on a much slower, more gradual basis, although as your community grows you accrue links at a faster rate. Getting a site to that point takes a lot of work but you can make your work more efficient by improving your internal linkage. You improve your internal linkage by creating more meaningful, more visible links. In short, you create more links as you grow your site.
Links are not the problem
Agonizing over how many links you can put on a Web page is a waste of time. Placing a lot of links on a page is not the problem. Creating a page that cries out to search engine algorithms, “SPAM! SPAMMED PAGE! I AM SPAMMING” is the problem.
7 Comments on Linking patterns: Why link-heavy pages win, lose, or draw
By dodito on September 19, 2007 at 1:45 am
Well Michael, you seem to touch a bunch of topics I just spent some time on, trying to understand and/or learn (in a few weeks time since I am not into SEO, nor a webmaster or anything like that etc).
I have never understood the not many links on page part. Aside from the fact yes, it is ugly, unreadable at some point etc.. I see quite a few pages that are crawled weekly if not daily, well established and ranking well. They are lists of libraries or expert lists that even on such an “old fashioned page” still group their links in topics on that same page.
Matt Cuts said it recently.. ok… great.. from a usability perspective.. so are the google guidelines. It never said: we will not crawl anything above 100 links. Matt also said.. keep page below 100 K.. well.. does this mean 100 links ? Where does that logic come from ? Maybe my javascript files are already more than 100 K. Maybe I have a tiny bit of html and huge amount of links.. I can surely pack more than 100 links in 100 K.
What people forget these days is ajax. On yahoo.com you can see the ajaxian tabs, and usually (though not checked with yahoo just now) SE’s will crawl all tabs. They do with us anyway.
So.. what if you have a lot of boxes, ajaxian tabs, fields etc etc.. it can be an amazingly well designed page, with tons of links. Is that bad from a usability point of view ? And from a spam point of view ? I wouldn’t say so. Then the NYT with 400 links would be banned, blacklisted and sent to the google dungeons.
And I can go on and on and on..
Couldn’t agree more.. it’s not about applying a blanket template: all reciprocal is wrong, linkpages are wrong, lots of links on a page is wrong. It’s about who does it and why (and OK… How).
By dodito on September 19, 2007 at 1:48 am
“A relatively inactive forum or new forum doesn’t have much of a link profile. Its internal linkage won’t help it much unless it uses static HTML pages. Do you know why? The reason may not be obvious to everyone.”
I haven’t got the faintest idea.
By wibbler on September 19, 2007 at 3:37 am
“(a lot of external links may actually point to incidental function pages — do you know why?)”
Well, I cant know why anymore, because I cant seem to find a tool that tells me what pages are linking to pages on my site - never mind move forward to think on why they may have linked in the first place.
Googles list of inbound links has been crud for ages, MSN recently switched off its link facility - and that was crud because their index is incomplete, and today (although its probably been of for a bit now for all I know - Yahoo site explorer isnt responding).
I realise links are not everything - but in my quest to become master SEO - can anyone guide me to a link checking site which has its own database of the links on the web?
(a comprehensive one - which doesnt rely on scraping any SEs to provide the data)
Oh man Im pulling my hair out here - every door seems to be slamming shut in the web site owners / webmasters face.
By Michael Martinez on September 19, 2007 at 7:35 am
AJAX is implemented through Javascript and style sheets. A good AJAX developer provides for accessibility (through static HTML code and text) that doesn’t look like anything is being hidden. The search engines don’t have a problem with good AJAX design, just with bad AJAX design. But that is true of all types of design styles. You can do great Web design with HTML tables that looks clean and neat under the hood and that is fully accessible.
Look at “The linking differences between forums and blogs” for the answer. To a non-technical person it may not be intuitively obvious.
By Michael Martinez on September 19, 2007 at 7:37 am
All SEO link checking tools are crap. But one example of inbound linkage to incidental pages is scraped content. Archived content in search engines and Web archives is another example. There are others.
By wibbler on September 20, 2007 at 4:48 pm
Heres a quick laugh for you guys.
I run a PC which is a bit dated on the “media content” front. (in other words I dont give a flying pink pig what “media” is available to me to waste my time on).
An example of this is when I set my homepage to MSN.com, every now and then they must put some drivel up which crashes my browser.
Now - ive no real time to mess around with MSN’s output - so instead I set my homepage to this
http://fgdfgdfgdfgdfg.com/
This way, everytime I load FFox, (a typical load of this browser represents about 10 entries in my taskbar under windows), all I have to download is “PAGE NOT FOUND”.
FANTASTIC - I think - I now dont need to download MSN every time I load an instance of a browser, and I dont get rubbish news in my face - all I get is “PAGE NOT FOUND”
So,
After a couple of weeks of bliss………..
why not visit http://www.fgdfgdfgdfgdfg.com - great site eh?
How about we all key in daft strings and let this company????? what - do their stuff?
Ive had two of these registered btw - how about getting them to register
fdfjksghru.com - its a great domain name - must be worth zillions.
Am I missing something?
By Michael Martinez on September 21, 2007 at 7:44 am
Not to steal your thunder, but I just set my browser home page to “BLANK”. Saves me tons of frustration and time when I start up a browser window.
Comment
Log in or Register to post a comment.