As the owner of more than one large content domain that has been actively adding content and links since 1997, I can honestly say that I have personally seen the so-called “large domain effect” many times, where you add a new page or section about a new topic and you get almost instant rankings success.
As the owner of more than one small content domain that has been actively creating content and links since 1999 or thereabouts, I can also say that I have personally seen small domains trump large, tightly focused domains for their keywords in a variety of search engines, including today’s Google and Yahoo!.
Size doesn’t matter and anyone who tells you otherwise doesn’t have a clue about search engine optimization.
Search engines are looking for relevance and user satisfaction. Sometimes technically relevant content isn’t what the users are looking for. Sometimes a query is relevant to too many types of content. So each engine plays with its relevance scoring and makes adjustments. They also ban sites, filter sites, penalize sites, and favor sites in their search results.
But many highly visible, outspoken people in the search engine optimization industry still struggle with the fundamental concepts of optimization. Just because someone writes something on a blog or a forum doesn’t mean he knows what he is talking about. All it really means is that he has the nerve to share his ideas with the public. And while there is something to be said for the confidence we express in our willingness to share our ideas, that don’t have anything to do with search results.
There are four reasons for why your site performs well (or poorly) in search results:
- You do something good or bad with your site
- Other people do something good or bad with their sites
- The search engines do something good or bad with their data
- People search for whatever you’re most relevant to or they search for something else
That’s the “this ain’t rocket science” part of search engine optimization.
At its simplest, most basic and fundemantal level search engine optimization is all about striking a balance between what you do and what everyone else is doing. You can read all the nonsense in the blogs and forums about how you need to build links, build links, build links that you want — I’ll kick your butt in the SERPs every time if you’re just going to scrabble around the Web for links.
Some people do build up huge networks of Web sites that throw thousands of links at campaigns almost instantly (the smarter folks roll out their links over a measured period of time with some randomization built into the process). But most people don’t have 10,000 Web sites to work with.
Most people have to do the optimization thing the hard way, one page at a time. The famous Greek mathemetician is reported to have said, “Give me a fulcrum and a place to stand — and I will move the world.” That principle remains as true today as it was thousands of years ago, and it’s equally true for search engine optimization as for engineering.
When you set out to influence search results you are putting forces into motion that act according to certain laws and principles. You don’t have to be any more aware of what those laws or principles are than you are aware of the physics involved in prying a closed door open with a crowbar.
You do something you were shown how to do and it works. That’s the way most people practice search engine optimization today. Whole crowds follow after gurus who have tricks or techniques that work. The B.S. meisters who don’t tell you everything — the guys behind the curtains — quickly lose their followers because after a while even the most dense idiot has to realize that his failure to rank based on Super Secret Formula 13 has nothing to do with his own ability.
Either the formula no longer works or it never worked at all. A fair number of people offer marketing advice who don’t apply it themselves — but they sell a lot of marketing tutorials, tools, and Web services. You may never have heard of those people. I’ve run into quite a few of them in my own unending quest for Super Secret Formula 14. Sadly, the first 13 didn’t work (and I’ve always believed that 13 was my lucky number).
In search engine optimization today, most people babble endlessly about links. The link effect undisputably works, but for most people it only works temporarily. You play the reciprocal-links-that-are-not-reciprocal-links game over and over again, starting over as soon as someone cooks up a new variation on the old recipe because the search engines keep figuring out and filtering the old variations.
At its deepest level reciprocation works because it makes up the structure of the Web. If you have 100 pages on your site you unquestionably have a lot of reciprocation among your pages. But people in the industry excuse that away and say those reciprocal links don’t really count because they all come from the same domain.
But if you have a network of business sites (say, in the travel industry) and all your sites link to each other, you may very well dominate numerous search queries. Is it because of the reciprocal links or is it for another reason? Some people would be quick to argue either that the links are “relevant” and therefore okay or maybe that “the search engines know the sites all belong to the same company”.
The fact that reciprocal links are considered acceptable by the search engines should never be questioned. What the search engines care about are obviously manipulative links where the only business reason for two sites to swap links is that both may benefit in the search results. You have the right to boost your own content from within your own network, and you have the right to build brand equity throughout your network. Search engines, on the other hand, recognize that brand equity means people will search for those equitable brand sites that have proven themselves.
A site doesn’t have to do much to prove itself except stand out from the crowd. Anyone can do a travel site. But there is only one Travelport. Hm. I may have linked to the wrong site. Maybe I want to link to this Travelport site instead. No, maybe not. Well, it could be there is a crowd of Web sites out there that are all owned by Travelport.
So what? Most people are going to type in travelport.com and get to the right place anyway, aren’t they? Ah, the power of dot-com. I remember being at a book signing one day where the store manager excitedly got on his P.A. system and announced my presence to his customers. He concluded by saying, “And you can find out more about Michael on his Web site, Xenite-dot-com”.
“dot-ORG!” I shouted hastily.
“dot-ORG — sorry, Michael” he said.
Now, do the search engines have magic algorithms that figure out which site is actually more official than the other? Or is it that they are looking at more than just one factor? Maybe search engines really do try to take a lot of things into consideration.
A few years ago I became annoyed with a hyperpromotional self-promoter. His picks and shovels are (not) guaranteed to help you make millions of dollars on the Internet in the latest gold rush. I noticed he was proudly ranking for a particular well-trafficked query. I also noticed he had about 10,000 very tightly focused backlinks boosting his flagship site.
So I created a page on a Web site and pushed it up to the top position for his query. Now, I pointed a few links at the page but I didn’t do anything sneaky and I wasn’t able to slam it with 10,000 pieces of anchor text.
So why did my page knock his out of the SERPs? Maybe because I just happened to make a more relevant page than he did and I only pointed enough links at the page to help the search engines find it and see that its backlink profile matched its content.
That’s a secret you won’t find many people talking about on the Web: building agreement between links and content. Oddly, it is the assumption that there is agreement between links and content that leads search engineers to foolishly trust links as much as they do. Spammers happily exploit that misplaced trust every day, but legitimate Web site operators exploit it, too.
The problem is that people wander around SEO blogs and forums, and maybe hobnob with their best drinking buddies at conferences and bars, and they conclude that links are the secret to search success because every one they know just uses links to boost their pages.
Show of hands: how many of you have built a site and just allowed it to rank on its own for 3 months? No link building? No rushing out to directories…no dropping links in blogs and forums…no buying links…no boosting from your personal domain?
I have to do this quite often. Now, don’t get me wrong. If I have the ability to leverage links, I will leverage links. But I have domains I had to boostrap into search results success for lack of resources. You can do it with enough unique content pages and an XML sitemap. Does this approach guarantee success? Nope. Nothing guarantees success.
After all, you may go out and buy 20 PR 8 links but the guy above you may have bought 40 PR 8 links. He may have twice as much content as you or half as much. He may have bought an aged domain or not. He may or may not be doing anything special.
Xenite.Org may be a large content domain by some standards (more than 1,000 pages of content) but I compete with huge professional sites in the entertainment industry: SciFi.com, AskMen, IMDB, et. al. I also compete with Wikipedia. I don’t win every battle. But I outrank all of those huge, link-richer, content-heavier sites for many competitive queries.
I have even smaller sites that also compete against educational resources, heavily linked-to scientific resource sites, professionally edited information archives — again, I win some and I lose some. If the search engines were really favoring large content domains or older domains I’d never be able to launch a new site.
I don’t link bait. I don’t run out to social media sites and plug all my content. I think I have two active social media accounts and both are just barely active. I’ve rarely posted anything to a social media site. There are so-called SEOs who live and breathe social media. You have to ask yourself, however, are they making most of their money from those social media operations or are they bringing home the bacon some other way?
My job is to influence organic search results with good content. My personal avocation for 11 years has been to create interesting content for other people to read and enjoy. I’ve never been daunted by the goliaths on the Web. They may outrank me for more queries than I outrank them but then they have more content and I’m just not chasing that many queries. I’m good with that.
Most commercial sites see less traffic than my personal sites do, too. I’m also good with that.
Search engine optimization begins with what you do on your own pages. You cannot control what other people do on their pages. You can whine on forums and blogs but you cannot control what the search engines do with their data.
But when it comes to what people search for, you also have a little leeway there. You can suggest to people what they should be searching for. If you make good suggestions then people will search for your content and find it. That’s truly, honestly part of the search engine optimization process.
Link building can enhance search engine optimization, but it’s only a very small part of true optimization. Most people who struggle with search engine success — in my experience — depend way too much on off-site factors. They don’t think about how to insert those keywords into every nook and cranny on their pages. They don’t think about leveraging their own navigational links. They don’t stop agonizing over the fact that someone else grabbed a good domain name and just create their own good URLs.
In short, most people who practice search engine optimization — real search engine optimization — don’t look up from the keyboard long enough to care about whose site has more pages. If all else fails you can always create more content.
And no one can take that away from you. No one.
{ 13 comments… read them below or add one }
wibbler 10.03.07 at 4:38 am
“They don’t think about how to insert those keywords into every nook and cranny on their pages. ”
I do – but then I start to fear keyword stuffing and overoptimisation penalties kicking in.
So my head starts going round in a spin and I dont actually end up “KNOWING” what to do.
Michael Martinez 10.03.07 at 7:23 am
Wibbler: “I do – but then I start to fear keyword stuffing and overoptimisation penalties kicking in.”
keyword stuffing keyword stuffing keyword stuffing keyword stuffing keyword stuffing keyword stuffing keyword stuffing keyword stuffing
I don’t fear it.
If you’re not sure whether you’re stuffing keywords into a page, you probably are NOT stuffing keywords into a page. Matt Cutts gave a pretty good example of keyword stuffing on his blog in July.
Now, I’m not saying everyone should repeat their favorite expression 8 times in a row, but if you use an expression, say, 100 times on a page, and it comprises only a very, very small percentage of the page’s text, you’re most likely okay.
Think of a long blog post (like some on Matt’s blog) where you see “posted by” or “said” or the current date 100 times.
Is that keyword stuffing?
I don’t think so.
Intent usually leaves an algorithmic footprint as large as Godzilla’s.
tinkerbellchime 10.03.07 at 8:15 am
As regards content, I too worry about keyword stuffing–but less now than when I first started. In the beginning, I believed the experts who advised using keywords only 4 or 5 times on a page. Now, I know that this isn’t true. It’s just another SEO myth. Matt’s example on keyword stuffing helped. But I have had a problem with online BINGO games that I created and posted in pdf files. The games have 25 cards with 25-35 repeating words that are shuffled around from page to page. At the end of the file are the calling cards. Can you imagine how that looks to Googlebot? The same words over and over, but on different pages? Well, sure enough, there is a pattern that has developed. I post a new game and the page disappears from Googles SERPS for a few days. Last time the page was gone for over a week. But I stuck it out and didn’t remove the game. In the past, the pages would disappear for about two or three days. If this happened once, I wouldn’t say that the repeating words caused it. But this has happened at least three times–and maybe more. It’s hard to say exactly what is causing it, but there does seem to be a pattern. It is preventing me from posting as many downloadable BINGO games as I’d like. Michael, your SEO advice to not worry too much about keyword stuffing is good, and I believe you, but I have had a little problem with it due to the nature of my site. Do you think that my pages got put into a que for manual checking and were released after being checked? Do search engines do this? I’d love to create about 150 downloadable BINGO games because they would be a great resource for teachers and students, but I don’t have the guts.
cdelrio 10.03.07 at 2:28 pm
Hmm…really? No algorithmic favoring of large sites? Okay I can buy that, for now.
But you certainly know that the difference in the time that it takes for SciFi.com and mypersonal-fansite.com to be crawled and indexed can certainly make a difference in initial visibility.
The a weeks head start on the first page can make a very stark difference on inbound links. And in a great many cases the internal links from powerful pages within a large site can overpower even a well linked small site.
While the search engines may not programmatically favor large domains they do pragmatically favor frequently spidered domains.
Michael Martinez 10.03.07 at 3:01 pm
tinkerbellchime:”The same words over and over, but on different pages? Well, sure enough, there is a pattern that has developed. I post a new game and the page disappears from Googles SERPS for a few days.”
Michael: The words “about seo theory” and “seo glossary” appear on around 200 pages of SEO-Theory.com, along with many other words. As you expand the templated text that is replicated across your pages, you reach a threshold point where your pages start to look like duplicate content. I can see how that might be causing your occasional losses of search visibility.
Or, if you have weak inbound linkage (and weak internal linkage), I can see how your page might drop out of the index every now and then only to be reindexed.
It’s possible to suggest alternative explanations. When you see several possible explanations for why something happens, the best thing you can do is start eliminating the possible explanations that are not the right one. Eventually, you either clear your list and then have to figure out something else or you get down to the right explanation.
cdelrio: “But you certainly know that the difference in the time that it takes for SciFi.com and mypersonal-fansite.com to be crawled and indexed can certainly make a difference in initial visibility.”
Michael: Any site can be crawled and indexed quickly. I can create a brand new site right now and get it indexed within an hour. Now, the question is, how would I do that without breaking any rules? You already know the answer, although I don’t think it has occurred to you.
“While the search engines may not programmatically favor large domains they do pragmatically favor frequently spidered domains.”
You and I are looking at the same data but I’ll interpret it differently by saying that large content sites that have good, strong internal navigation statistically favor themselves in the crawling process.
That is not a competitive advantage over a small content site. SEO-Theory.com was getting indexed in a matter of minutes back when it had only a few dozen pages on it.
Size really doesn’t matter.
Carlos 10.03.07 at 4:08 pm
I am unclear on how statistical favoring themselves in the crawling process doesn’t constitute a competitive advantage.
A site doesn’t just come into existence as a large site it takes time to build. Even if it is built offline. As soon as it comes live the clock is running, a small site with 12 pages has 12 chances per second to introduce new content, amongst thousands of page that are being spidered. CNN has 816,000 chances and while CNN has a high loss rate on position it often capture significant visibility during the opening days of any new content. Right now when I search CNN (about 5pm October 3rd) the first four caches are:
Oct 3, 2007 03:23:19 GMT
Oct 3, 2007 02:31:59 GMT
Oct 3, 2007 02:31:45 GMT
Oct 3, 2007 02:27:26 GMT
Look at that, three of the most relevant pages for CNN where hit in the span of 4.5 minutes. I genuinely feel like that would be considered an advantage over a site that say gets hit once in a week.
tinkerbellchime 10.03.07 at 5:39 pm
You said: “As you expand the templated text that is replicated across your pages, you reach a threshold point where your pages start to look like duplicate content.”
Question: Is it a good practice to change the template text that is used on your webpages? If so, how often? Every 100 pages or so? If you have a new area that you want to add to your website, would that be a good time to change the template text? Should your left hand navigation change with the new section? For example, are you going to change your navigation titles here on SEO Theory after another 200 pages? You already have captured the top spot for that term, so what’s the point of beating it to death? Why not go after another term for awhile? Would this be a good strategy? You would, of course, have to monitor the situation to make sure that you didn’t lose your main term. Would it make sense to move ‘SEO Theory’ down one place and put SEO in its place? Of course, I’m really asking for myself because I have a similar linkage situation. This is my not so sly way of getting free advice from you. :>)
Michael Martinez 10.03.07 at 9:46 pm
tinkerbellchime: “Question: Is it a good practice to change the template text that is used on your webpages? If so, how often? Every 100 pages or so?”
Michael: I don’t mean that you should change out your template text. I mean that the percentage of template text versus unique text increases the likelihood of two pages appearing to be similar enough that a duplicate filter kicks in (you see one such filter whenever Google shows you “Omitted results”).
Using SEO Theory as a baseline to explain the principle, when we reach a point where I get a lot of “Omitted results” links in site searches I’ll start thinking about changing meta data on the blog. Generally speaking, even the blog archives should appear to be relatively unique with respect to other pages on the site because I write such long posts.
Carlos: “A site doesn’t just come into existence as a large site it takes time to build. Even if it is built offline. As soon as it comes live the clock is running, a small site with 12 pages has 12 chances per second to introduce new content, amongst thousands of page that are being spidered. ”
Michael: We can hypothetically define a large content site that presents a real challenge for quick indexing, but the fact of the matter is that Google quickly indexes tens of thousands, perhaps hundreds of thousands of pages from one particular domain every day.
CNN doesn’t push out 100,000 pages a day but other sites do push out thousands of pages a day and those sites can and in some cases DO get all those news pages into at least part of Google’s index.
Think of the tools available to the Webmaster. You have more than just links and content. How does a search engine quickly find and start to ingest 10s of thousands of pages of content in the space of a few hours?
Carlos 10.04.07 at 10:39 am
Thanks Michael I get the “wink wink nudge nudge” but my sitemap being downloaded by Google is not the same as being indexed.
Perhaps I am wrong but it appears to me that means, frequency, and diversity of introduction are all factors (signals) for indexing. I certainly see a difference in depth and alacrity of change dispersion between between a thin sitemap and a robust one. But the effect is small compared to the change due to link disparity (i.e. 10 inbound vs. 100 inbound).
Michael Martinez 10.05.07 at 7:50 am
I’m not talking about XML sitemaps. Neither am I going to be specific.
Chas 02.17.08 at 1:42 am
“Size doesn’t matter and anyone who tells you otherwise doesn’t have a clue about search engine optimization.”
Google doesn’t rank sites, it ranks pages. If you have a large site with relevant pages, each page added might well have instant links with 200 relevant pages. This is a huge boost in Google rankings. Because of “internal” linking, size may be a huge factor. But a single page may cover a topic fully and get lots of “external” links too. So size MAY matter in some cases, in others…not.
being1 09.11.08 at 8:06 pm
That is a very cool and right article. I was wondering about the xact same things myself, and I truly believe that people give way too much importance to linking. In my own website I am doing great with just content management and excellent informative forum. In fact, when you look for the Keyword on Google Suggest, the second suggestion is the name of my website (!) : Google understood that the name of my website is equivalent to the name of the keyword, even though my domain name does not consist of the keyword itself.
Although I am ranked 3rd on the main keyword, and my website is the 2nd in size, I dominate hundreds of other keywords, much more than my competition. My competition is having 6 websites with thousandds of links to the main site, yet I dominate the area and my PR is higher than his (though I never cared much for PR, it does show, somehow, what Google thinks).
What always amazed me is the fact that big directives, for example- never have incoming links. They just point OUT, nobody points INTO their site. Yet, they rank first on many occassions. And this proves what you have yourself said about the over-value of links. Plus it demonstrated to me the other side of the story : that people who are constantly refusing to give links to other sites (in the same topic)- are damaging their own site.
2 months I have made an experiment, just like you did: I opened a new website, using a correct Domain name. I made a directive out of it. I simply collected all articles from the web, which related to that keyword / topic. I than collected the list of all related products, all related pictures, all related shops. The result is astonishing: I went into page one in one-week time. I am now located 2nd in that keyword, beating the big ones. And I only handle it 30 minutes a day. In fact, it went up so fast, I am afraid to overdo it, so I just give it a little touch here and there. Now, that brought up a whole new concept into my universe :
It is evident that the best website is Google. Now what is it doing ? Do the same on any topic, and you shall be Number 1. Very simple. And I do not have even 10 incoming links !
Intelligence is the key.
And thanks for your article too.
Michael Martinez 09.12.08 at 8:00 am
Toolbar PR does show what Google “thinks” — that is, how important Google algorithmically (or otherwise) determines a page to be, compared to the rest of the Web. But Toolbar PR is not a predictor for performance in the search results or for a page’s ability to pass value to other pages.
If you achieve a PR 4 or 5 or whatever, and you feel a sense of accomplishment, don’t let SEO cautionary wisdom diminish that sense of accomplishment. As long as you don’t optimize on the basis of Toolbar PR, there is nothing wrong with appreciating whatever score you achieve.
It’s still a metric, and one that a lot of people pay attention to.
BTW — Google arbitrarily assigns a PR 10 to every view in our Google Analytics account.
Not bad for dynamic content that receives no inbound linkage, wouldn’t you agree?
You must log in to post a comment.