Search Engines Optimization (SEO) blog SEO blog | Profile | Archives | Friends
We live in SERP!

SEO advice. URL canonicalizationMay. 22, 2006

Before I start collecting feedback on the Bigdaddy data center, I want to talk a little bit about canonicalization, www vs. non-www, redirects, duplicate urls, 302 “hijacking,” etc. so that we’re all on the same page.

 

Q: What is a canonical url? Do you have to use such a weird word, anyway?
A: Sorry that it’s a strange word; that’s what we call it around Google. Canonicalization is the process of picking the best url when there are several choices, and it usually refers to home pages. For example, most people would consider these the same urls:

  • www.example.com
  • example.com/
  • www.example.com/index.html
  • example.com/home.asp

But technically all of these urls are different. A web server could return completely different content for all the urls above. When Google “canonicalizes” a url, we try to pick the url that seems like the best representative from that set.

 

Q: So how do I make sure that Google picks the url that I want?
A: One thing that helps is to pick the url that you want and use that url consistently across your entire site. For example, don’t make half of your links go to http://example.com/ and the other half go to http://www.example.com/ . Instead, pick the url you prefer and always use that format for your internal links.

 

Q: Is there anything else I can do?
A: Yes. Suppose you want your default url to be http://www.example.com/ . You can make your webserver so that if someone requests http://example.com/, it does a 301 (permanent) redirect to http://www.example.com/ . That helps Google know which url you prefer to be canonical. Adding a 301 redirect can be an especially good idea if your site changes often (e.g. dynamic content, a blog, etc.).

 

Q: If I want to get rid of domain.com but keep www.domain.com, should I use the url removal tool to remove domain.com?
A: No, definitely don’t do this. If you remove one of the www vs. non-www hostnames, it can end up removing your whole domain for six months. Definitely don’t do this. If you did use the url removal tool to remove your entire domain when you actually only wanted to remove the www or non-www version of your domain, do a reinclusion request and mention that you removed your entire domain by accident using the url removal tool and that you’d like it reincluded.

 

Q: I noticed that you don’t do a 301 redirect on your site from the non-www to the www version, Matt. Why not? Are you stupid in the head?
A: Actually, it’s on purpose. I noticed that several months ago but decided not to change it on my end or ask anyone at Google to fix it. I may add a 301 eventually, but for now it’s a helpful test case.

 

Q: So when you say www vs. non-www, you’re talking about a type of canonicalization. Are there other ways that urls get canonicalized?
A: Yes, there can be a lot, but most people never notice (or need to notice) them. Search engines can do things like keeping or removing trailing slashes, trying to convert urls with upper case to lower case, or removing session IDs from bulletin board or other software (many bulletin board software packages will work fine if you omit the session ID).

 

Q: Let’s talk about the inurl: operator. Why does everyone think that if inurl:mydomain.com shows results that aren’t from mydomain.com, it must be hijacked?
A: Many months ago, if you saw someresult.com/search2.php?url=mydomain.com, that would sometimes have content from mydomain. That could happen when the someresult.com url was a 302 redirect to mydomain.com and we decided to show a result from someresult.com. Since then, we’ve changed our heuristics to make showing the source url for 302 redirects much more rare. We are moving to a framework for handling redirects in which we will almost always show the destination url. Yahoo handles 302 redirects by usually showing the destination url, and we are in the middle of transitioning to a similar set of heuristics. Note that Yahoo reserves the right to have exceptions on redirect handling, and Google does too. Based on our analysis, we will show the source url for a 302 redirect less than half a percent of the time (basically, when we have strong reason to think the source url is correct).

 

Q: Okay, how about supplemental results. Do supplemental results cause a penalty in Google?
A: Nope.

 

Q: I have some pages in the supplemental results that are old now. What should I do?
A: I wouldn’t spend much effort on them. If the pages have moved, I would make sure that there’s a 301 redirect to the new location of pages. If the pages are truly gone, I’d make sure that you serve a 404 on those pages. After that, I wouldn’t put any more effort in. When Google eventually recrawls those pages, it will pick up the changes, but because it can take longer for us to crawl supplemental results, you might not see that update for a while.

That’s about all I can think of for now. I’ll try to talk about some examples of 302’s and inurl: soon, to help make some of this more concrete.


(Posted in Google and SEO)
Permanent Link

Google And Webmasters. A True Love or Hate RelationshipMay. 22, 2006
Any SEO fact finding mission will inevitably lead to reports from webmasters on how Google doomed or saved their website. True to human nature, those who feel hard done by are most often the loudest of the talkers. For every report of how Google has lavished a website with tens of thousands of visitors there are probably ten, a hundred or even more reports of how Google have doomed sites to failure.

Persuading Google To Respect Your Site

In all honesty, a webmaster that is looking for search engine success must gently persuade Google to love them. The rewards for doing so are far greater than any other form of advertising or marketing. Conversely, the punishment for sites left out in the cold by Google really is enough to break the soul of any webmaster.

The Google Effect

According to statistics, Google received 91 million American searches per day in the month of March 2006. Their closest rival, Yahoo, managed 60 million and MSN, in third place racked up only 28 million searches per day in the same period. It would be foolhardy to ignore other search engines but to ignore the 43% Google market share is outright online suicide and persuading Google that your site offers everything their visitors want will inevitably give you advantage with the remaining search engines too.

The Most Important Point

OK, so this is hardly breaking news – after all, every SEO site on the Internet harps on about Google being the be all and end all of online marketing but it is a vitally important point that you must pay attention to. One of the major contributing factors for Google’s success is trust. So many billions of people regularly use Google to search because they trust the results will be as close to relevant as is virtually possible. There is no way to financially hammer your way to the top of the listings and even the sponsored listings at the side of the page are quite clearly listed as exactly that. Remember this point:

Google has become successful by providing relevant results to their visitors and they reward websites for doing the same.

Understanding Google Algorithms

The exact science behind Google algorithms are as much of a mystery as the whereabouts of your missing socks but there are certain factors that all good SEOs and webmasters know.

The right amount of relevant content and a clean design indicates a useful website.

Tricks and underhand SEO tactics are a thing of the past. They are best left to webmasters who know no better. That’s not to say there aren’t guidelines you should follows to ensure a greater success for your site. However, SEO guidelines are beginning to merge with visitor optimization tactics.

Love Your Visitors

Including “keywords” is still vital but not to set percentages. The inclusion of keywords makes the reading of your content easier for your visitors and has the added benefit that it also signifies a topic for your website making it easier for Google to assess it’s relevance. Similarly, a clean and simple website design is easy on the eye and quick to navigate for visitors and Google spiders can easily crawl your website.

So, in order to optimize your website for Google you should primarily optimize for visitors.

Give It A Little Time

This neatly brings us to the question of the Google sandbox. This name has been given to the phenomenon that Google essentially ignore websites for the first few weeks of their inception. Clearly, this is an irritant to many webmasters and one of the least understandable parts of their algorithms.

The principle behind the action is to ensure that brand new sites don’t pay for thousands of inbound links and propel themselves to the top of the rankings illegitimately. Google have always strived to ensure that links are gained organically based on the relevance and strength of a website and its content. Purchasing links completely bypasses this and, in Google’s opinion, degrades the value of linking.

What To Do

The next time you sit down to promote a website with the view of increasing your standings within the search engine results remember that ignoring Google because you’ve read bad press from other webmasters is a dangerous choice. Google is, by far, the most widely used search engine on the web and “optimizing” your site to climb the ranks is easier than many would have you believe. Among the key ingredients is fresh, original content that is useful to your visitors. In order to build up links to your website, the submission of articles to article directories is one tactic that shows positive results.

(Posted in Google and SEO)
Permanent Link

Cheap phone card at PB - Telephone cards for international calls
Cheap prepaid calling cards at CC21 - Calling cards online
Homes, Real Estate For Sale By Owner - Buy and sell homes online without realtors
Jewelry news and stroties at JT - Jewelry Blog
Telecommunications World at JT - Telecommunications news
Blog Search Bot at BSB - HQ Blog Search Engine