If you are a webmaster, there are a lot of ways to optimize a website or a blog to be placed high in the SERP (search engine results page). One of this is, without any doubt, eliminate duplicate content from your web pages, articles and posts. As a matter of fact web engine spiders (such as Google Bot, Yahoo Bot, MSN Bot etc.) while crawling a website tend to look for suspicious content which may look too alike to something they already “saw” while surfing other sites. For a spider, the more a website has got unique content, the better. This, to make sure that what a web author produces is original and has not been forged or copied from somewhere else. That’s why, if your website has got a lot of similar content, Google and other web engines will place it low in the search result for a particular keyword or a whole set of them, with the side-effect that less people will read what you have written. But, what is a duplicate content exactly? Duplicate content is everything which looks similar to something else. Two articles speaking about the same topic will inevitably have something alike, but if the similarities are too many and whole sentences are exactly the same, because they are the result of a copy-and-past action, well, this is duplicate content. But web spiders go beyond all this. As a matter of fact, if two posts have the same, or too similar addresses, for the these spiders this is another good example of duplicate content.
We don’t know when, why or because, but sometimes it seems like that when Google spiders a website, it might end up getting wrong URL link locations by arbitrarily adding an extra or a triple slash (//). For example, Google might crawl the article www.mywebsite.com/testpage.php correctly, but at the same time it could even crawl something like www.mywebsite.com//testpage.php, which in your website doesn’t exist at all, of course, but for Google is another real page belonging to it. At this point Google bots will mark them as duplicate content, and when next time a person will look for the keyword testpage, your article could be placed in the 344 position, lessening in this way the chance to be read. Luckily for us there is a little trick to avoid all this. All you have to do is write this little piece of code in your .htaccess file which usually is located in the plublic_html folder, in the root directory of your site.
- Open your FTP client and reach your website.
- Open public_html folder.
- Right click on the .htaccess file and click edit. If the .htaccess doesn’t exist, just create a htaccess.txt file with Windows notepad, put it in the public_html folder and rename it .htaccess
- Copy and past the following code in the .htaccess:
- If you want something more efficient copy and past the following code in the .htaccess:
- These codes will just remove douple or trible slashes anywhere in your website addresses.
# Remove multiple slashes anywhere in URL
RewriteEngine On
RewriteCond %{REQUEST_URI} ^(.*)//(.*)$
RewriteRule . %1/%2 [R=301,L]
# Remove multiple slashes after domain
RewriteEngine On
RewriteRule ^/(.*)$ http://www.yourwebsitename.com/$1 [R=301,L]


Until some years ago Google Dance was very well-known to every webmaster knowing a little bit about Google and the way it works. Google Dance was that lapse of time, usually from 6 to 8 days, where a major index and ranking update of the Google search engine was performed. During these days Google also recalculated backlinks of every page in the Internet to determine how important and authoritative a certain website was. Once this big check was finished Google had to update all its datacenters scattered all over the world. This was made little by little because a lot of data was involved and, as a result, some datacenters were updated right away while other ones not. Google Dance occurred on average every 36 days or 10 times per year. It was very easy to spot because your website, your most competitive keywords and such could disappear or be placed low in the SERP from one minute to the very next one, while other not-very-famous websites and their pages were placed high in the search for no apparent reasons. To check if a Google Dance is on and if your website is affected by it, just perform this simple check. As I said there are a lot of datacenters which have to be updated. A datacenter is just a particular server having a web address giving you, as a result, the well-known Google search page. On this page just writesite:www.yourwebsite.com. Do this for every datacenter available. After that compare all the results (I mean the number of your website’s pages which are indexed by that datacenter you performed the query on). If you see that some datacenter has not indexed all of your website’s pages while other ones do, it is highly probable that your website is undergoing a Google Dance. Here are the datacenters’ addresses:
(3 votes, average: 4.67 out of 5)
Comments
How to remove Windows Genuine Advantage (WGA) plug-in from Firefox: “If any info related to the plugin...
Windows XP SP3 endless reboot boot (Stop c0000139): No found it via google searching for a problem
Windows XP SP3 endless reboot boot (Stop c0000139): I found this to be the most useful info… i...
Delete and clean Internet Explorer 7 Browsing History from the Command Line: Isn’t there any...
iPhone firmware 2.0 available for download: Wow!! Lots of important information on this site! I especially...