
Tips and Tricks


Methods to Block Spiders

Removal of Web Content from Google

By Catalin Bocanu, Web News Editor

10th of January 2008, 17:35 GMT

There are situations in which you do not want certain URLs of your website to be indexed by search engine robots. Fortunately, there are ways to remove outdated content such as entire web pages, individual images and more.

The general methods for blocking search engine robots rely on a robots.txt file, robots meta tags or a .htaccess file. If you want to remove web content from Google, or to prevent a website or parts of it from being indexed, the recommended options are to create a custom robots.txt file or to add robots meta tags to the HTML code of your pages. To stop Google's robot from indexing the site any further, the robots.txt file, which must be placed in the root of your domain, has to contain the following two lines:

CODE
User-agent: Googlebot
Disallow: /

The directive in this robots.txt file disallows indexing of the entire website by the Google bot. In a similar manner, you can block only a single directory of your website by giving its path in the Disallow line. Alternatively, you can insert the appropriate robots META tag into the HEAD of your HTML pages:

CODE
<meta name="robots" content="noindex, nofollow">

The meta tag in the example tells Google's robots not to index the current web page and not to follow the links on that page. After you have defined a robots.txt file or a meta tag that blocks Google's bots, go to the Google Webmaster Tools website and select the content you want to remove from the Google index. In the same way, you can also remove the cached copy of a Google search result.
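
Before requesting a removal, it can be useful to confirm that a robots.txt rule set behaves as intended. The following is a minimal sketch in Python, using the standard library's urllib.robotparser module, that checks the two-line rule set against a sample URL and also tries a directory-only variation. The www.example.com URLs, the /private/ directory and the helper name is_allowed are assumptions made for illustration. The parser is only an approximation of how a real crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file content as an iterable of lines.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The rule set from the article: keep Googlebot out of the whole site.
BLOCK_SITE = """\
User-agent: Googlebot
Disallow: /
"""

# Variation: keep Googlebot out of one directory only.
# The /private/ path is a hypothetical example.
BLOCK_DIRECTORY = """\
User-agent: Googlebot
Disallow: /private/
"""

if __name__ == "__main__":
    checks = [
        (BLOCK_SITE, "http://www.example.com/index.html"),
        (BLOCK_DIRECTORY, "http://www.example.com/private/page.html"),
        (BLOCK_DIRECTORY, "http://www.example.com/index.html"),
    ]
    for rules, url in checks:
        print(url, "allowed for Googlebot:", is_allowed(rules, "Googlebot", url))
```

Because the rules name only Googlebot, robots with a different user agent are not affected by either rule set.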

Keep in mind that some robots do not respect the directives defined in robots.txt or META tags. To better protect the privacy of your website content, it is recommended to use .htaccess files to block such spiders.
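
One common way to block spiders from a .htaccess file is to refuse requests by their User-Agent header. The sketch below, kept in Python for consistency, generates such rules as text. It assumes an Apache server with mod_rewrite enabled. The helper name build_htaccess_rules and the bot names BadBot and EvilScraper are hypothetical. A robot that sends a false User-Agent string would still get through, so this is only a partial protection.

```python
import re


def build_htaccess_rules(bot_names):
    """Return mod_rewrite lines that answer 403 Forbidden to the named bots."""
    if not bot_names:
        raise ValueError("at least one bot name is required")
    # Escape each name so it is matched literally inside the regular expression.
    pattern = "|".join(re.escape(name) for name in bot_names)
    return "\n".join([
        "RewriteEngine On",
        # [NC] makes the User-Agent comparison case-insensitive.
        f"RewriteCond %{{HTTP_USER_AGENT}} ({pattern}) [NC]",
        # "-" leaves the URL unchanged; [F] replies 403 Forbidden, [L] stops processing.
        "RewriteRule .* - [F,L]",
    ])


if __name__ == "__main__":
    # Hypothetical bot names; replace them with the user agents seen in your logs.
    print(build_htaccess_rules(["BadBot", "EvilScraper"]))
```

The printed lines can be pasted into the .htaccess file in the root of the website.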

© Copyright 2001-2009 Softpedia