Search Perform an advanced search query SOFTPEDIA
 
SOFTPEDIA
Updated one minute ago
HomeSubmit a program for being reviewedAdvertise on our websiteGet help on surfing our websitesSend us your feedbackGet information about our XML/RSS backend and how to use itBrowse the news archiveVisit our discussion forumVizitati forumul in limba romana



KLIP
  1. HOME
  2. SCIENCE
  3. TECHNOLOGY
  4. WEBMASTER
  5. SECURITY
  6. MICROSOFT
  7. LINUX
  8. APPLE
  9. GAMES
  10. TELECOMS
  11. REVIEWS
  12. LIFE & STYLE
  13. EDITORIALS
  14. INTERVIEWS
  15. RSS
Welcome!
Hello, Guest

Login if you have a Softpedia.com account.

Otherwise, register for one.

TIPS AND TRICKS

Methods to Block Spiders

- Removal of Web Content from Google

By: Catalin Bocanu, Web News Editor

There are situations when you do not want that certain URLs of your website to be indexed by search engines
robots. Fortunately, solutions exist if you need to remove outdated content
like entire web pages, individual images and more.

The general methods for blocking search engines robots use robots.txt files, specific meta tags definitions or a .htaccess file. If you want to remove web content from Google or to prevent the indexing of a website or parts of it, the most recommended options would be the creation of a custom robots.txt file or the implementation of robots meta tags into the HTML code of your pages. In order to block the Google bots for further indexing actions, the file robots.txt, which must be placed in the root of your domain, will have to contain the next two lines:

CODE
User-agent: Googlebot
Disallow: /


The directive specified in the robots.txt file disallow the entire website indexing by the Google bot. In a similar manner you can specify only a directory of your website. Or, as an alternative, you can insert appropriate robots META tags into the HEAD of your HTML pages:

CODE

<head>
<meta name="googlebot" content="noindex,nofollow">
</head>


The meta tag considered in the example tells to Google robots not to index the current web page and also not to follow the links existing on that page. After you define a robots.txt file or the meta tag that blocks the indexing actions of Google bots, go to the Google Webmaster Tools website and select the desired content you want to remove from Google index. In a similar way you can also remove a cached copy of a Google search result.

It is good to know that certain robots does not respect the directives defined by robots.txt or META tags. In order to create a better protection for your website content privacy it is recommended to use .htaccess files to block spiders.




MORE RELATED ARTICLES: Search Engine Spider Simulators Adding a Search Engine Feature to a Website Break Down Your Internet Competitors with SEO SpyGlass Management of A Free Custom Search Engine The Efficient Usage of Google Webmaster Tools RSS Feeds' Visibility in Search Engines A Free Web Directory Deployment of a Free, SEO Ready Made Website in 10 Minutes Do You Know the Value of Your Keyword Density ? The Basic Rules of Search Engine Optimization
 
Comments | Link here | Subscribe
Print | Send to friend
Today's News | Yesterday's News

Search:


10th January 2008, 17:35 GMT | Copyright (c) 2008 Softpedia | Contact:
Read by 658 user(s) | Rating: | 7 vote(s) so far | Cast your vote:
Methods to Block Spiders - USER OPINIONS




We are sorry, there are no opinions available for this article.






SHARE YOUR OPINION ABOUT Methods to Block Spiders

Since you are not logged on, your comments will have to be approved before being displayed.
Click here to login, or register.
Your Name:
Your Email:
Type in the result:
Your Opinion:
 


DO YOU WANT TO CONTACT US?  

If you have some comments or you want to send us some information you can send us an email directly to .
You can use the form below for the same purpose.
Your full name: (at least 3 characters)
Your email address: (at least 5 characters)
Message subject: (at least 5 characters)
Message text:
(at least 10 characters)
Type in the result:
 
 



© 2001 - 2008 Softpedia. All rights reserved.
Softpedia™ and Softpedia™ logo are registered trademarks of SoftNews NET SRL.
Copyright Information | Privacy Policy | Terms of Use | Contact Softpedia | Update your software | Archive