Question:

How does the Google spider work?

by Guest9153 | 12 years, 9 month(s) ago


Tags: Google, spider, works?


1 ANSWER


Guest3361

Googlebot (the Google spider) is Google’s web crawling robot, which finds and retrieves pages on the web and hands them off to the Google indexer. It’s easy to imagine Googlebot as a little spider scurrying across the strands of cyberspace, but in reality Googlebot doesn’t traverse the web at all. It functions much like your web browser: it sends a request to a web server for a web page, downloads the entire page, and then hands it off to Google’s indexer.
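To make the "works like a browser" point concrete, here is a minimal Python sketch of the request, download, and hand-off steps. It is only an illustration, not Google's code: the names `USER_AGENT`, `build_request`, `fetch_page`, and `hand_off_to_indexer` are made up for this example, and the "indexer" is just a dictionary.

```python
import urllib.request

# Hypothetical user-agent string for this sketch; not Googlebot's real one.
USER_AGENT = "ExampleCrawler/0.1"


def build_request(url):
    """Build a GET request for a page, identifying the crawler by user agent."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def fetch_page(url, opener=urllib.request.urlopen, timeout=10):
    """Ask the web server for the page and download the entire body as bytes.

    `opener` is injectable so the function can be exercised without a network.
    """
    with opener(build_request(url), timeout=timeout) as response:
        return response.read()


def hand_off_to_indexer(url, body, index):
    """Stand-in for the indexer hand-off: store the raw page under its URL."""
    index[url] = body
```

With a network connection, `hand_off_to_indexer(url, fetch_page(url), index)` would run the whole cycle for one page.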

Googlebot consists of many computers requesting and fetching pages much more quickly than you can with your web browser. In fact, Googlebot can request thousands of different pages simultaneously. To avoid overwhelming web servers, or crowding out requests from human users, Googlebot deliberately makes requests of each individual web server more slowly than it’s capable of doing.
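One common way to get this kind of per-server throttling is to enforce a minimum delay between requests to the same host while leaving different hosts independent. The sketch below assumes that approach; the class name `PolitenessScheduler` and the 1-second default delay are invented for illustration and say nothing about the delays Googlebot actually uses.

```python
from urllib.parse import urlparse


class PolitenessScheduler:
    """Spaces out requests to the same host; different hosts do not block each other."""

    def __init__(self, min_delay_seconds=1.0):
        self.min_delay = min_delay_seconds
        self.next_allowed = {}  # host -> earliest time the next request may start

    def schedule(self, url, now):
        """Return the time at which `url` may be requested, given the current time."""
        host = urlparse(url).netloc
        start = max(now, self.next_allowed.get(host, now))
        # Reserve the slot so the next request to this host waits at least min_delay.
        self.next_allowed[host] = start + self.min_delay
        return start
```

A crawler with many worker machines would consult something like `schedule()` before each request, so the overall crawl stays fast while each individual server sees only a slow trickle.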

Googlebot finds pages in two ways: through an add URL form, www.google.com/addurl.html, and through links it finds while crawling the web.

When Googlebot fetches a page, it extracts all the links appearing on the page and adds them to a queue for subsequent crawling. Googlebot tends to encounter little spam because most web authors link only to what they believe are high-quality pages. By harvesting links from every page it encounters, Googlebot can quickly build a list of links that covers broad reaches of the web. This technique, known as deep crawling, also allows Googlebot to probe deep within individual sites. Because of their massive scale, deep crawls can reach almost every page on the web. Because the web is vast, this can take some time, so some pages may be crawled only once a month.
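The fetch, extract links, and queue loop described above can be sketched in a few lines of Python using only the standard library. This is a simplified illustration under assumptions: `LinkExtractor`, `extract_links`, and `crawl` are names chosen for this example, the queue is a plain first-in, first-out queue, and `fetch` is any function you supply that returns a page's HTML as a string (or `None` if the page can't be retrieved).

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag, resolved against the page's URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(urljoin(self.base_url, value))


def extract_links(base_url, html):
    """Return all absolute link URLs found in the HTML of one page."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


def crawl(seed_url, fetch, max_pages=100):
    """Crawl outward from seed_url; return the URLs fetched, in crawl order."""
    queue = deque([seed_url])
    seen = {seed_url}  # URLs already queued, so no page is fetched twice
    order = []
    while queue and len(order) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:  # page could not be retrieved; skip it
            continue
        order.append(url)
        for link in extract_links(url, html):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return order
```

For a quick experiment without touching the network, `fetch` can be the `.get` method of a dictionary that maps URLs to HTML strings.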

To keep the index current, Google continuously recrawls popular, frequently changing web pages at a rate roughly proportional to how often the pages change. Such crawls are known as fresh crawls. Newspaper pages are downloaded daily, and pages with stock quotes are downloaded much more frequently. Of course, fresh crawls return fewer pages than the deep crawl. The combination of the two types of crawls allows Google to both make efficient use of its resources and keep its index reasonably current.
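As a rough illustration of "a rate roughly proportional to how often the pages change", the Python sketch below derives a recrawl interval from how many changes were observed over some period. The function name `recrawl_interval`, its parameters, and the clamping limits are assumptions made for this example; the 30-day ceiling simply mirrors the "once a month" figure mentioned above, and Google's real scheduling is not public in this detail.

```python
def recrawl_interval(observed_changes, observation_days, min_days=0.01, max_days=30.0):
    """Estimate how many days to wait before fetching a page again.

    A page that changed `observed_changes` times in `observation_days` days is
    revisited about once per change, clamped to the range [min_days, max_days].
    """
    if observed_changes <= 0:
        # Never seen to change: fall back to the slowest schedule.
        return max_days
    interval = observation_days / observed_changes
    return max(min_days, min(max_days, interval))
```

Under these assumptions, a newspaper page that changed 30 times in 30 days gets a one-day interval, while a page that never changed is left to the monthly schedule.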
12 years, 9 month(s) ago

Question Stats

Latest activity: 15 years, 3 month(s) ago.
This question has 1 answer.
