LoneeBoyett457

出自 女性百科
於 2012年12月23日 (日) 09:36 由 LoneeBoyett457 (對話 | 貢獻) 所做的修訂 (新页面: Webmasters may instruct lions to not crawl certain files or directories through the common robots.txt file in the root directory of the domain. Moreover, a full page can be expl... Cont...)

(差異) ←上個修訂 | 最新修訂 (差異) | 下個修訂→ (差異)
前往: 導覽搜尋

Webmasters may instruct lions to not crawl certain files or directories through the common robots.txt file in the root directory of the domain. Moreover, a full page can be expl...

Contrary to popular opinion, the search engine spiders sent by the major search engines do not need certainly to search everything on a website. You could officially hold a se index away from a page by advised it through a specific programs meta draw or a document not to come nearby the page.

Webmasters could tell lions not to get certain files or directories through the standard robots.txt document in the root directory of the domain. In addition, a page could be explicitly excluded from the search engine's database with a robots meta tag. If for some reason a search engine spider wasn't wanted by you to crawl a typical page you do have the means to do so.

Each time a se visits a niche site, the robots.txt located in the root folder is the first report crawled. The robots.txt document is then parsed, and only pages not banned is going to be crawled. This really is not always fool proof. Internet search engine spiders have a habit of going far from a page and then coming back and looking at the page another time later. As a se crawler may keep a cached copy of this record, it may occasionally spider pages a webmaster does not wanted crawled.

Pages that many webmasters choose perhaps not be crawled include login particular pages such as shopping carts and user-specific information such as search results from internal searches. Other pages that you might not need crawled, based on this content might be described as a guest book that you expect to be filled with spam or perhaps a feedback process that's not very flattering to you. It is also a good idea to instruct the lions not to investigate a page with a large amount of animation or display on it as this is often mistakenly read by a spider as a deteriorating site. official site