Google bots get the red carpet treatment

By Robert Jaques

Nov 20 2007 7:12AM

Webmasters who control automated web-crawler access to their sites using 'robots.txt' files have a bias that favours Google over other search engines, according to new research..

Google bots get the red carpet treatment

The claim was made by researchers at Penn State University based on the results of a study of more than 7,500 websites.

C. Lee Giles, David Reese professor of Information Sciences and Technology at Penn State, who led the research team which developed the BotSeer search engine for the study, described the pro-Google bias as "surprising".

"We expected that 'robots.txt' files would treat all search engines equally, or maybe disfavour certain obnoxious bots," he said.

"So we were surprised to discover a strong correlation between the favoured robots and search engine market share."

'Robots.txt' files are not an official standard but, by informal agreement, regulate web-crawlers, also known as 'spiders' and 'bots', which mine the web continuously.

Web policy makers use the files found in a website's directory to restrict crawler access to non-public information.

'Robots.txt' files also are used to reduce server load which can result in denial of service and force a website to shut down. But some web policy makers and administrators are writing 'robots.txt' files which are not uniformly blocking access.

Instead, those files give access to Google, Yahoo and MSN while restricting other search engines, the researchers found.

While the study does not include explanations for why web policy makers have opted to favour Google, the researchers know that the choice was made consciously. Not using a 'robots.txt' file gives all robots equal access to a website.

"'Robots.txt' files are written by web policy makers and administrators who have to intentionally specify Google as the favoured search engine," said Professor Giles.

Not every site has a 'robots.txt' file, although the number is growing. About four in 10 of the 7,500 sites analysed by the researchers had such a file, up from fewer than one in 10 in 1996.

Got a news tip for our journalists? Share it with us anonymously here.

Tags:

bots get google red software the treatment

Partner Content

Partner Content ElasticON Sydney 2025: Deriving value from your data with Search AI

Partner Content Australian organisations must act on security – or risk AI ambitions falling flat

Partner Content Logicalis APAC CIO Report: The CIO’s 2025 Mandate

Partner Content Machine identity a key priority for organisations’ security strategies: CyberArk

Microsoft launches 'superintelligence' team

In pictures: The 2025 iTnews Benchmark Security Awards winners

NAB hits milestone with tech role insourcing

Microsoft in damage control over Copilot bundling bungle

Australia and US impose sanctions on North Korean cyber ops

Google bots get the red carpet treatment

Webmasters who control automated web-crawler access to their sites using 'robots.txt' files have a bias that favours Google over other search engines, according to new research..

Partner Content

Sponsored Whitepapers

Events

Most Read Articles

NSW Office for AI appoints its first director, looks for 13 more staff

Palantir sues engineers who left to form 'copycat' AI firm

Services Australia to "uplift" child support online account platform

Microsoft in damage control over Copilot bundling bungle

Most popular tech stories

ABC drops Salesforce for Braze

Westpac Intelligence Layer breaks cover

Suncorp creates a "clear execution roadmap" for agentic AI

Qantas' digital and customer head steps down

Coles to transform finance as 'cloud ERP' program evolves

HamiltonJet partners with digital services provider Fortude

SentinelOne signs distribution agreement with Sektor

Rapid7’s new SIEM combines exposure management with threat detection

The techpartner.news podcast, episode 3: Why security consultancy founder Kat McCrabb started with the hard stuff

Bluechip Infotech enters final stage of Goodson Imports acquisition

Blackberry celebrates "giant step forward"

Photos: Australian industry explores data for net zero

Telstra Purple acquires IoT specialists Alliance Automation, Aqura Technologies

'Touch-free' smartphone controlled with head movements

Photos: The 2024 IoT Awards winners

Microsoft launches 'superintelligence' team

In pictures: The 2025 iTnews Benchmark Security Awards winners

NAB hits milestone with tech role insourcing

Microsoft in damage control over Copilot bundling bungle

Australia and US impose sanctions on North Korean cyber ops

Google bots get the red carpet treatment

Webmasters who control automated web-crawler access to their sites using 'robots.txt' files have a bias that favours Google over other search engines, according to new research..

Partner Content

Sponsored Whitepapers

Events

Most Read Articles

NSW Office for AI appoints its first director, looks for 13 more staff

Palantir sues engineers who left to form 'copycat' AI firm

Services Australia to "uplift" child support online account platform

Microsoft in damage control over Copilot bundling bungle

Most popular tech stories

ABC drops Salesforce for Braze

Westpac Intelligence Layer breaks cover

Suncorp creates a "clear execution roadmap" for agentic AI

Qantas' digital and customer head steps down

Coles to transform finance as 'cloud ERP' program evolves

HamiltonJet partners with digital services provider Fortude

SentinelOne signs distribution agreement with Sektor

Rapid7’s new SIEM combines exposure management with threat detection

The techpartner.news podcast, episode 3: Why security consultancy founder Kat McCrabb started with the hard stuff

Bluechip Infotech enters final stage of Goodson Imports acquisition

Blackberry celebrates "giant step forward"

Photos: Australian industry explores data for net zero

Telstra Purple acquires IoT specialists Alliance Automation, Aqura Technologies

'Touch-free' smartphone controlled with head movements

Photos: The 2024 IoT Awards winners

Log In