All the latest UK technology news, reviews and analysis

Search engines cleared of bias favouring big sites

by Robert Jaques

09 Aug 2006

Be the first to comment

  • Tweet this

Search engines are not biased towards popular websites, and may even be egalitarian in the way they direct traffic, computer scientists claimed today.

An Indiana University (IU) School of Informatics study, entitled 'Topical Interests and the Mitigation of Search Engine Bias', challenges the view of a web-dominating 'Googlearchy' in which search engines are accused of pushing all internet traffic to established, mainstream websites.

"Empirical data do not support the idea of a vicious cycle amplifying the rich-get-richer dynamic of the web," said Filippo Menczer, associate professor of informatics and computer science at Indiana University School of Informatics.

"Our study demonstrates that popular sites receive on average far less traffic than predicted by the Googlearchy theory and that the playing field is more even."

Menczer was joined in the study by IU post-doctoral fellow Santo Fortunato; Alessandro Flammini, assistant professor of informatics; and Alessandro Vespignani, professor of informatics.

The IU team aimed to collect empirical data from various search engines. In one scenario, users browsed the web using only random links. In another, users visited only pages returned by the search engines. The researchers also studied the way in which search engines have influenced the web's evolution.

"A simple ranking mechanism provides an elegant model to understand the genesis of a broad class of complex systems, including social and technological networks such as the internet and the world wide web," Fortunato said. "These networks possess a peculiar 'long-tail' TM structure in which a few nodes attract a great majority of connections."

The long tail structure of the web is commonly explained through rich-get-richer models that require knowledge of the prestige of each node in the network. However, those who create and link web pages may not know the prestige values of target pages.

In another study, Scale-Free Network Growth by Ranking, Menczer, Fortunato, and Flammini claim that for a search engine to give rise to a long tail network, it must simply sort nodes according to any prestige measure, even if the exact values are unknown. If new nodes are linked to old ones according to their ranking order, a long tail emerges.

"By sorting results, search engines give us a simple mechanism to interpret how the web grows and how traffic is distributed among websites," said Menczer.

Do you agree?

 

Add your comment

We won't publish your address
By submitting a comment you agree to abide by our Terms & Conditions. Your comment will be moderated before publication.

Poll

IT priorities for 2012

What is the most important IT priority for your company this year?

99%

0%

1%

0%

0%

Connect with V3.co.uk

Sign up to our daily or weekly newsletters

Accurev

Top 5 software development challenges

This paper focuses on a series of best practices and techniques for development teams looking to improve their software development processes

Talend

Rubbish in, rubbish enterprise

Why good data management at all levels is essential in the modern business (video, 6mins)

Systems Analyst - Project Lead - Chelmsford - £50k-55K+Bens

Systems Analyst - Project Lead - Chelmsford, Essex...

Windows Systems Engineer (Windows Log File, Syslog) learn SIEM

Windows Systems Engineer (Windows Log File, Syslog) learn...

PHP Developer - Zend, MVC

Role: MVC PHP Developer Location: London, Central...

Senior Web Developer / Engineer (HTML, JavaScript, CSS)

Title: Senior Web Developer / Engineer (HTML, JavaScript...

To send to more than one email address, simply separate each address with a comma.