PageRank fails on quality – proved again

Information retrieval (IR) has long belonged to the realm of digital libraries. Since the search engines arrived, IR is often associated with web search instead, which borrows a lot of technology and methods from digital libraries anyway.

Some experts in digital libraries, Michael L. Nelson, Martin Klein, and Manoranjan Magudamudi, did an interesting evaluation comparing expert rankings to search engine rankings. The paper is called “Correlation of Expert and Search Engine Rankings”, and it was released 21st October 2008.
Expert ranking means that experts produce the rankings, rather than it being an automated machine task. They chose good examples to test on: lists from ARWU, IMDB, Billboard, ATP, Fortune, Money, US News, and WTA.
Their question is “Does authority mean quality?” and the answer is “although authority means quality, quality does not necessarily mean authority”.
“US News & World Report publishes a list of (among others) top 50 graduate business schools. [...] To answer this question we conducted 9 experiments using 8 expert rankings on a range of academic, athletic, financial and popular culture topics. We compared the expert rankings with the rankings in Google, Live Search (formerly MSN) and Yahoo (with list lengths of 10, 25, and 50). In 57 search engine vs. expert comparisons, only 1 strong and 4 moderate correlations were statistically significant. In 42 inter-search engine comparisons, only 2 strong and 4 moderate correlations were statistically significant. The correlations appeared to decrease with the size of the lists: the 3 strong correlations were for lists of 10, the 8 moderate correlations were for lists of 25, and no correlations were found for lists of 50.”
Interestingly, they state that if a webpage doesn’t rank in the first few pages, it’s as if it doesn’t exist. I think this is true of search engine rankings, but I know a lot of blogs with low rankings that are popular through word of mouth and social networks. Jill is right, rankings really aren’t the be-all and end-all.
“We then created a program that will create an ordinal ranking of the URLs in a SE independent of any keyword query. We then used Kendall’s Tau (τ) to test for statistically significant (p < …) [...]” The correlations were banded as moderate (0.40 < τ ≤ 0.60) or strong (0.60 < τ ≤ 0.80).
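To make the comparison concrete, here is a minimal sketch of Kendall’s Tau between an expert list and a search engine list, using the 0.40 / 0.60 / 0.80 band edges that appear in the quote above. It is not the authors’ program: the function names and the sample lists are made up for illustration, it assumes both lists rank exactly the same items with no ties, and it skips the significance test entirely.

```python
from itertools import combinations


def kendall_tau(rank_a, rank_b):
    """Kendall's tau-a for two orderings of the same items (no ties assumed)."""
    if set(rank_a) != set(rank_b) or len(rank_a) < 2:
        raise ValueError("rankings must cover the same items (at least two)")
    # Position of every item in the second ranking, for quick lookup.
    pos_b = {item: i for i, item in enumerate(rank_b)}
    concordant = discordant = 0
    # combinations() yields pairs (x, y) where x is ahead of y in rank_a.
    for x, y in combinations(rank_a, 2):
        if pos_b[x] < pos_b[y]:
            concordant += 1  # rank_b agrees on the order of this pair
        else:
            discordant += 1  # rank_b reverses this pair
    n_pairs = len(rank_a) * (len(rank_a) - 1) // 2
    return (concordant - discordant) / n_pairs


def strength(tau):
    """Label tau using the moderate and strong bands quoted in the post."""
    if 0.60 < tau <= 0.80:
        return "strong"
    if 0.40 < tau <= 0.60:
        return "moderate"
    return "outside the two bands"


# Hypothetical data: an expert list and the order a search engine gave.
expert = ["A", "B", "C", "D", "E"]
engine = ["A", "C", "B", "E", "D"]
tau = kendall_tau(expert, engine)
print(tau, strength(tau))
```

Two swapped pairs out of ten are enough to pull a five-item list down into the moderate band, which gives a feel for how demanding the measure is on short lists.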
They found that the bigger the list, the fewer the correlations, and overall they found very few. They say that PageRank showed its limitations because it’s a conventional hyperlink-based method that doesn’t take quality scores into account. They also note that Cho and Baeza-Yates found that PageRank was biased against new pages, even if they were of the highest quality.
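The bias against new pages is easy to see in a toy version of the textbook PageRank power iteration. This is only a sketch of the simplified published formula, not what any search engine actually runs, and the page names, the tiny link graph and the 0.85 damping factor are all assumptions chosen for illustration. Notice that nothing in the calculation knows about quality: a page’s score comes only from who links to it.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration.

    links maps each page to the list of pages it links to; every link
    target must also appear as a key.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the small "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in links.items():
            if outlinks:
                # Split this page's damped rank evenly over its outlinks.
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page with no outlinks spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical web: three established pages that link to each other, and
# one brand new page that links out but has no inbound links yet.
web = {
    "old_a": ["old_b", "old_c"],
    "old_b": ["old_a", "old_c"],
    "old_c": ["old_a"],
    "new_page": ["old_a"],
}
for page, score in sorted(pagerank(web).items(), key=lambda kv: -kv[1]):
    print(page, round(score, 4))
```

However good new_page is, it stays on the minimum score until somebody links to it, which is the kind of limitation the authors describe.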

© 2009-2010 Science for SEO All Rights Reserved
