Whilst I go through the monumental task of doing all my thesis corrections and finishing my PhD (finally), I don’t have much time for blogging and writing at length, especially since I also work a full-time job! As a result, I’ve decided to use Tumblr to share thoughts and snippets with[...]
Archive for the ‘Uncategorized’ Category
TGIF – o[-<]:
Welcome to another edition of TGIF. I trust that you all had a wonderful week, whether at work or on holiday somewhere. For those who like listening to music while working, I’ve decided to add a little Friday tune to the TGIF post as of today! You may all whoop and clap with excitement[...]
Yahoo! Research on the healthy Sem Web
Ricardo Baeza-Yates, Peter Mika and Hugo Zaragoza from Yahoo! Research wrote a really insightful, meaningful and down-to-earth article called “Search, Web 2.0, and the semantic web”. It is a response to all of the buzz around those topics. If anyone can give a straight answer on thes[...]
Science for Seo is back
Hi all, due to some annoying complications, as many of you have noticed, this blog has been down for 4 days. I’m sorry about that, and am quite certain that this won’t be happening again! Thanks for all the messages from all you brilliant people! cj
What is an algorithm?
Let’s get right back down to basics and look at what an algorithm is. There are several different types, and it can all get a little bit confusing sometimes. An algorithm can be very complicated, and it can also be simple. If you intend to make complicated ones, then you’ll need to delve in[...]
Effective Query Log Anonymization
Check out this very good Google tech talk about using query logs: “User search query logs have proven to be very useful, but have vast potential for misuse. Several incidents have shown that simple removal of identifiers is insufficient to protect the identity of users. Publishing such inadequ[...]
The RankMass crawler
The paper entitled “RankMass Crawler: A Crawler with High Personalized PageRank Coverage Guarantee” (Cho, Schonfeld, University of California) deals with the important topic of how many pages should be collected to cover most of the web, and how to ensure that important documents are [...]

