Ross Malaga (Professor in information systems at Montclair State University) wrote an article for the ACM in December 08 about what the worst practises in SEO were and which ones got you banned from Google. I am sure many of you SEO’s will have plenty to say about this. The article is an A[...]
Posts Tagged ‘Search engines’
Document clustering – a short intro
Clustering is super important in all systems that deal with any kind of information. In information retrieval systems like digital libraries and search engines they are used to group the documents into clusters. These are all documents that share similarities. This can get really really compl[...]
Semantic method for keyword research
The paper “Keyword Generation for Search Engine Advertising using Semantic Similarity between Terms” by Vibhanshu Abhishek, Kartik Hosanagar (The Wharton School Philadelphia), was presented at ICEC’07. That conference will be of particular interest to online marketing professionals[...]
The impact of SEO on the online advertising market
This paper written by BO Xing and Zhangxi Lin from the Texas Tech University in 2006 discusses the impact of SEO online. The study is conducted in an analytical way, using a number of good resources but has at times a simplistic view of the SEO effort. SEO’s are considered to be of “para[...]
The semantic web is not research as usual
Frank van Harmelen (Vrije Universiteit Amsterdam) did a nice lecture called “Where Does It Break? Or: Why the Semantic Web is Not Just ‘Research as Usual’” – I think that there is a lot of confusion about what the semantic web is and how complicated the entire thing is.[...]
Why writing a search engine is hard
Anna Patterson, research Associate to the formal reasoning group at Stanford and ex-Googler, also head lady at the Cuil search engine explains why writing a search engine is hard at the ACM queue. Some main points: Building good search engines has never been done in a big group but in teams of 1 to [...]
Google Tech talk
On the Google channel on YouTube you’ll find a tech talk called “Knowledge-based Information Retrieval with Wikipedia” from October 31st 2008. It covers the limitations of search engines today. Documents and queries aren’t really understood at all, because they’re sti[...]
Blogosphere vs Web – ranking issues
I came across a very cool paper from SIGKDD 2008 called “Blogosphere: Research Issues, Tools, and Applications” by Nitin Agarwal and Huan Liu from the University of Arizona. It’s an easy but long read, for the geek, but can also be quite happily understood by the layman. I̵[...]
Using link structure to fight webspam
We are all familiar with webspam but not a lot of people know that it is a classification problem in computing. It’s hard to get all of the features right, and difficult to find an efficient classifier. There’s an interesting paper called “Improving Web Spam Classifiers Using [...]

