This is a fantastic presentation that takes you through all sorts of commonly used algorithms for text mining and for processing web data. It presents concepts and methods from machine learning, and there are even some little stick men, so it’s gotta be good.
Who is this for? Those of you who are interested in understanding how text and language can be manipulated and abstracted by computers. It will help you understand far more about how machines deal with language and why we use particular techniques. If this were a sensationalist blog, the title would be “Understand Google better!” or something like that.